Presentation Information

[D-4-01]Light weight Transformer with nonlinear attention

○Riku Matsumoto1, Masaomi Kimura1 (1. Shibaura Institute of Technology)

Keywords:

Natural language processing,Neural networks,Transformer