Files
Varuna Jayasiri 996b58be04 paper links
2021-08-17 14:12:33 +05:30
..
2021-08-17 14:12:33 +05:30
2021-06-02 21:36:47 +05:30
2021-08-17 14:12:33 +05:30

FNet: Mixing Tokens with Fourier Transforms

This is a PyTorch implementation of the paper FNet: Mixing Tokens with Fourier Transforms.

This paper replaces the self-attention layer with two Fourier transforms to mix tokens. This is a 7X more efficient than self-attention. The accuracy loss of using this over self-attention is about 92% for BERT on GLUE benchmark.