
FNet: Mixing Tokens with Fourier Transforms

This is a PyTorch implementation of the paper FNet: Mixing Tokens with Fourier Transforms.

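This paper replaces the self-attention layer with two Fourier transforms to mix tokens: one along the hidden dimension and one along the sequence dimension. This is about 7X more efficient than self-attention. The cost is some accuracy: with this in place of self-attention, the model reaches about 92% of BERT's accuracy on the GLUE benchmark.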
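
The token-mixing step described above can be sketched in a few lines of PyTorch. This is a minimal illustration, not necessarily the code in this repository: the class name `FourierMixing` and the `[seq_len, batch_size, d_model]` tensor layout are assumptions made for the example. It applies one Fourier transform along the hidden dimension, a second along the sequence dimension, and keeps only the real part.

```python
import torch
from torch import nn


class FourierMixing(nn.Module):
    """Parameter-free token mixing (hypothetical name, for illustration)."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Assumed layout: x has shape [seq_len, batch_size, d_model]
        # Fourier transform along the hidden (embedding) dimension
        fft_hidden = torch.fft.fft(x, dim=2)
        # Fourier transform along the sequence dimension
        fft_seq = torch.fft.fft(fft_hidden, dim=0)
        # Keep the real part so the output is a real-valued tensor
        return torch.real(fft_seq)


mixer = FourierMixing()
x = torch.randn(8, 2, 16)  # 8 tokens, batch of 2, 16-dimensional embeddings
y = mixer(x)               # same shape as x
```

The module has no learnable parameters, so it can stand in for the attention sub-layer of a transformer block while the feed-forward sub-layer, residual connections, and layer normalization stay as they are.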