Files
Varuna Jayasiri 194adc6d68 stochastic deptch
2021-06-07 15:08:55 +05:30
..
2021-06-07 14:57:54 +05:30
2021-06-07 15:08:55 +05:30
2021-06-07 14:57:54 +05:30

Pay Attention to MLPs (gMLP)

This is a PyTorch implementation of the paper Pay Attention to MLPs.

This paper introduces a Multilayer Perceptron (MLP) based architecture with gating, which they name gMLP. It consists of a stack of L gMLP blocks.

Here is the training code for a gMLP model based autoregressive model.

View Run