learning/annotated_deep_learning_paper_implementations

mirror of https://github.com/labmlai/annotated_deep_learning_paper_implementations.git synced 2025-10-31 18:58:43 +08:00

Files

History

Varuna Jayasiri 9a42ac2697 arxiv.org links

2023-10-24 14:42:32 +01:00

..

__init__.py

arxiv.org links

2023-10-24 14:42:32 +01:00

experiment.py

remove app.labml.ai links

2023-04-02 12:10:18 +05:30

readme.md

arxiv.org links

2023-10-24 14:42:32 +01:00

readme.md

Pay Attention to MLPs (gMLP)

This is a PyTorch implementation of the paper Pay Attention to MLPs.

This paper introduces a Multilayer Perceptron (MLP) based architecture with gating, which they name gMLP. It consists of a stack of L gMLP blocks.

Here is the training code for a gMLP model based autoregressive model.