Pay Attention to MLPs (gMLP)
This is a PyTorch implementation of the paper Pay Attention to MLPs.
The paper introduces a Multilayer Perceptron (MLP) based architecture with gating,
which the authors name gMLP. It consists of a stack of L gMLP blocks.
Here is the training code for a gMLP-based autoregressive model.
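
To make the "stack of L gMLP blocks" concrete, here is a minimal sketch of one way such a stack could be written in PyTorch. It is not the code from this repository. The class names (SpatialGatingUnit, GMLPBlock, GMLPStack), the parameter names (d_model, d_ffn, seq_len, n_layers), the batch-first tensor layout, and the initialization range are all assumptions made for illustration. The gating follows the spatial gating unit as described in the paper: the hidden activations are split in two along the channel dimension, one half is mixed across sequence positions by a learned matrix, and the result multiplies the other half element-wise.

```python
import torch
import torch.nn as nn


class SpatialGatingUnit(nn.Module):
    """Gate one half of the channels with a projection of the other half across positions."""

    def __init__(self, d_ffn: int, seq_len: int):
        super().__init__()
        # d_ffn must be even so it can be split into two equal halves.
        self.norm = nn.LayerNorm(d_ffn // 2)
        # Assumed init: weights near zero and bias of ones, so the gate starts close to identity.
        self.weight = nn.Parameter(torch.zeros(seq_len, seq_len).uniform_(-0.01, 0.01))
        self.bias = nn.Parameter(torch.ones(seq_len))

    def forward(self, z: torch.Tensor, causal: bool = False) -> torch.Tensor:
        # z: [batch, seq, d_ffn]
        z1, z2 = torch.chunk(z, 2, dim=-1)
        z2 = self.norm(z2)
        n = z.shape[1]
        w = self.weight[:n, :n]
        if causal:
            # Lower-triangular weights: position i only sees positions j <= i.
            w = torch.tril(w)
        # Mix information across sequence positions.
        z2 = torch.einsum('ij,bjd->bid', w, z2) + self.bias[:n, None]
        return z1 * z2


class GMLPBlock(nn.Module):
    """Norm -> channel projection -> activation -> spatial gating -> channel projection, plus residual."""

    def __init__(self, d_model: int, d_ffn: int, seq_len: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.proj_in = nn.Linear(d_model, d_ffn)
        self.act = nn.GELU()
        self.sgu = SpatialGatingUnit(d_ffn, seq_len)
        self.proj_out = nn.Linear(d_ffn // 2, d_model)

    def forward(self, x: torch.Tensor, causal: bool = False) -> torch.Tensor:
        shortcut = x
        z = self.act(self.proj_in(self.norm(x)))
        z = self.sgu(z, causal)
        return self.proj_out(z) + shortcut


class GMLPStack(nn.Module):
    """A stack of L (here n_layers) gMLP blocks."""

    def __init__(self, n_layers: int, d_model: int, d_ffn: int, seq_len: int):
        super().__init__()
        self.blocks = nn.ModuleList(
            [GMLPBlock(d_model, d_ffn, seq_len) for _ in range(n_layers)]
        )

    def forward(self, x: torch.Tensor, causal: bool = False) -> torch.Tensor:
        for block in self.blocks:
            x = block(x, causal)
        return x


if __name__ == "__main__":
    model = GMLPStack(n_layers=4, d_model=64, d_ffn=256, seq_len=32)
    x = torch.randn(2, 32, 64)            # [batch, seq, d_model]
    print(model(x, causal=True).shape)    # same shape as the input
```

For an autoregressive model, passing causal=True masks the position-mixing weights so that each output position depends only on the current and earlier positions. A full language model would additionally need a token embedding before the stack and an output projection after it, which this sketch leaves out.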
