This commit is contained in:
Varuna Jayasiri
2021-11-06 14:29:38 +05:30
parent 95bc3544b2
commit 3a54a2099d
5 changed files with 15 additions and 5 deletions

View File

@ -36,6 +36,7 @@ implementations.
* [Pay Attention to MLPs (gMLP)](transformers/gmlp/index.html)
* [Vision Transformer (ViT)](transformers/vit/index.html)
* [Primer EZ](transformers/primer_ez/index.html)
* [Hourglass](transformers/hourglass/index.html)
#### ✨ [Recurrent Highway Networks](recurrent_highway_networks/index.html)

View File

@ -93,6 +93,11 @@ This is an implementation of the paper
This is an implementation of the paper
[Primer: Searching for Efficient Transformers for Language Modeling](https://papers.labml.ai/paper/2109.08668).
## [Hourglass](hour_glass/index.html)
This is an implementation of the paper
[Hierarchical Transformers Are More Efficient Language Models](https://papers.labml.ai/paper/2110.13711)
"""
from .configs import TransformerConfigs