	links
@@ -15,12 +15,9 @@ implementations.

#### ✨ [Transformers](transformers/index.html)

[Transformers module](transformers/index.html)
contains implementations for
[multi-headed attention](transformers/mha.html)
and
[relative multi-headed attention](transformers/relative_mha.html).

* [Multi-headed attention](transformers/mha.html)
* [Transformer building blocks](transformers/models.html)
* [Relative multi-headed attention](transformers/xl/relative_mha.html)
* [GPT Architecture](transformers/gpt/index.html)
* [GLU Variants](transformers/glu_variants/simple.html)
* [kNN-LM: Generalization through Memorization](transformers/knn/index.html)

@@ -14,7 +14,7 @@ from paper [Attention Is All You Need](https://arxiv.org/abs/1706.03762),
and derivatives and enhancements of it.

* [Multi-head attention](mha.html)
* [Relative multi-head attention](relative_mha.html)
* [Relative multi-head attention](xl/relative_mha.html)
* [Transformer Encoder and Decoder Models](models.html)
* [Fixed positional encoding](positional_encoding.html)

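For context, the multi-head attention page these links point to covers the standard mechanism from Attention Is All You Need. Below is a minimal PyTorch sketch of that technique only; it is not the repository's annotated implementation, and the names (`MultiHeadAttention`, `d_model`, `n_heads`) are illustrative rather than taken from the linked modules.

```python
# Minimal sketch of standard multi-head self-attention
# (scaled dot-product attention over several heads).
import math

import torch
import torch.nn as nn


class MultiHeadAttention(nn.Module):
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        assert d_model % n_heads == 0
        self.d_k = d_model // n_heads  # per-head query/key/value size
        self.n_heads = n_heads
        # Linear projections for queries, keys, values, and the output
        self.w_q = nn.Linear(d_model, d_model)
        self.w_k = nn.Linear(d_model, d_model)
        self.w_v = nn.Linear(d_model, d_model)
        self.w_o = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: [batch, seq_len, d_model]
        batch, seq_len, _ = x.shape
        # Project and split into heads: [batch, n_heads, seq_len, d_k]
        q = self.w_q(x).view(batch, seq_len, self.n_heads, self.d_k).transpose(1, 2)
        k = self.w_k(x).view(batch, seq_len, self.n_heads, self.d_k).transpose(1, 2)
        v = self.w_v(x).view(batch, seq_len, self.n_heads, self.d_k).transpose(1, 2)
        # Scaled dot-product attention
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_k)
        attn = scores.softmax(dim=-1)
        out = attn @ v  # [batch, n_heads, seq_len, d_k]
        # Merge heads back to [batch, seq_len, d_model] and project
        out = out.transpose(1, 2).contiguous().view(batch, seq_len, -1)
        return self.w_o(out)


# Example: self-attention over 2 sequences of length 5 with d_model=64
mha = MultiHeadAttention(d_model=64, n_heads=8)
y = mha(torch.randn(2, 5, 64))
print(y.shape)  # torch.Size([2, 5, 64])
```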
Varuna Jayasiri