rope links

Varuna Jayasiri
2022-02-23 15:10:49 +05:30
parent cd03271fb6
commit d747e463a4
3 changed files with 5 additions and 0 deletions

@@ -22,6 +22,7 @@ implementations.
* [Transformer building blocks](transformers/models.html)
* [Transformer XL](transformers/xl/index.html)
* [Relative multi-headed attention](transformers/xl/relative_mha.html)
* [Rotary Positional Embeddings](transformers/rope/index.html)
* [Compressive Transformer](transformers/compressive/index.html)
* [GPT Architecture](transformers/gpt/index.html)
* [GLU Variants](transformers/glu_variants/simple.html)

@@ -22,6 +22,9 @@ and derivatives and enhancements of it.
This implements Transformer XL model using
[relative multi-head attention](xl/relative_mha.html)
## [Rotary Positional Embeddings](rope/index.html)
This implements Rotary Positional Embeddings (RoPE)
## [Compressive Transformer](compressive/index.html)
This is an implementation of compressive transformer

@@ -24,6 +24,7 @@ implementations almost weekly.
* [Transformer building blocks](https://nn.labml.ai/transformers/models.html)
* [Transformer XL](https://nn.labml.ai/transformers/xl/index.html)
* [Relative multi-headed attention](https://nn.labml.ai/transformers/xl/relative_mha.html)
* [Rotary Positional Embeddings](https://nn.labml.ai/transformers/rope/index.html)
* [Compressive Transformer](https://nn.labml.ai/transformers/compressive/index.html)
* [GPT Architecture](https://nn.labml.ai/transformers/gpt/index.html)
* [GLU Variants](https://nn.labml.ai/transformers/glu_variants/simple.html)