Merge pull request #226 from MrYxJ/patch-1

Fix a typo in the formula of ALiBi.
2025-08-14 09:31:42 +08:00 · 2023-11-17 17:42:17 +00:00
parent 07b5782a48 830161b299
commit 36a374ed76
1 changed file with 1 addition and 1 deletion
--- a/labml_nn/transformers/alibi/__init__.py
+++ b/labml_nn/transformers/alibi/__init__.py
@@ -19,7 +19,7 @@ Here's the attention formula for $i$-th token,

 \begin{align}
 \mathbf{a}_i
-&= \text{softmax} \bigg( \mathbf{q}_i \mathbf{K}^\top + m \cdot \big[-(i-1), \dots, 1, 0 \big] \bigg) \\
+&= \text{softmax} \bigg( \mathbf{q}_i \mathbf{K}^\top + m \cdot \big[-(i-1), \dots, -1, 0 \big] \bigg) \\
 &= \text{softmax} \bigg( \mathbf{q}_i \mathbf{K}^\top + m \cdot \big[0, 1, \dots, (i - 1) \big] \bigg)
 \end{align}
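
The two lines of the corrected formula agree because softmax is unchanged when the same constant is added to every score: the bias $m \cdot [0, 1, \dots, (i - 1)]$ is the bias $m \cdot [-(i-1), \dots, -1, 0]$ plus the constant $m (i - 1)$. A minimal sketch that checks this numerically, assuming plain Python lists in place of tensors; the helper names `softmax`, `alibi_bias_negative` and `alibi_bias_shifted`, the slope `m = 0.25`, and the example scores are hypothetical and not taken from the repository:

```python
import math


def softmax(xs):
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(xs)
    exps = [math.exp(x - peak) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def alibi_bias_negative(i, m):
    # m * [-(i-1), ..., -1, 0]: the form on the first line of the formula.
    return [m * -(i - 1 - j) for j in range(i)]


def alibi_bias_shifted(i, m):
    # m * [0, 1, ..., (i-1)]: the form on the second line of the formula.
    return [m * j for j in range(i)]


# Hypothetical values: the 5th token, slope m = 0.25, arbitrary q_i K^T scores.
i, m = 5, 0.25
scores = [0.3, -1.2, 0.8, 0.0, 1.5]

a_negative = softmax([s + b for s, b in zip(scores, alibi_bias_negative(i, m))])
a_shifted = softmax([s + b for s, b in zip(scores, alibi_bias_shifted(i, m))])

# The two bias vectors differ by the constant m * (i - 1), so the attention
# weights match up to floating-point error.
assert all(math.isclose(x, y, rel_tol=1e-9) for x, y in zip(a_negative, a_shifted))
```

With the pre-fix bias $[-(i-1), \dots, 1, 0]$ the second-to-last entry would have the wrong sign, so the difference between the two vectors would no longer be constant and the equality would not hold.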