24 Commits

SHA1 Message Date
cf565bcc1d cleanup 2024-06-18 11:09:02 +05:30
9a42ac2697 arxiv.org links 2023-10-24 14:42:32 +01:00
594b525e9d rm comet 2022-09-07 09:31:38 +05:30
b6bef1d2fe cleanup 2022-07-02 14:31:16 +05:30
ab4264cbda comet links fix 2022-07-02 14:25:27 +05:30
ee5a34aa59 experiment links transformer 2022-06-28 19:02:20 +05:30
dc4da2106b headings 2021-10-21 15:15:05 +05:30
c5b13162cf anchors 2021-10-21 15:01:58 +05:30
996b58be04 paper links 2021-08-17 14:12:33 +05:30
ab6d729bb3 AFT (#54) 2021-06-02 21:36:47 +05:30
d3790d708b use forward 2021-02-02 10:22:22 +05:30
88d0f89ef5 ✍️ english 2021-02-01 07:33:01 +05:30
75069d83e3 ffn glu variants configs & 🐛 gated ffn 2021-01-25 09:44:38 +05:30
0bf8951258 FFN ready for GLU 2021-01-24 09:31:53 +05:30
9a50322211 📚 switch transformer notes 2021-01-20 10:58:57 +05:30
5d174f3a7c 📚 switch transformer notes 2021-01-20 10:14:21 +05:30
4c769128cb autoregression trainer notes 2021-01-14 08:46:41 +05:30
774a72e0bf gpt 2021-01-13 12:54:43 +05:30
1f75f42fb2 yaml configs 2020-12-10 10:05:06 +05:30
1dd3840e22 autoregression transformer 2020-11-06 10:54:23 +05:30
6b4b9b2e39 titles 2020-10-23 15:06:55 +05:30
1b0f8944bd spelling 2020-09-05 13:25:27 +05:30
dac710bc2f model & positional encodings annotations 2020-09-05 11:39:09 +05:30
89ca5604be ♻️ models and cofigs 2020-09-04 14:01:29 +05:30