25 Commits

Author SHA1 Message Date
7db6e92376 RWKV (#222) 2024-03-17 17:36:15 +05:30
    * rwkv-init
    * annotations
    * Re-added docs
    * make dir if not exist
    * Add RWKV paper and update doc index
    * add train loop
    * experiment

    ---------

    Co-authored-by: Jacob Hatef <hatef.4@buckeyemail.buckeyemail.osu.edu>
    Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
e4747dee6d flash attention pdf 2022-07-05 15:15:30 +05:30
1c3992f949 OPT: Open Pre-trained Transformer Language Models 2022-05-04 11:11:25 +01:00
4d4cbefe3e Autoregressive Search Engines Highlighted paper (#121) 2022-04-27 20:31:57 +05:30
ce35cda7e0 Trillion Parameter models highlighted paper (#119) 2022-04-26 01:01:26 +05:30
4c908da357 Adding PDF (#116) 2022-04-19 13:27:00 +05:30
eddf936b3a palm paper 2022-04-15 09:49:45 +01:00
243a581b42 dall-e 2 highlighted pdf 2022-04-12 17:10:21 +01:00
3aaae6ae93 star paper 2022-03-31 15:50:46 +05:30
e88904731f retro paper 2021-12-28 15:59:08 +05:30
4fd9a102a3 nerf pdf 2021-12-07 17:24:22 +05:30
706e7397ea reptile paper 2021-11-16 15:16:08 +05:30
7d80c2716e transformer paper 2021-10-30 12:19:27 +05:30
e309638fea ddpm highlighted paper 2021-10-07 16:08:23 +05:30
ea5439697d primer paper 2021-09-29 19:40:20 +05:30
f81a88fc38 clip paper 2021-09-23 19:02:28 +05:30
a8b8e48c8a The Sensory Neuron as a Transformer highlighted paper 2021-09-18 07:25:39 +05:30
c3ad4bb514 meta-gradients paper 2021-09-08 11:57:10 +05:30
e2516cc306 google maps eta highlighted paper 2021-09-06 13:31:24 +05:30
ff0d5c065d ponder net highlighted paper 2021-08-15 16:58:18 +05:30
79505c4a89 muzero highlighted paper 2021-07-26 19:13:37 +05:30
bd1523e85c paper 2021-07-23 14:10:07 +05:30
e81f44b883 paper pdf 2021-07-17 23:17:42 +05:30
f0bf8d39e4 resnet annotated paper 2021-07-16 08:56:53 +05:30
189770da92 highlighted paper 2021-07-03 14:04:11 +05:30