|
9262c57f18
|
flash attention
|
2025-08-08 19:57:57 +05:30 |
|
|
5731bff586
|
LoRA docs
|
2024-08-24 10:50:02 +05:30 |
|
|
cf565bcc1d
|
cleanup
|
2024-06-18 11:09:02 +05:30 |
|
|
a0679ecd90
|
title
|
2024-01-12 13:21:54 +05:30 |
|
|
4135eda943
|
title
|
2024-01-12 13:15:04 +05:30 |
|
|
0101618de6
|
links
|
2023-07-14 21:27:44 +05:30 |
|
|
ce0cdb676f
|
ja translation
|
2023-06-30 16:03:07 +05:30 |
|
|
d4c3335525
|
link to translations
|
2023-04-02 14:03:23 +05:30 |
|
|
97e53c0f3d
|
fix glu variants links
|
2023-04-02 12:00:23 +05:30 |
|
|
4c47086b0f
|
links
|
2022-09-15 14:55:48 +05:30 |
|
|
db4dd4905b
|
links
|
2022-08-20 13:41:51 +05:30 |
|
|
980a84ed4f
|
Zero3 memory optimizations (#140)
|
2022-08-11 15:44:13 +05:30 |
|
|
4cf1d74e6d
|
sampling links
|
2022-08-08 12:27:11 +05:30 |
|
|
948e436854
|
U-Net (#137)
|
2022-07-25 16:55:06 +05:30 |
|
|
72669d0526
|
ALiBi (#134)
|
2022-07-17 09:28:32 +05:30 |
|
|
6a41c82b30
|
FTA (#115)
|
2022-05-23 22:26:39 +05:30 |
|
|
bf0c4e36d7
|
highlighted paper links
|
2022-05-03 09:30:50 +01:00 |
|
|
0aff72f970
|
DeepNorm (#114)
|
2022-04-10 08:08:55 +05:30 |
|
|
1536c6ec5e
|
links
|
2022-03-12 15:51:10 +05:30 |
|
|
d747e463a4
|
rope links
|
2022-02-23 15:10:49 +05:30 |
|
|
c1a1556c6e
|
url fix
|
2021-11-06 14:36:50 +05:30 |
|
|
3a54a2099d
|
links
|
2021-11-06 14:29:38 +05:30 |
|
|
8d7b7c1730
|
light
|
2021-10-16 14:34:37 +05:30 |
|
|
72a197542c
|
dark photo
|
2021-10-16 14:33:37 +05:30 |
|
|
6464269933
|
dqn images
|
2021-10-16 14:30:41 +05:30 |
|
|
f8abd6a276
|
Conv mixer (#100)
|
2021-10-14 18:41:37 +05:30 |
|
|
e2be5ddf35
|
Denoising Diffusion Probabilistic Models (#98)
|
2021-10-08 21:33:04 +05:30 |
|
|
58cda113e0
|
primer links
|
2021-09-21 16:06:34 +05:30 |
|
|
b6607524b8
|
Evidential Deep Learning to Quantify Classification Uncertainty (#85)
|
2021-08-21 10:25:32 +05:30 |
|
|
9bef456004
|
PonderNet (#76)
|
2021-08-12 15:45:01 +05:30 |
|
|
e38f9af968
|
repo name
|
2021-08-08 08:32:39 +05:30 |
|
|
671a93c299
|
GATv2 refactoring (#70)
* fixed link, add clarification
* updated dropout + experiment link
|
2021-07-26 13:52:54 +05:30 |
|
|
800df5c40c
|
GATv2 (#69)
|
2021-07-25 19:30:59 +05:30 |
|
|
f038ab673d
|
vit
|
2021-07-17 15:24:17 +05:30 |
|
|
cff11612a0
|
readme
|
2021-07-16 08:46:12 +05:30 |
|
|
901a74411d
|
GAT (#67)
|
2021-07-08 18:24:34 +05:30 |
|
|
1a9f15eebb
|
Distillation (#65)
|
2021-07-03 14:01:17 +05:30 |
|
|
e2bbc63001
|
links
|
2021-06-21 18:35:22 +05:30 |
|
|
07065dea92
|
CFR (#60)
|
2021-06-21 17:04:20 +05:30 |
|
|
f6e430a9b3
|
MLP Mixer (#59)
|
2021-06-16 09:36:13 +05:30 |
|
|
e69a12c78c
|
gMLP (#57)
|
2021-06-07 14:57:54 +05:30 |
|
|
ebce404402
|
Masked Language Model (#56)
|
2021-06-06 15:12:11 +05:30 |
|
|
ab6d729bb3
|
AFT (#54)
|
2021-06-02 21:36:47 +05:30 |
|
|
7f05ed043b
|
FNet (#53)
|
2021-05-26 10:56:42 +05:30 |
|
|
d449497222
|
readme
|
2021-05-21 15:05:02 +05:30 |
|
|
27da7005a4
|
gp links
|
2021-05-09 15:42:46 +05:30 |
|
|
1c4fb26cde
|
title
|
2021-05-07 18:29:03 +05:30 |
|
|
cb6f63f2a1
|
readme
|
2021-05-07 16:45:38 +05:30 |
|
|
b710f5d295
|
link fix
|
2021-04-28 12:10:13 +05:30 |
|
|
9068106f5d
|
links
|
2021-04-28 10:47:05 +05:30 |
|