Colin
|
087366c59b
|
init gpt train without download.
|
2024-02-24 13:40:39 +08:00 |
Colin
|
b992ae99fa
|
Train on wiki data.
|
2024-02-24 12:06:30 +08:00 |
Colin
|
7d16743184
|
enable pretrain.
|
2024-02-22 15:03:32 +08:00 |
yiqing-zhou
|
b76d333f39
|
[code] formatter-caused changes
|
2023-05-28 20:02:56 +08:00 |
Yiqing-Zhou
|
5e6b747baf
|
[fix] add patch to fix DeepSpeedStrategy offload 'zero_force_ds_cpu_optimizer' issue
|
2023-05-09 23:00:28 +08:00 |
Yiqing-Zhou
|
3f92bbbaa2
|
[feature] new args learning_rate max_epochs
|
2023-05-09 00:35:28 +08:00 |
Yiqing-Zhou
|
70ff2acaf0
|
[fix] add patch to fix FSDPStrategy checkpoint issue
|
2023-05-07 16:51:57 +08:00 |
Yiqing-Zhou
|
09507449f7
|
[code] refactor
|
2023-05-07 13:01:02 +08:00 |