Commit Graph

10 Commits

Author SHA1 Message Date
Colin 122cbd9ff8 Use local tokenizer. 2024-02-24 14:14:12 +08:00
Colin ac61c4d925 set use local dataset. 2024-02-24 13:44:22 +08:00
Colin 087366c59b init gpt train without download. 2024-02-24 13:40:39 +08:00
Colin b992ae99fa Train on wiki data. 2024-02-24 12:06:30 +08:00
Colin 7d16743184 enable pretrain. 2024-02-22 15:03:32 +08:00
yiqing-zhou b76d333f39 [code] formatter-caused changes 2023-05-28 20:02:56 +08:00
Yiqing-Zhou 5e6b747baf [fix] add patch to fix DeepSpeedStrategy offload 'zero_force_ds_cpu_optimizer' issue 2023-05-09 23:00:28 +08:00
Yiqing-Zhou 3f92bbbaa2 [feature] new args learning_rate max_epochs 2023-05-09 00:35:28 +08:00
Yiqing-Zhou 70ff2acaf0 [fix] add patch to fix FSDPStrategy checkpoint issue 2023-05-07 16:51:57 +08:00
Yiqing-Zhou 09507449f7 [code] refactor 2023-05-07 13:01:02 +08:00