Go to file
Yiqing-Zhou 8a5e2043bb [optimize] map_location='cpu' for load_from_checkpoint 2023-05-09 00:37:52 +08:00
.vscode Initial Commit 2023-05-04 21:52:25 +08:00
.gitignore [code] refactor 2023-05-07 13:01:02 +08:00
LICENSE Create LICENSE 2023-05-08 00:26:38 +08:00
README.md [feature] export model checkpoint from pl.LightningModule 2023-05-07 13:18:58 +08:00
generate.py [code] refactor 2023-05-07 13:01:02 +08:00
lit_export.py [optimize] map_location='cpu' for load_from_checkpoint 2023-05-09 00:37:52 +08:00
lit_module.py [feature] new args learning_rate max_epochs 2023-05-09 00:35:28 +08:00
lit_patches.py [fix] add patch to fix FSDPStrategy checkpoint issue 2023-05-07 16:51:57 +08:00
lit_train.py [feature] new args learning_rate max_epochs 2023-05-09 00:35:28 +08:00
requirements.txt [code] update requirements 2023-05-06 21:05:53 +08:00
utils.py [code] refactor 2023-05-07 13:01:02 +08:00

README.md

GPT-Pretrain

Usage

python lit_train.py --model_name gpt2
python lit_export.py --version 0
python generate.py --model_name_or_path exports/version_0 --tokenizer_name_or_path gpt2