Commit Graph

19 Commits

Author SHA1 Message Date
Colin 0ae63298b2 use custom vocab_size. 2024-03-14 13:28:40 +08:00
Colin 05f17b1221 Refine model config and init. 2024-03-14 11:40:26 +08:00
Colin 8330cbb036 Add meaning dataset. 2024-03-13 19:41:02 +08:00
Colin c094afb0f9 Add tensorboard event out. 2024-03-09 16:55:03 +08:00
Colin f1394d5974 Refine code. 2024-03-08 20:46:42 +08:00
Colin 601c7f6510 Retest wit. 2024-03-07 16:30:37 +08:00
Colin a70d12d04d Rename train file. 2024-03-05 22:09:58 +08:00
Colin 9ef3e92b23 Try model train. 2024-03-05 22:09:28 +08:00
Colin 11fc8f1d39 Refine label used. 2024-03-05 22:08:37 +08:00
Colin cf726a5b9f Add loss and logger code. 2024-03-05 15:54:03 +08:00
Colin 9e8e92ae25 Update trainer to custom data. 2024-03-04 21:41:46 +08:00
Colin 1622bf3054 add mnbvc dataset. 2024-03-03 23:35:40 +08:00
Colin 8120be66a6 separate train and val dataset. 2024-02-26 23:59:00 +08:00
Colin d1906629ab Enable wit train on custom dataset and loss down. 2024-02-26 22:44:26 +08:00
Colin 1ef3e419cb Add custom dataset support. 2024-02-26 22:44:26 +08:00
Colin e5f97af291 Add wit train support. 2024-02-26 22:44:26 +08:00
Colin fc071dce70 Remove unused tiktoken. 2024-02-21 21:11:15 +08:00
Colin fe13f12327 Add wit. 2024-02-06 14:08:45 +08:00
Colin 9d5d590b09 Add dataset and wit. 2024-02-04 23:48:24 +08:00