Colin
|
cf726a5b9f
|
Add loss and logger code.
|
2024-03-05 15:54:03 +08:00 |
Colin
|
9e8e92ae25
|
Update trainer to custom data.
|
2024-03-04 21:41:46 +08:00 |
Colin
|
1622bf3054
|
add mnbvc dataset .
|
2024-03-03 23:35:40 +08:00 |
Colin
|
8120be66a6
|
sperate train and val dataset.
|
2024-02-26 23:59:00 +08:00 |
Colin
|
d1906629ab
|
Enable wit train on cutome dataset and loss down.
|
2024-02-26 22:44:26 +08:00 |
Colin
|
1ef3e419cb
|
Add custom dataset support.
|
2024-02-26 22:44:26 +08:00 |
Colin
|
e5f97af291
|
Add wit train support.
|
2024-02-26 22:44:26 +08:00 |
Colin
|
fc071dce70
|
Remove no use tiktoken.
|
2024-02-21 21:11:15 +08:00 |
Colin
|
fe13f12327
|
Add wit.
|
2024-02-06 14:08:45 +08:00 |
Colin
|
6366b52fef
|
Add reaserch sile resault.
|
2024-02-04 23:48:51 +08:00 |
Colin
|
9d5d590b09
|
Add dataset and wit.
|
2024-02-04 23:48:24 +08:00 |
Colin
|
b7c27af6c8
|
Add research_token to dump token relationship in attention layer0.
|
2024-01-29 00:12:08 +08:00 |
Colin
|
185278f3a9
|
Update research_attention dump without sum.
|
2024-01-28 17:55:08 +08:00 |
Colin
|
3f296ccdb2
|
Update research.
|
2024-01-26 20:35:25 +08:00 |
Colin
|
bba27e3444
|
Refine prepareInput.
|
2024-01-25 18:05:08 +08:00 |
Colin
|
19491d1f4a
|
Refine model of qwen.
|
2024-01-24 21:26:19 +08:00 |
Colin
|
11af10e710
|
Refine research_attention and forward model.
|
2024-01-23 13:13:21 +08:00 |
Colin
|
1811b9611a
|
Refine research_attention.
|
2024-01-22 20:57:27 +08:00 |
Colin
|
5dbac40925
|
Refien.
|
2024-01-21 22:43:16 +08:00 |
Colin
|
17a2df2e6f
|
Update show and q@k dump.
|
2024-01-21 20:50:36 +08:00 |
Colin
|
ae6ea67bbe
|
Refine qwen/research_attention.py.
|
2024-01-21 17:54:05 +08:00 |
Colin
|
dab1c94bc6
|
Refine qwen to module fomater.
|
2024-01-21 16:47:54 +08:00 |
Colin
|
9d28280cb1
|
Refine model of qwen and add runner.
|
2024-01-21 12:45:56 +08:00 |
Colin
|
7c047f0b32
|
Refine model of qwen.
|
2024-01-21 02:33:55 +08:00 |
Colin
|
40ae899515
|
Refine model of qwen.
|
2024-01-20 23:01:09 +08:00 |
Colin
|
4d493014ba
|
Refine model of qwen.
|
2024-01-20 20:20:18 +08:00 |
Colin
|
12dcbec718
|
PreTrainedModel to mm.Module
|
2024-01-20 20:06:59 +08:00 |
Colin
|
0458e7303c
|
Remove attention_mask
|
2024-01-20 18:08:20 +08:00 |
Colin
|
cd50c10e8c
|
Move readme to charglm.
|
2024-01-20 00:11:12 +08:00 |
Colin
|
e7ba788982
|
Delete docs.
|
2024-01-20 00:10:27 +08:00 |
colin
|
69154a4777
|
删除 doc/主观意识生成对话.md
|
2024-01-19 18:22:50 +08:00 |
colin
|
fd0b0c63ba
|
删除 chatglm/graph.md
|
2024-01-19 18:22:39 +08:00 |
Colin
|
f96bcc799c
|
Refine model of qwen for long sequence in eval.
|
2024-01-19 14:54:48 +08:00 |
Colin
|
45c2f532ff
|
Add mem_tracker in tools.
|
2024-01-19 14:52:28 +08:00 |
Colin
|
3233616aac
|
Delete kv cache of qwen.
|
2024-01-18 20:23:21 +08:00 |
Colin
|
0a78627e48
|
Add doc
|
2024-01-17 22:56:30 +08:00 |
Colin
|
90fbc2642e
|
Refine modeling and demo.
|
2024-01-14 17:21:14 +08:00 |
Colin
|
332d27cc05
|
Delete unused files.
|
2024-01-14 15:42:46 +08:00 |
Colin
|
fb276cdeea
|
Add test markdown for document.
|
2024-01-14 14:28:45 +08:00 |
Colin
|
d13f7e6c57
|
Format model of qwen.
|
2024-01-13 17:16:43 +08:00 |
Colin
|
5cf6e8b013
|
Refine qwen model.
|
2024-01-13 16:50:25 +08:00 |
Colin
|
9386d044b6
|
Update tools of show.
|
2024-01-13 16:48:56 +08:00 |
Colin
|
063f722177
|
Refine model of qwen.
|
2024-01-11 07:00:18 +00:00 |
Colin
|
7d7b4381f8
|
Update qwen model.
|
2024-01-10 13:16:54 +00:00 |
Colin
|
245d251663
|
Refine chat output format.
|
2024-01-10 11:35:46 +00:00 |
Colin
|
1b8007e1c3
|
Refine train.
|
2024-01-10 05:22:26 +00:00 |
Colin
|
69cb525ab0
|
Refine model of qwen.
|
2024-01-07 22:49:21 +08:00 |
Colin
|
94ecf0f561
|
Refine model of qwen.
|
2024-01-07 22:36:55 +08:00 |
Colin
|
4c0991a409
|
Re format qwen.
|
2024-01-07 21:54:37 +08:00 |
Colin
|
aa2d3b96c4
|
Delete unused files.
|
2024-01-07 21:43:02 +08:00 |