Colin
|
b7c27af6c8
|
Add research_token to dump token relationship in attention layer0.
|
2024-01-29 00:12:08 +08:00 |
Colin
|
185278f3a9
|
Update research_attention dump without sum.
|
2024-01-28 17:55:08 +08:00 |
Colin
|
3f296ccdb2
|
Update research.
|
2024-01-26 20:35:25 +08:00 |
Colin
|
bba27e3444
|
Refine prepareInput.
|
2024-01-25 18:05:08 +08:00 |
Colin
|
19491d1f4a
|
Refine model of qwen.
|
2024-01-24 21:26:19 +08:00 |
Colin
|
11af10e710
|
Refine research_attention and forward model.
|
2024-01-23 13:13:21 +08:00 |
Colin
|
1811b9611a
|
Refine research_attention.
|
2024-01-22 20:57:27 +08:00 |
Colin
|
5dbac40925
|
Refien.
|
2024-01-21 22:43:16 +08:00 |
Colin
|
17a2df2e6f
|
Update show and q@k dump.
|
2024-01-21 20:50:36 +08:00 |
Colin
|
ae6ea67bbe
|
Refine qwen/research_attention.py.
|
2024-01-21 17:54:05 +08:00 |
Colin
|
dab1c94bc6
|
Refine qwen to module fomater.
|
2024-01-21 16:47:54 +08:00 |
Colin
|
9d28280cb1
|
Refine model of qwen and add runner.
|
2024-01-21 12:45:56 +08:00 |
Colin
|
7c047f0b32
|
Refine model of qwen.
|
2024-01-21 02:33:55 +08:00 |
Colin
|
40ae899515
|
Refine model of qwen.
|
2024-01-20 23:01:09 +08:00 |
Colin
|
4d493014ba
|
Refine model of qwen.
|
2024-01-20 20:20:18 +08:00 |
Colin
|
12dcbec718
|
PreTrainedModel to mm.Module
|
2024-01-20 20:06:59 +08:00 |
Colin
|
0458e7303c
|
Remove attention_mask
|
2024-01-20 18:08:20 +08:00 |
Colin
|
cd50c10e8c
|
Move readme to charglm.
|
2024-01-20 00:11:12 +08:00 |
Colin
|
e7ba788982
|
Delete docs.
|
2024-01-20 00:10:27 +08:00 |
colin
|
69154a4777
|
删除 doc/主观意识生成对话.md
|
2024-01-19 18:22:50 +08:00 |
colin
|
fd0b0c63ba
|
删除 chatglm/graph.md
|
2024-01-19 18:22:39 +08:00 |
Colin
|
f96bcc799c
|
Refine model of qwen for long sequence in eval.
|
2024-01-19 14:54:48 +08:00 |
Colin
|
45c2f532ff
|
Add mem_tracker in tools.
|
2024-01-19 14:52:28 +08:00 |
Colin
|
3233616aac
|
Delete kv cache of qwen.
|
2024-01-18 20:23:21 +08:00 |
Colin
|
0a78627e48
|
Add doc
|
2024-01-17 22:56:30 +08:00 |
Colin
|
90fbc2642e
|
Refine modeling and demo.
|
2024-01-14 17:21:14 +08:00 |
Colin
|
332d27cc05
|
Delete unused files.
|
2024-01-14 15:42:46 +08:00 |
Colin
|
fb276cdeea
|
Add test markdown for document.
|
2024-01-14 14:28:45 +08:00 |
Colin
|
d13f7e6c57
|
Format model of qwen.
|
2024-01-13 17:16:43 +08:00 |
Colin
|
5cf6e8b013
|
Refine qwen model.
|
2024-01-13 16:50:25 +08:00 |
Colin
|
9386d044b6
|
Update tools of show.
|
2024-01-13 16:48:56 +08:00 |
Colin
|
063f722177
|
Refine model of qwen.
|
2024-01-11 07:00:18 +00:00 |
Colin
|
7d7b4381f8
|
Update qwen model.
|
2024-01-10 13:16:54 +00:00 |
Colin
|
245d251663
|
Refine chat output format.
|
2024-01-10 11:35:46 +00:00 |
Colin
|
1b8007e1c3
|
Refine train.
|
2024-01-10 05:22:26 +00:00 |
Colin
|
69cb525ab0
|
Refine model of qwen.
|
2024-01-07 22:49:21 +08:00 |
Colin
|
94ecf0f561
|
Refine model of qwen.
|
2024-01-07 22:36:55 +08:00 |
Colin
|
4c0991a409
|
Re format qwen.
|
2024-01-07 21:54:37 +08:00 |
Colin
|
aa2d3b96c4
|
Delete unused files.
|
2024-01-07 21:43:02 +08:00 |
Colin
|
82ac3e4863
|
Refine model of qwen.
|
2024-01-07 17:50:58 +08:00 |
Colin
|
3f8ea9db07
|
Remote return_dict_in_generate
|
2024-01-07 17:32:24 +08:00 |
Colin
|
a8f2fbbff5
|
Remote return_dict config. Remove unuse files.
|
2024-01-07 17:28:15 +08:00 |
Colin
|
90cb0fe236
|
Refine model of qwen.
|
2024-01-07 16:53:53 +08:00 |
Colin
|
611396b656
|
Format qwen model.
|
2024-01-07 16:23:04 +08:00 |
Colin
|
255a2ff71c
|
Update qwen model.
|
2024-01-07 16:22:41 +08:00 |
Colin
|
f6538c1111
|
Update qwen model add generator and sample.
|
2024-01-07 16:15:27 +08:00 |
Colin
|
08f7b75efe
|
Add train resnet test.
|
2024-01-07 15:09:45 +08:00 |
Colin
|
467c78d83d
|
Update and try seqgpt
|
2024-01-07 15:06:39 +08:00 |
Colin
|
0dd2f2bab4
|
add seqgpt and prompt_clue
|
2024-01-06 21:05:39 +08:00 |
Colin
|
65578680cf
|
Add prompt_clue.
|
2024-01-05 20:33:01 +08:00 |