Colin
|
b7c27af6c8
|
Add research_token to dump token relationship in attention layer0.
|
2024-01-29 00:12:08 +08:00 |
Colin
|
185278f3a9
|
Update research_attention dump without sum.
|
2024-01-28 17:55:08 +08:00 |
Colin
|
3f296ccdb2
|
Update research.
|
2024-01-26 20:35:25 +08:00 |
Colin
|
bba27e3444
|
Refine prepareInput.
|
2024-01-25 18:05:08 +08:00 |
Colin
|
19491d1f4a
|
Refine model of qwen.
|
2024-01-24 21:26:19 +08:00 |
Colin
|
11af10e710
|
Refine research_attention and forward model.
|
2024-01-23 13:13:21 +08:00 |
Colin
|
1811b9611a
|
Refine research_attention.
|
2024-01-22 20:57:27 +08:00 |
Colin
|
5dbac40925
|
Refine.
|
2024-01-21 22:43:16 +08:00 |
Colin
|
17a2df2e6f
|
Update show and q@k dump.
|
2024-01-21 20:50:36 +08:00 |
Colin
|
ae6ea67bbe
|
Refine qwen/research_attention.py.
|
2024-01-21 17:54:05 +08:00 |
Colin
|
dab1c94bc6
|
Refine qwen to module format.
|
2024-01-21 16:47:54 +08:00 |
Colin
|
9d28280cb1
|
Refine model of qwen and add runner.
|
2024-01-21 12:45:56 +08:00 |
Colin
|
7c047f0b32
|
Refine model of qwen.
|
2024-01-21 02:33:55 +08:00 |
Colin
|
40ae899515
|
Refine model of qwen.
|
2024-01-20 23:01:09 +08:00 |
Colin
|
4d493014ba
|
Refine model of qwen.
|
2024-01-20 20:20:18 +08:00 |
Colin
|
12dcbec718
|
PreTrainedModel to nn.Module
|
2024-01-20 20:06:59 +08:00 |
Colin
|
0458e7303c
|
Remove attention_mask
|
2024-01-20 18:08:20 +08:00 |
Colin
|
f96bcc799c
|
Refine model of qwen for long sequence in eval.
|
2024-01-19 14:54:48 +08:00 |
Colin
|
3233616aac
|
Delete kv cache of qwen.
|
2024-01-18 20:23:21 +08:00 |
Colin
|
90fbc2642e
|
Refine modeling and demo.
|
2024-01-14 17:21:14 +08:00 |
Colin
|
d13f7e6c57
|
Format model of qwen.
|
2024-01-13 17:16:43 +08:00 |
Colin
|
5cf6e8b013
|
Refine qwen model.
|
2024-01-13 16:50:25 +08:00 |
Colin
|
063f722177
|
Refine model of qwen.
|
2024-01-11 07:00:18 +00:00 |
Colin
|
7d7b4381f8
|
Update qwen model.
|
2024-01-10 13:16:54 +00:00 |
Colin
|
245d251663
|
Refine chat output format.
|
2024-01-10 11:35:46 +00:00 |
Colin
|
69cb525ab0
|
Refine model of qwen.
|
2024-01-07 22:49:21 +08:00 |
Colin
|
94ecf0f561
|
Refine model of qwen.
|
2024-01-07 22:36:55 +08:00 |
Colin
|
4c0991a409
|
Reformat qwen.
|
2024-01-07 21:54:37 +08:00 |
Colin
|
aa2d3b96c4
|
Delete unused files.
|
2024-01-07 21:43:02 +08:00 |
Colin
|
82ac3e4863
|
Refine model of qwen.
|
2024-01-07 17:50:58 +08:00 |
Colin
|
3f8ea9db07
|
Remove return_dict_in_generate
|
2024-01-07 17:32:24 +08:00 |
Colin
|
a8f2fbbff5
|
Remove return_dict config. Remove unused files.
|
2024-01-07 17:28:15 +08:00 |
Colin
|
90cb0fe236
|
Refine model of qwen.
|
2024-01-07 16:53:53 +08:00 |
Colin
|
611396b656
|
Format qwen model.
|
2024-01-07 16:23:04 +08:00 |
Colin
|
255a2ff71c
|
Update qwen model.
|
2024-01-07 16:22:41 +08:00 |
Colin
|
f6538c1111
|
Update qwen model; add generator and sample.
|
2024-01-07 16:15:27 +08:00 |
Colin
|
55fed4bc5a
|
Update qwen demo.py
|
2024-01-05 11:49:35 +08:00 |
Colin
|
9deb809a88
|
Update finetune
|
2024-01-04 19:12:28 +08:00 |
Colin
|
ec72ee1141
|
Add finetune
|
2024-01-04 17:36:41 +08:00 |
Colin
|
9b90c607e0
|
Add qwen files.
|
2024-01-03 21:03:27 +08:00 |
Colin
|
3a4e99f7e3
|
Add qwen and refine folders.
|
2024-01-03 20:26:26 +08:00 |