Colin
|
b7c27af6c8
|
Add research_token to dump token relationship in attention layer0.
|
2024-01-29 00:12:08 +08:00 |
Colin
|
185278f3a9
|
Update research_attention dump without sum.
|
2024-01-28 17:55:08 +08:00 |
Colin
|
3f296ccdb2
|
Update research.
|
2024-01-26 20:35:25 +08:00 |
Colin
|
bba27e3444
|
Refine prepareInput.
|
2024-01-25 18:05:08 +08:00 |
Colin
|
19491d1f4a
|
Refine model of qwen.
|
2024-01-24 21:26:19 +08:00 |
Colin
|
11af10e710
|
Refine research_attention and forward model.
|
2024-01-23 13:13:21 +08:00 |
Colin
|
1811b9611a
|
Refine research_attention.
|
2024-01-22 20:57:27 +08:00 |
Colin
|
5dbac40925
|
Refien.
|
2024-01-21 22:43:16 +08:00 |
Colin
|
17a2df2e6f
|
Update show and q@k dump.
|
2024-01-21 20:50:36 +08:00 |
Colin
|
ae6ea67bbe
|
Refine qwen/research_attention.py.
|
2024-01-21 17:54:05 +08:00 |
Colin
|
dab1c94bc6
|
Refine qwen to module fomater.
|
2024-01-21 16:47:54 +08:00 |