42 Commits

Author SHA1 Message Date
Colin 6366b52fef Add reaserch sile resault. 2024-02-04 23:48:51 +08:00
Colin b7c27af6c8 Add research_token to dump token relationship in attention layer0. 2024-01-29 00:12:08 +08:00
Colin 185278f3a9 Update research_attention dump without sum. 2024-01-28 17:55:08 +08:00
Colin 3f296ccdb2 Update research. 2024-01-26 20:35:25 +08:00
Colin bba27e3444 Refine prepareInput. 2024-01-25 18:05:08 +08:00
Colin 19491d1f4a Refine model of qwen. 2024-01-24 21:26:19 +08:00
Colin 11af10e710 Refine research_attention and forward model. 2024-01-23 13:13:21 +08:00
Colin 1811b9611a Refine research_attention. 2024-01-22 20:57:27 +08:00
Colin 5dbac40925 Refien. 2024-01-21 22:43:16 +08:00
Colin 17a2df2e6f Update show and q@k dump. 2024-01-21 20:50:36 +08:00
Colin ae6ea67bbe Refine qwen/research_attention.py. 2024-01-21 17:54:05 +08:00
Colin dab1c94bc6 Refine qwen to module fomater. 2024-01-21 16:47:54 +08:00
Colin 9d28280cb1 Refine model of qwen and add runner. 2024-01-21 12:45:56 +08:00
Colin 7c047f0b32 Refine model of qwen. 2024-01-21 02:33:55 +08:00
Colin 40ae899515 Refine model of qwen. 2024-01-20 23:01:09 +08:00
Colin 4d493014ba Refine model of qwen. 2024-01-20 20:20:18 +08:00
Colin 12dcbec718 PreTrainedModel to mm.Module 2024-01-20 20:06:59 +08:00
Colin 0458e7303c Remove attention_mask 2024-01-20 18:08:20 +08:00
Colin f96bcc799c Refine model of qwen for long sequence in eval. 2024-01-19 14:54:48 +08:00
Colin 3233616aac Delete kv cache of qwen. 2024-01-18 20:23:21 +08:00
Colin 90fbc2642e Refine modeling and demo. 2024-01-14 17:21:14 +08:00
Colin d13f7e6c57 Format model of qwen. 2024-01-13 17:16:43 +08:00
Colin 5cf6e8b013 Refine qwen model. 2024-01-13 16:50:25 +08:00
Colin 063f722177 Refine model of qwen. 2024-01-11 07:00:18 +00:00
Colin 7d7b4381f8 Update qwen model. 2024-01-10 13:16:54 +00:00
Colin 245d251663 Refine chat output format. 2024-01-10 11:35:46 +00:00
Colin 69cb525ab0 Refine model of qwen. 2024-01-07 22:49:21 +08:00
Colin 94ecf0f561 Refine model of qwen. 2024-01-07 22:36:55 +08:00
Colin 4c0991a409 Re format qwen. 2024-01-07 21:54:37 +08:00
Colin aa2d3b96c4 Delete unused files. 2024-01-07 21:43:02 +08:00
Colin 82ac3e4863 Refine model of qwen. 2024-01-07 17:50:58 +08:00
Colin 3f8ea9db07 Remote return_dict_in_generate 2024-01-07 17:32:24 +08:00
Colin a8f2fbbff5 Remote return_dict config. Remove unuse files. 2024-01-07 17:28:15 +08:00
Colin 90cb0fe236 Refine model of qwen. 2024-01-07 16:53:53 +08:00
Colin 611396b656 Format qwen model. 2024-01-07 16:23:04 +08:00
Colin 255a2ff71c Update qwen model. 2024-01-07 16:22:41 +08:00
Colin f6538c1111 Update qwen model add generator and sample. 2024-01-07 16:15:27 +08:00
Colin 55fed4bc5a Update qwen demo.py 2024-01-05 11:49:35 +08:00
Colin 9deb809a88 Update finetune 2024-01-04 19:12:28 +08:00
Colin ec72ee1141 Add finetune 2024-01-04 17:36:41 +08:00
Colin 9b90c607e0 Add qwen files. 2024-01-03 21:03:27 +08:00
Colin 3a4e99f7e3 Add qwen and refine folders. 2024-01-03 20:26:26 +08:00