更新 Readme.md
This commit is contained in:
parent
d9b64e4025
commit
1579e0b8f2
|
@ -144,7 +144,7 @@ return Linear(context_layer) -> [6, 1, 4096]
|
||||||
```
|
```
|
||||||
|
|
||||||
## GLMBlock
|
## GLMBlock
|
||||||
|
```
|
||||||
input
|
input
|
||||||
| \
|
| \
|
||||||
| RMSNorm
|
| RMSNorm
|
||||||
|
@ -158,5 +158,5 @@ return Linear(context_layer) -> [6, 1, 4096]
|
||||||
| dropout
|
| dropout
|
||||||
| /
|
| /
|
||||||
Add
|
Add
|
||||||
|
```
|
||||||
所有的输出shape都是[6, 1, 4096], 6:sequence_length 1:batch_num 4096:hidden_size
|
所有的输出shape都是[6, 1, 4096], 6:sequence_length 1:batch_num 4096:hidden_size
|
Loading…
Reference in New Issue