Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
THUDM
/
chatglm-6b
like
2.86k
Follow
Z.ai & THUKEG
2.4k
Transformers
PyTorch
Chinese
English
chatglm
glm
thudm
custom_code
arxiv:
2103.10360
arxiv:
2210.02414
arxiv:
2406.12793
Model card
Files
Files and versions
Community
109
Train
Deploy
Use this model
refs/pr/110
chatglm-6b
/
modeling_chatglm.py
Commit History
Fix position ids in 1d position encoding
a8ede82
duzx16
commited on
Apr 27, 2023
Fix input embeds
35ca523
duzx16
commited on
Apr 18, 2023
Change mask positions to batch
4de8efe
duzx16
commited on
Apr 14, 2023
Add empty_init option
eb55ff0
duzx16
commited on
Apr 13, 2023
Fix attention score on mps
cde457b
duzx16
commited on
Apr 9, 2023
Fix LogitsProcessor using slim checkpoint (
#29
)
61eee50
zxdu20
bcol
commited on
Apr 8, 2023
Use gmask in first place
9324de7
duzx16
commited on
Apr 6, 2023
Update code for slim
63ce1ba
duzx16
commited on
Apr 6, 2023
fix typo in use_gmask (
#21
)
551a50e
zxdu20
fzhang
commited on
Apr 5, 2023
Fix position ids expand
f82b180
duzx16
commited on
Apr 3, 2023
Fix generate
fb23542
duzx16
commited on
Apr 2, 2023
Fix attention mask for prefix prompt
08bc851
duzx16
commited on
Apr 1, 2023
No padding for chat function
4b7ffbf
duzx16
commited on
Apr 1, 2023
Implement batch generation
cc96a22
duzx16
commited on
Apr 1, 2023
Fix position id for training
11c270c
duzx16
commited on
Mar 31, 2023
fix GLM6BBlock name typo (
#20
)
9c7416d
zxdu20
richbrain
commited on
Mar 31, 2023
Add support for loading quantized model
2e1be30
duzx16
commited on
Mar 31, 2023
Use dynamic dtype for prompts
c949d03
duzx16
commited on
Mar 30, 2023
Fix backward for quantization
0cfae21
duzx16
commited on
Mar 30, 2023
Implement gradient checkpointing
aea6cef
duzx16
commited on
Mar 30, 2023
Fix bugs
0564795
duzx16
commited on
Mar 30, 2023
Add pad_token_id in config.json
2200e2b
duzx16
commited on
Mar 29, 2023
Set ignore_index for CrossEntropyLoss
5c64357
duzx16
commited on
Mar 29, 2023
Support batch training
8127ab6
duzx16
commited on
Mar 29, 2023
Merge branch 'main' into dev_pt
fbda120
duzx16
commited on
Mar 29, 2023
Add p-tuning v2
812f43f
duzx16
commited on
Mar 29, 2023
Fix context length in get_position_ids
096f3de
duzx16
commited on
Mar 28, 2023
Close CPU fusion on Mac
4a9b711
duzx16
commited on
Mar 23, 2023
Fix Chinese punctuation
d2bbc82
duzx16
commited on
Mar 22, 2023
Remove hardcode bos_token_id
2460dc2
duzx16
commited on
Mar 19, 2023
Add support for streaming output
42095d4
duzx16
commited on
Mar 19, 2023
Fix overflow in FP16
220f772
duzx16
commited on
Mar 16, 2023
Set is_parallelizable to False
f9f74fd
duzx16
commited on
Mar 15, 2023
Add logit processor for NaN or Inf scores
c3dece3
duzx16
commited on
Mar 15, 2023
Fix default history argument
9d1509a
duzx16
commited on
Mar 14, 2023
Add support for float32
d4832e8
duzx16
commited on
Mar 14, 2023
Fix past_key_values
cd8041e
duzx16
commited on
Mar 13, 2023
Add chatglm-6b
d11c6aa
Sengxian
commited on
Mar 13, 2023