beomi/KoRWKV-1.5B
Tags: Text Generation, Transformers, PyTorch, Safetensors, Korean, doi:10.57967/hf/1293, rwkv, KoRWKV, Inference Endpoints
License: mit
Revision: 7f95d1b
1 contributor, History: 30 commits
Latest commit: beomi, "Update README.md" (7f95d1b), over 1 year ago
| File | Size | LFS | Last commit message | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 1.48 kB | no | initial commit | over 1 year ago |
| README.md | 2.93 kB | no | Update README.md | over 1 year ago |
| config.json | 454 Bytes | no | Add bfloat16 weight, with 4.37M step trained | over 1 year ago |
| generation_config.json | 116 Bytes | no | test | over 1 year ago |
| model-00001-of-00004.safetensors | 994 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| model-00002-of-00004.safetensors | 999 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| model-00003-of-00004.safetensors | 839 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| model-00004-of-00004.safetensors | 213 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| model.safetensors.index.json | 34.8 kB | no | Add bfloat16 weight, with 4.37M step trained | over 1 year ago |
| original.pth | 3.04 GB | yes | Add ~58B tokens ckpt | over 1 year ago |
| pytorch_model-00001-of-00004.bin | 994 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| pytorch_model-00002-of-00004.bin | 999 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| pytorch_model-00003-of-00004.bin | 839 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| pytorch_model-00004-of-00004.bin | 213 MB | yes | Add 1.5B release model (trained with 2048 batch) | over 1 year ago |
| pytorch_model.bin.index.json | 34.8 kB | no | Add bfloat16 weight, with 4.37M step trained | over 1 year ago |
| special_tokens_map.json | 93 Bytes | no | test | over 1 year ago |
| tokenizer.json | 2.69 MB | no | test | over 1 year ago |
| tokenizer_config.json | 236 Bytes | no | test | over 1 year ago |

original.pth is a pickle file. Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.BFloat16Storage"
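
The listing above contains four weight shards plus a `model.safetensors.index.json` file. Sharded checkpoints on the Hub normally use that index to record, under a `weight_map` key, which shard holds each tensor. The following is a minimal sketch of how such an index could be inspected, not a description of this repository's actual contents: the tensor names in `example_index` are hypothetical placeholders, and `tensors_by_shard` is an illustrative helper, not a library function.

```python
import json
from collections import defaultdict


def tensors_by_shard(index: dict) -> dict:
    """Group tensor names by the shard file that stores them."""
    shards = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(tensor_name)
    # Sort shard names and tensor names so the result is deterministic.
    return {name: sorted(tensors) for name, tensors in sorted(shards.items())}


# Hypothetical, heavily shortened index. The tensor names are illustrative
# only and are NOT taken from the real model.safetensors.index.json.
example_index = json.loads("""
{
  "weight_map": {
    "rwkv.embeddings.weight": "model-00001-of-00004.safetensors",
    "rwkv.blocks.0.ln1.weight": "model-00001-of-00004.safetensors",
    "head.weight": "model-00004-of-00004.safetensors"
  }
}
""")

grouped = tensors_by_shard(example_index)
for shard, names in grouped.items():
    print(shard, len(names))
```

To apply this to the real file, replace `example_index` with the result of `json.load` on a downloaded copy of `model.safetensors.index.json`.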