dicta-il/dictalm-7b
DICTA: The Israel Center for Text Analysis
Tags: Text Generation, Transformers, PyTorch, Safetensors, Hebrew, megatron_gpt, custom_code
arxiv: 2309.14568
License: cc-by-4.0
Branch: main
2 contributors
History: 6 commits
Latest commit: Shaltiel, SFconvertbot: Adding `safetensors` variant of this model (#1), c233431 (verified), 9 months ago
File | Scan | Size | Storage | Last commit | Updated
.gitattributes | Safe | 1.52 kB | - | initial commit | over 1 year ago
README.md | Safe | 3.74 kB | - | Update README.md | about 1 year ago
config.json | Safe | 1.01 kB | - | Upload 11 files | over 1 year ago
configuration_megatron_gpt.py | Safe | 9.57 kB | - | Updated flash attention usage | over 1 year ago
generation_config.json | Safe | 132 Bytes | - | Upload 11 files | over 1 year ago
merges.txt | Safe | 1.27 MB | - | Upload 11 files | over 1 year ago
model-00001-of-00002.safetensors | Safe | 9.97 GB | LFS | Adding `safetensors` variant of this model (#1) | 9 months ago
model-00002-of-00002.safetensors | Safe | 950 MB | LFS | Adding `safetensors` variant of this model (#1) | 9 months ago
model.safetensors.index.json | Safe | 41.1 kB | - | Adding `safetensors` variant of this model (#1) | 9 months ago
modeling_megatron_gpt.py | Safe | 55.1 kB | - | Updated flash attention usage | over 1 year ago
pytorch_model-00001-of-00002.bin | Safe, pickle | 9.97 GB | LFS | Upload 11 files | over 1 year ago
pytorch_model-00002-of-00002.bin | Safe, pickle | 950 MB | LFS | Upload 11 files | over 1 year ago
pytorch_model.bin.index.json | Safe | 39.4 kB | - | Upload 11 files | over 1 year ago
special_tokens_map.json | Safe | 567 Bytes | - | Upload 11 files | over 1 year ago
tokenizer_config.json | Safe | 890 Bytes | - | Upload 11 files | over 1 year ago
vocab.json | Safe | 1.88 MB | - | Upload 11 files | over 1 year ago

Detected Pickle imports (3) in each of pytorch_model-00001-of-00002.bin and pytorch_model-00002-of-00002.bin:
"torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
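
The pickle imports reported for the two pytorch_model-*.bin shards are the module-level names that unpickling would import. The listing does not show how the Hub detects them, so the following is only a minimal sketch of one way such names can be found, using the standard-library `pickletools` module to walk a pickle stream without executing it. The function name `pickle_imports` is hypothetical. The sketch assumes raw pickle bytes (a PyTorch `.bin` checkpoint is typically a zip archive that wraps a pickle, which would have to be extracted first), and its handling of `STACK_GLOBAL` is a heuristic that ignores memoized strings.

```python
import collections
import pickle
import pickletools


def pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled with pickletools, never loaded,
    so no code from the pickle is executed.
    """
    found = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as the two
            # preceding strings (heuristic: ignores memo lookups).
            if len(strings) >= 2:
                found.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return found


if __name__ == "__main__":
    # Illustrative payload only, not one of the model shards.
    payload = pickle.dumps(collections.OrderedDict(a=1))
    print(sorted(pickle_imports(payload)))
```

The safetensors shards in the same listing carry no pickle badge, which is consistent with that format storing only tensor data and metadata rather than pickled Python objects.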