Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Azurro
/
APT3-1B-Base
like
14
Follow
Azurro
8
Text Generation
Transformers
Safetensors
chrisociepa/wikipedia-pl-20230401
Polish
llama
ALLaMo
text-generation-inference
License:
cc-by-nc-4.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
d67a4a9
APT3-1B-Base
1 contributor
History:
6 commits
chrisociepa
Update README.md
d67a4a9
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
README.md
9.56 kB
Update README.md
10 months ago
allamo_config_ckpt.pt
pickle
Detected Pickle imports (4)
"model.AllamoTransformerConfig"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
How to fix it?
3.5 kB
LFS
Upload 9 files
10 months ago
allamo_model_ckpt.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
4.17 GB
LFS
Upload 9 files
10 months ago
allamo_optimizer_ckpt.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
8.33 GB
LFS
Upload 9 files
10 months ago
apt3-1b-base-eval.jpg
117 kB
Upload 2 files
10 months ago
apt3-1b-base-train.jpg
179 kB
Upload 2 files
10 months ago
config.json
608 Bytes
Upload 9 files
10 months ago
generation_config.json
111 Bytes
Upload 9 files
10 months ago
model.safetensors
4.17 GB
LFS
Upload 9 files
10 months ago
special_tokens_map.json
96 Bytes
Upload 9 files
10 months ago
tokenizer.json
1.42 MB
Upload 9 files
10 months ago
tokenizer_config.json
281 Bytes
Upload 9 files
10 months ago