efromomr/llm-course-hw2-ppo

Tags: Text Generation, Generated from Trainer, text-generation-inference, Inference Endpoints


1 contributor

History: 5 commits


Latest commit: Update README.md (5b51947, verified, 10 days ago)

File                      Size            Last commit       Updated
.gitattributes            1.52 kB         initial commit    10 days ago
README.md                 3.1 kB          Update README.md  10 days ago
config.json               749 Bytes       End of training   10 days ago
generation_config.json    135 Bytes       End of training   10 days ago
merges.txt                466 kB          End of training   10 days ago
model.safetensors         538 MB (LFS)    End of training   10 days ago
special_tokens_map.json   658 Bytes       End of training   10 days ago
tokenizer.json            3.52 MB         End of training   10 days ago
tokenizer_config.json     3.65 kB         End of training   10 days ago
training_args.bin         6.2 kB (LFS)    End of training   10 days ago
vocab.json                801 kB          End of training   10 days ago

training_args.bin: Detected Pickle imports (10)
- transformers.trainer_utils.IntervalStrategy
- transformers.trainer_utils.SchedulerType
- transformers.trainer_utils.HubStrategy
- accelerate.state.PartialState
- torch.device
- trl.trainer.ppo_config.PPOConfig
- transformers.trainer_pt_utils.AcceleratorConfig
- accelerate.utils.dataclasses.DistributedType
- transformers.training_args.OptimizerNames
- transformers.trainer_utils.SaveStrategy
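
The Pickle imports detected in training_args.bin can be reproduced locally without unpickling the file. The sketch below is one way to do that with the standard-library pickletools module; it is not the Hub's own scanner. The function name list_pickle_imports is hypothetical, and the string tracking for STACK_GLOBAL is a simplification that assumes the module and attribute name are the last two strings pushed. It also assumes you already have the raw pickle bytes: files written by recent versions of torch.save are usually zip archives, in which case the pickle stream would first have to be read out of the archive's inner data.pkl member.

```python
import pickle
import pickletools
from collections import OrderedDict

_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}
_PUT_OPS = {"BINPUT", "LONG_BINPUT", "PUT"}
_GET_OPS = {"BINGET", "LONG_BINGET", "GET"}


def list_pickle_imports(data):
    """Return 'module.name' for each global a pickle references, without loading it."""
    imports = set()
    strings = []  # strings pushed so far, in order (simplified view of the stack)
    memo = {}     # memo index -> string value, or None for non-strings
    top = None    # last pushed value if it was a string, else None
    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        if name == "GLOBAL":
            # Protocols 0-3: arg is "module attr" joined by a single space.
            module, attr = arg.split(" ", 1)
            imports.add(module + "." + attr)
            top = None
        elif name == "STACK_GLOBAL":
            # Protocol 4+: module and attr were pushed as strings just before.
            if len(strings) >= 2:
                imports.add(strings[-2] + "." + strings[-1])
            top = None
        elif name in _STRING_OPS:
            strings.append(arg)
            top = arg
        elif name == "MEMOIZE":
            memo[len(memo)] = top
        elif name in _PUT_OPS:
            memo[arg] = top
        elif name in _GET_OPS:
            # A memoized string (e.g. a repeated module name) is pushed again.
            top = memo.get(arg)
            if isinstance(top, str):
                strings.append(top)
        else:
            top = None
    return imports


# Demo on a harmless in-memory pickle rather than a downloaded file.
demo_bytes = pickle.dumps(OrderedDict(), protocol=4)
print(sorted(list_pickle_imports(demo_bytes)))
```

Because genops only disassembles the opcode stream, nothing in the pickle is executed or imported while scanning, which is the point of inspecting a .bin file this way before deciding whether to load it.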