zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B

Downloads last month
1,430
Safetensors
Model size
7.24B params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for ewqr2130/7B_ppo_phiRM_2GPU_3e-7step_4000

Quantizations
1 model