-
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 2.16M • • 2.71k -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 520k • • 4.38k -
mistralai/Mixtral-8x7B-v0.1
Text Generation • Updated • 45.5k • • 1.7k -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 60
Molone Laveh PRO
molonelaveh
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
liked
a model
11 days ago
stabilityai/stable-video-diffusion-img2vid
liked
a model
11 days ago
stabilityai/stable-video-diffusion-img2vid-xt
liked
a Space
20 days ago
Pendrokar/TTS-Spaces-Arena
Organizations
Collections
2
models
None public yet
datasets
None public yet