-
The Generative AI Paradox: "What It Can Create, It May Not Understand"
Paper • 2311.00059 • Published • 18 -
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 46 -
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Paper • 2403.07816 • Published • 39 -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 57
xansar
xansar
AI & ML interests
None yet
Recent Activity
liked
a dataset
27 days ago
nlp-guild/medical-data
liked
a dataset
2 months ago
BAAI/IndustryCorpus
liked
a model
6 months ago
Undi95/Meta-Llama-3-8B-hf
Organizations
Collections
1
models
None public yet
datasets
None public yet