pablo oliva

poliva

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

chandar-lab/NeoBERT

upvoted a paper 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Organizations

None yet

poliva's activity

upvoted a paper 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275

upvoted 2 collections about 1 year ago

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12, 2024 • 68

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 131