Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
Rodolfo Amorim
Dolfini
Follow
0 followers
ยท
3 following
AI & ML interests
None yet
Recent Activity
liked
a model
17 days ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
reacted
to
reach-vb
's
post
with ๐
4 months ago
Smol models ftw! AMD released AMD OLMo 1B - beats OpenELM, tiny llama on MT Bench, Alpaca Eval - Apache 2.0 licensed ๐ฅ > Trained with 1.3 trillion (dolma 1.7) tokens on 16 nodes, each with 4 MI250 GPUs > Three checkpoints: - AMD OLMo 1B: Pre-trained model - AMD OLMo 1B SFT: Supervised fine-tuned on Tulu V2, OpenHermes-2.5, WebInstructSub, and Code-Feedback datasets - AMD OLMo 1B SFT DPO: Aligned with human preferences using Direct Preference Optimization (DPO) on UltraFeedback dataset Key Insights: > Pre-trained with less than half the tokens of OLMo-1B > Post-training steps include two-phase SFT and DPO alignment > Data for SFT: - Phase 1: Tulu V2 - Phase 2: OpenHermes-2.5, WebInstructSub, and Code-Feedback > Model checkpoints on the Hub & Integrated with Transformers โก๏ธ Congratulations & kudos to AMD on a brilliant smol model release! ๐ค https://huggingface.co/collections/amd/amd-olmo-6723e7d04a49116d8ec95070
liked
a model
4 months ago
allenai/tulu-2-dpo-70b
View all activity
Organizations
None yet
Dolfini
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
17 days ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
โข
Updated
6 days ago
โข
420k
โข
โข
573
liked
a model
4 months ago
allenai/tulu-2-dpo-70b
Text Generation
โข
Updated
Jan 31, 2024
โข
3.74k
โข
156