Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
36.1
TFLOPS
28
6
48
nicolo
nicolollo
Follow
21world's profile picture
1 follower
ยท
7 following
AI & ML interests
None yet
Recent Activity
liked
a dataset
3 days ago
nexa-collaboration/allava_vflan_dpo_v2
reacted
to
lewtun
's
post
with ๐
8 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute ๐ฅ How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: ๐ Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. ๐ Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. ๐งญ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!
reacted
to
burtenshaw
's
post
with โค๏ธ
15 days ago
Quick update from week 1 of smol course. The community is taking the driving seat and using the material for their own projects. If you want to do the same, join in! - we have ongoing translation projects in Korean, Vietnamese, Portuguese, and Spanish - 3 chapters are ready for students. On topics like, instruction tuning, preference alignment, and parameter efficient fine tuning - 3 chapters are in progress on evaluation, vision language models, and synthetic data. - around 780 people have forked the repo to use it for learning, teaching, sharing. โญ๏ธ Next step is to support people that want to use the course for teaching, content creation, internal knowledge sharing, or anything. If you're into this. Drop an issue or PR REPO: https://buff.ly/3ZCMKX2 discord channel: https://buff.ly/4f9F8jA
View all activity
Organizations
models
None public yet
datasets
2
Sort:ย Recently updated
nicolollo/my-distiset
Viewer
โข
Updated
Sep 16
โข
1
โข
44
nicolollo/docci
Viewer
โข
Updated
Jul 11
โข
14.7k
โข
40