Nicholas Broad's picture

Nicholas Broad PRO

nbroad

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Hugging Face's profile picture Hugging Face Success Team's profile picture Flax Community's profile picture Demo Corp's profile picture Atmos Bank's profile picture Hugging Test Lab's profile picture kisholas's profile picture Sphere Spring 2023 Class's profile picture HF Canonical Model Maintainers's profile picture Carmaker Quality Datalab's profile picture Open-Source AI Meetup's profile picture Inference Endpoints's profile picture Blog-explorers's profile picture Nicholas Broad's profile picture llm-sci-exam-anrut's profile picture Enterprise Explorers's profile picture dori's profile picture Voya Financial's profile picture Goodby Silverstein & Partners's profile picture Uber's profile picture Onehot Labs's profile picture Johnson & Johnson's profile picture Gorgias's profile picture llm daigt's profile picture Hapax's profile picture team's profile picture Hugging Face Party @ PyTorch Conference's profile picture kaggle eedi team (improved-barnacle)'s profile picture Nerdy Face's profile picture open/ acc's profile picture

nbroad's activity

New activity in huggingface/brand-assets 4 days ago
liked a Space 5 days ago
reacted to lewtun's post with 🔥 6 days ago
view post
Post
3178
I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive!

https://x.com/casper_hansen_/status/1875872309996855343

Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025!

[1] Training Large Language Models to Reason in a Continuous Latent Space (2412.06769)
[2] https://huggingface.co/blog/ganqu/prime