Citaman (Anthonny OLIME)

upvoted a paper 3 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

upvoted an article 3 months ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 474

upvoted a paper 3 months ago

Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

Paper • 2501.11651 • Published Jan 20 • 1

upvoted a collection 3 months ago

ProLIP

Collection

Official ProLIP weights • 7 items • Updated 25 days ago • 6

upvoted a paper 8 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 69

upvoted 3 papers 10 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 102

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 90

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 55

upvoted a collection 11 months ago

Universal token classification

Collection

Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks. • 11 items • Updated Sep 10, 2024 • 12

upvoted 3 papers 11 months ago

upvoted 2 papers about 1 year ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 81

Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

Paper • 2403.18795 • Published Mar 27, 2024 • 21

Anthonny OLIME

AI & ML interests

Organizations

Citaman's activity