ML Foundations

non-profit

AI & ML interests

None defined yet.

Recent Activity

jieyuz2 authored a paper 9 days ago

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

hbXNov authored a paper 9 days ago

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning

hbXNov authored a paper 18 days ago

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

View all activity

mlfoundations's activity

jieyuz2

authored a paper 9 days ago

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

Paper • 2503.23361 • Published 12 days ago • 6

hbXNov

authored a paper 9 days ago

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning

Paper • 2504.01005 • Published 9 days ago • 15

hbXNov

authored a paper 18 days ago

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

Paper • 2503.17352 • Published 20 days ago • 21

achal-tri

authored a paper about 1 month ago

Should VLMs be Pre-trained with Image Data?

Paper • 2503.07603 • Published Mar 10 • 3

samirg

authored a paper about 1 month ago

Should VLMs be Pre-trained with Image Data?

Paper • 2503.07603 • Published Mar 10 • 3

jmercat

authored a paper about 1 month ago

Should VLMs be Pre-trained with Image Data?

Paper • 2503.07603 • Published Mar 10 • 3

sedrickkeh

authored a paper about 1 month ago

Should VLMs be Pre-trained with Image Data?

Paper • 2503.07603 • Published Mar 10 • 3

JJitsev

authored a paper about 1 month ago

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Paper • 2502.19413 • Published Feb 26 • 19

alon-albalak

authored a paper about 1 month ago

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Paper • 2502.17387 • Published Feb 24 • 5

alon-albalak

authored a paper 3 months ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 96

jieyuz2

authored 10 papers 4 months ago

DataComp: In search of the next generation of multimodal datasets

Paper • 2304.14108 • Published Apr 27, 2023 • 2

Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias

Paper • 2306.15895 • Published Jun 28, 2023

SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality

Paper • 2306.14610 • Published Jun 26, 2023

Subclass-balancing Contrastive Learning for Long-tailed Recognition

Paper • 2306.15925 • Published Jun 28, 2023

WRENCH: A Comprehensive Benchmark for Weak Supervision

Paper • 2109.11377 • Published Sep 23, 2021

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

Paper • 2308.08155 • Published Aug 16, 2023 • 7

When to Learn What: Model-Adaptive Data Augmentation Curriculum

Paper • 2309.04747 • Published Sep 9, 2023

A Survey on Programmatic Weak Supervision

Paper • 2202.05433 • Published Feb 11, 2022

Training Language Model Agents without Modifying Language Models

Paper • 2402.11359 • Published Feb 17, 2024 • 2

m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Paper • 2403.11085 • Published Mar 17, 2024