new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Feb 20

Submitted by

bluelike

Qwen2.5-VL Technical Report

·
27 authors

Submitted by

HowieHwong

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

·
66 authors

Submitted by

myownskyW7

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

·
9 authors

Submitted by

Hao605

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

·
14 authors

Submitted by

weigao266

MoM: Linear Sequence Modeling with Mixture-of-Memories

·
5 authors

Submitted by

akhaliq

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

·
3 authors

Submitted by

yushi

Craw4LLM: Efficient Web Crawling for LLM Pretraining

·
3 authors

Submitted by

Muennighoff

MMTEB: Massive Multilingual Text Embedding Benchmark

·
86 authors

Submitted by

flydust

Small Models Struggle to Learn from Strong Reasoners

·
8 authors

Submitted by

Guanzheng

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

·
4 authors

Submitted by

michaelzhiluo

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

·
11 authors

Submitted by

akhaliq

Thinking Preference Optimization

·
5 authors

Submitted by

YuchengShi

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

·
7 authors

Submitted by

sidicity

Presumed Cultural Identity: How Names Shape LLM Responses

·
4 authors

Submitted by

cooperleong00

Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

·
4 authors

Submitted by

dexhunter

AIDE: AI-Driven Exploration in the Space of Code

·
7 authors

Submitted by

DrishtiSharma

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

·
20 authors

Submitted by

acharkq

NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation

·
10 authors

Submitted by

yuliang03181

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence

·
13 authors

Submitted by

junzhang98

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

·
9 authors

Submitted by

mmhamdy

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

·
8 authors

Submitted by

hamishivi

TESS 2: A Large-Scale Generalist Diffusion Language Model

·
4 authors

Submitted by

danny911kr

REALTALK: A 21-Day Real-World Dataset for Long-Term Conversation

·
5 authors

Submitted by

hyp1231

ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation

·
8 authors

Submitted by

rahmanidashti

Judging the Judges: A Collection of LLM-Generated Relevance Judgements

·
9 authors

Submitted by

fdschmidt93

MVL-SIB: A Massively Multilingual Vision-Language Benchmark for Cross-Modal Topical Matching

·
4 authors

Submitted by

XiangZ

High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion

·
5 authors

Submitted by

oneonlee

REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models

·
2 authors

Submitted by

floschne

GIMMICK -- Globally Inclusive Multimodal Multitask Cultural Knowledge Benchmarking

·
4 authors

Submitted by

nbalepur

Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above

·
3 authors

Submitted by

ludolara

Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval

·
4 authors

Submitted by

yyyaoyuan

Noise May Contain Transferable Knowledge: Understanding Semi-supervised Heterogeneous Domain Adaptation from an Empirical Perspective

·
5 authors