Drishti Sharma PRO

DrishtiSharma · DrishtiShrrrma

AI & ML interests

None yet

Recent Activity

updated a dataset about 3 hours ago

DrishtiSharma/phi-gradio-logs

updated a Space 3 days ago

DrishtiSharma/patent-generator-v1

published a Space 3 days ago

DrishtiSharma/patent-generator-v1

DrishtiSharma's activity

upvoted a paper 3 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted a paper 6 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 8 days ago • 87

upvoted an article 22 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • 23 days ago • 65

upvoted 4 papers 23 days ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published 24 days ago • 77

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published 30 days ago • 18

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published 26 days ago • 29

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published 26 days ago • 22

upvoted 2 papers 27 days ago

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published 29 days ago • 22

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published 29 days ago • 30

upvoted 3 papers 28 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 29 days ago • 33

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published 30 days ago • 41

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 29 days ago • 143

upvoted a collection 28 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 3 days ago • 102

upvoted 3 papers 29 days ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 86

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 126

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 51

upvoted a paper about 1 month ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 142

upvoted an article about 1 month ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Article • Feb 10 • 48

upvoted 2 papers about 1 month ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 60

Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights

Paper • 2403.03506 • Published Mar 6, 2024 • 1