1 6 5

Shreyas Jena

jena-shreyas

AI & ML interests

None yet

Recent Activity

updated a model 16 days ago

jena-shreyas/gemma-2-2b-it-toxic-100

published a model 16 days ago

jena-shreyas/gemma-2-2b-it-safety-vector

published a model 16 days ago

jena-shreyas/gemma-2-2b-it-toxic-100

View all activity

Organizations

jena-shreyas's activity

updated a model 16 days ago

jena-shreyas/gemma-2-2b-it-toxic-100

Updated 16 days ago • 19

published 2 models 16 days ago

jena-shreyas/gemma-2-2b-it-safety-vector

Updated 16 days ago

jena-shreyas/gemma-2-2b-it-toxic-100

Updated 16 days ago • 19

updated 2 models 17 days ago

jena-shreyas/gemma-2-2b-it-peft-dare

Text Generation • Updated 17 days ago • 4

jena-shreyas/gemma-2-2b-it-sft-dare

Text Generation • Updated 17 days ago • 9 • 1

published 2 models 17 days ago

jena-shreyas/gemma-2-2b-it-sft-dare

Text Generation • Updated 17 days ago • 9 • 1

jena-shreyas/gemma-2-2b-it-peft-dare

Text Generation • Updated 17 days ago • 4

updated a model 18 days ago

jena-shreyas/gemma-2-2b-it-peft-code-alpaca

Updated 18 days ago • 26

published a model 18 days ago

jena-shreyas/gemma-2-2b-it-peft-code-alpaca

Updated 18 days ago • 26

updated a model 18 days ago

jena-shreyas/gemma-2-2b-it-sft-code-alpaca

Text Generation • Updated 18 days ago • 45

published a model 18 days ago

jena-shreyas/gemma-2-2b-it-sft-code-alpaca

Text Generation • Updated 18 days ago • 45

updated a model 2 months ago

jena-shreyas/flux-lora-wheels

Text-to-Image • Updated Feb 11 • 12

published a model 2 months ago

jena-shreyas/flux-lora-wheels

Text-to-Image • Updated Feb 11 • 12

reacted to Kseniase's post with 👍 3 months ago

Post

2070

10 Recent Advancements in Math Reasoning

Over the last few weeks, we have witnessed a surge in AI models' math reasoning capabilities. Top companies like Microsoft, NVIDIA, and Alibaba Qwen have already joined this race to make models "smarter" in mathematics. But why is this shift happening now?

Complex math calculations require advanced multi-step reasoning, making mathematics an ideal domain for demonstrating a model's strong "thinking" capabilities. Additionally, as AI continues to evolve and is applied in math-intensive fields such as machine learning and quantum computing (which is predicted to see significant growth in 2025), it must meet the demands of complex reasoning.
Moreover, AI models can be integrated with external tools like symbolic solvers or computational engines to tackle large-scale math problems, which also needs high-quality math reasoning.

So here’s a list of 10 recent advancements in math reasoning of AI models:

1. NVIDIA: AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling (2412.15084)

2. Qwen, Alibaba: Qwen2.5-Math-PRM The Lessons of Developing Process Reward Models in Mathematical Reasoning (2501.07301) and PROCESSBENCH evaluation ProcessBench: Identifying Process Errors in Mathematical Reasoning (2412.06559)

3. Microsoft Research: rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking (2501.04519)

4. BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning (2501.03226)

5. URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics (2501.04686)

6. U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMs (2412.03205)

7. Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs (2501.06430)

8. End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach (2501.04425)

9. Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning (2501.03035)

10. System-2 Mathematical Reasoning via Enriched Instruction Tuning (2412.16964)

upvoted a collection 3 months ago

Emu3

Collection

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 70

liked a Space 4 months ago

12.9k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

upvoted a paper 6 months ago

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Paper • 2408.02442 • Published Aug 5, 2024 • 21

upvoted a paper 7 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19, 2024 • 26

updated 2 models 7 months ago

jena-shreyas/paligemma_vqav2_full_ft_high_rank

Updated Sep 15, 2024

jena-shreyas/florence_ft

Text Generation • Updated Sep 15, 2024