Eni Grand's picture

Eni Grand

Enigrand

·

AI & ML interests

None yet

Recent Activity

liked a model about 6 hours ago

nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct

liked a model about 7 hours ago

nari-labs/Dia-1.6B

liked a model 2 days ago

MrDragonFox/mOrpheus_3B-1Base_early_preview

View all activity

Organizations

Enigrand's activity

upvoted a paper 3 days ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 11 days ago • 53

upvoted a paper 4 days ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 12 days ago • 27

upvoted a collection 4 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 8 days ago • 104

upvoted a paper 4 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 8 days ago • 239

upvoted a paper 5 days ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 6 days ago • 62

upvoted a collection 11 days ago

InternVL3

34 items • Updated 3 days ago • 54

upvoted 3 papers 12 days ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published 12 days ago • 34

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published 18 days ago • 76

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Paper • 2504.06514 • Published 14 days ago • 39

upvoted a paper 13 days ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published 15 days ago • 80

upvoted a paper 14 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 15 days ago • 168

upvoted 2 papers 17 days ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published 19 days ago • 76

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published 27 days ago • 45

upvoted a collection 19 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 4 days ago • 157

upvoted a paper 19 days ago

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published 20 days ago • 36

upvoted a paper 20 days ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published 21 days ago • 64

upvoted a collection 20 days ago

VACE

VACE: All-in-One Video Creation and Editing • 5 items • Updated 16 days ago • 13

upvoted 2 papers 20 days ago

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Paper • 2502.06608 • Published Feb 10 • 41

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 46

upvoted a paper 21 days ago

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published 21 days ago • 35