nDimensional

AI & ML interests

Computer Vision, Diffusers, Transformers, PyTorch, JAX, Triton

Recent Activity

liked a model 15 days ago

google/diffusiongemma-26B-A4B-it

liked a model 15 days ago

Photoroom/prxpixel-t2i

updated a collection 26 days ago

Agent Collaborations

Organizations

upvoted a paper about 2 months ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 116

upvoted an article 3 months ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Article • merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 910

upvoted a collection 3 months ago

Gemma 4

Collection • 15 items • Updated 21 days ago • 1k

upvoted a paper 3 months ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 62

upvoted 3 papers 4 months ago

upvoted 7 papers 5 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 201

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 154

Closing the Loop: Universal Repository Representation with RPG-Encoder

Paper • 2602.02084 • Published Feb 2 • 85

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 275

upvoted 6 papers 6 months ago

Controlled Self-Evolution for Algorithmic Code Optimization

Paper • 2601.07348 • Published Jan 12 • 115

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 234

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Paper • 2601.02204 • Published Jan 5 • 64

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published Dec 18, 2025 • 121

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 91

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 174
