4 25 11

Haokun Lin

Felix1023

https://felixmessi.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both

liked a dataset 4 days ago

erinxia/MedVersa

upvoted a paper 20 days ago

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

View all activity

Organizations

upvoted a paper 3 days ago

ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both

Paper • 2605.15198 • Published 4 days ago • 17

liked a dataset 4 days ago

erinxia/MedVersa

Viewer • Updated 4 days ago • 6k • 79 • 2

upvoted a paper 20 days ago

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Paper • 2604.23775 • Published 22 days ago • 45

upvoted a paper about 1 month ago

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Paper • 2604.04911 • Published Apr 6 • 36

upvoted a paper about 2 months ago

Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR

Paper • 2603.24840 • Published Mar 25 • 2

commented a paper 2 months ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published Feb 23 • 16 •

liked a model 2 months ago

TencentARC/CubeComposer

Video-to-Video • Updated Mar 5 • 97 • 20

upvoted a paper 2 months ago

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Paper • 2603.04291 • Published Mar 4 • 15

upvoted an article 3 months ago

Article

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

RakshitAralimatti

•

Aug 8, 2025

• 35

submitted a paper to Daily Papers 3 months ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published Feb 23 • 16

upvoted 2 papers 3 months ago

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published Feb 23 • 16

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 43

upvoted a paper 5 months ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Paper • 2512.14698 • Published Dec 16, 2025 • 22

authored a paper 7 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5, 2025 • 2

upvoted 2 papers 7 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5, 2025 • 2

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 30

liked 2 models 7 months ago

ByteDance/Video-As-Prompt-CogVideoX-5B

Image-to-Video • Updated Oct 27, 2025 • 152 • 23

ByteDance/Video-As-Prompt-Wan2.1-14B

Image-to-Video • Updated Oct 27, 2025 • 152 • 48

upvoted a collection 7 months ago

Video-As-Prompt

Collection

The model zoo for "Video-As-Prompt: Unified Semantic Control for Video Generation" • 3 items • Updated Oct 27, 2025 • 14

liked a dataset 7 months ago

BianYx/VAP-Data

Viewer • Updated Oct 30, 2025 • 90.1k • 808 • 29

Haokun Lin

AI & ML interests

Recent Activity

Organizations

Felix1023's activity

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware