Nguyen Van Thanh's picture

3914

Nguyen Van Thanh

NguyenVanThanhHust

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration

upvoted a paper 2 days ago

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

upvoted a paper 2 days ago

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

View all activity

Organizations

None yet

NguyenVanThanhHust's activity

upvoted 20 papers 2 days ago

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration

Paper • 2502.17110 • Published Feb 24 • 13

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

Paper • 2502.17055 • Published Feb 24 • 18

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Paper • 2502.16033 • Published Feb 22 • 18

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 26

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 29

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27

GCC: Generative Color Constancy via Diffusing a Color Checker

Paper • 2502.17435 • Published Feb 24 • 28

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23 • 37

Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

Paper • 2503.09419 • Published Mar 12 • 6

Self-Taught Self-Correction for Small Language Models

Paper • 2503.08681 • Published Mar 11 • 14

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Paper • 2503.09402 • Published Mar 12 • 7

WildIFEval: Instruction Following in the Wild

Paper • 2503.06573 • Published Mar 9 • 12

Quantizing Large Language Models for Code Generation: A Differentiated Replication

Paper • 2503.07103 • Published Mar 10 • 8

More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG

Paper • 2503.04388 • Published Mar 6 • 16

Motion Anything: Any to Motion Generation

Paper • 2503.06955 • Published Mar 10 • 32

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

Paper • 2503.09601 • Published Mar 12 • 15

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11 • 17

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

Paper • 2503.09151 • Published Mar 12 • 32

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 71

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published Mar 12 • 45