15 623 258

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper 2 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

liked a model 2 days ago

microsoft/MAI-DS-R1

upvoted a paper 3 days ago

BitNet b1.58 2B4T Technical Report

View all activity

Organizations

taufiqdp's activity

upvoted a paper 2 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 3 days ago • 80

liked a model 2 days ago

microsoft/MAI-DS-R1

Updated 3 days ago • 153 • 144

upvoted a paper 3 days ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 4 days ago • 53

liked a model 3 days ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated about 5 hours ago • 9.85k • 496

updated a Space 3 days ago

FLUX

🖼

Generate images from text prompts

upvoted 4 papers 4 days ago

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

Paper • 2504.10462 • Published 6 days ago • 13

liked a dataset 5 days ago

openai/mrcr

Viewer • Updated 6 days ago • 2.4k • 1.95k • 109

upvoted a paper 5 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 13 days ago • 112

upvoted 2 papers 6 days ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published 9 days ago • 37

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 9 days ago • 117

liked a model 8 days ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • Updated about 5 hours ago • 28.4k • 367

upvoted 2 papers 9 days ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 10 days ago • 25

Kimi-VL Technical Report

Paper • 2504.07491 • Published 10 days ago • 112

upvoted a paper 10 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 12 days ago • 70

upvoted a collection 11 days ago

Cogito v1 Preview

Collection

5 items • Updated 12 days ago • 101

upvoted a paper 11 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 13 days ago • 162

upvoted a paper 12 days ago

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Paper • 2504.04823 • Published 13 days ago • 29