2 37 62

Chao Zhou

ASHIDAKA

AI & ML interests

Object Detection, Transformer

Recent Activity

liked a dataset 17 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1

upvoted a paper about 1 month ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

upvoted a paper about 1 month ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

View all activity

Organizations

None yet

ASHIDAKA's activity

liked a dataset 17 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 18 days ago • 15.2M • 11.6k • 315

upvoted 3 papers about 1 month ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 139

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 99

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 178

liked a dataset about 2 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 47.1k • 535

upvoted an article 2 months ago

Article

How to train a Language Model with Megatron-LM

Sep 7, 2022

• 9

liked a model 3 months ago

facebook/multi-token-prediction

Updated Jun 18, 2024 • 368

liked a dataset 3 months ago

allenai/dolma

Updated Apr 17, 2024 • 1.08k • 893

upvoted a collection 4 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 23 days ago • 78

upvoted 2 papers 6 months ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published Oct 17, 2024 • 37

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 96

liked a model 6 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated Oct 25, 2024 • 168k • • 2.03k

upvoted a paper 6 months ago

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 65

liked a Space 6 months ago

282

Zero123++ Demo Space

🌒

upvoted a paper 6 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

upvoted a paper 7 months ago

Diffusion Policy Policy Optimization

Paper • 2409.00588 • Published Sep 1, 2024 • 20

liked a model 7 months ago

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 57.7k • 273