jiakai's picture

210 801

jiakai

real-jiakai

·

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

upvoted a paper about 21 hours ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

upvoted a collection about 21 hours ago

liked a model about 21 hours ago

OpenGVLab/InternVL3-78B

View all activity

Organizations

real-jiakai's activity

upvoted a paper about 21 hours ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 2 days ago • 204

upvoted a collection about 21 hours ago

InternVL3

20 items • Updated about 18 hours ago • 46

upvoted an article about 24 hours ago

Article

4M Models Scanned: Protect AI + Hugging Face 6 Months In

3 days ago

• 24

upvoted a collection 1 day ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated about 10 hours ago • 8

upvoted a paper 2 days ago

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published 6 days ago • 23

upvoted a collection 3 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 2 days ago • 92

upvoted a paper 3 days ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published 7 days ago • 66

upvoted 2 collections 3 days ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated 6 days ago • 74

Skywork-OR1

Skywork Open Reasoner 1 • 8 items • Updated 4 days ago • 20

upvoted 2 papers 5 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 7 days ago • 110

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 15 days ago • 73

upvoted an article 6 days ago

Article

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

8 days ago

• 19

upvoted 3 papers 7 days ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 8 days ago • 141

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published 9 days ago • 59

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 9 days ago • 41

upvoted 2 collections 7 days ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 5 days ago • 59

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 14 days ago • 117

upvoted a collection 8 days ago

Cogito v1 Preview

5 items • Updated 9 days ago • 98

upvoted a paper 8 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 9 days ago • 160

upvoted an article 9 days ago

Article

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

9 days ago

• 15