zuijiang

AI & ML interests

None yet

Recent Activity

liked a Space 25 days ago

nanotron/ultrascale-playbook

zuijiang's activity

liked a Space 25 days ago

2.3k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper about 1 month ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3 • 24

upvoted a paper 2 months ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18

commented on a paper 2 months ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18

authored a paper 4 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20

upvoted a paper 4 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20

updated 2 datasets 7 months ago

zuijiang/alpaca-alpaca-clean

Viewer • Updated Aug 26, 2024 • 51.8k • 62

zuijiang/mistral-alpaca-clean

Viewer • Updated Aug 25, 2024 • 51.8k • 49

liked a dataset 8 months ago

AIcell/MOSSBench

Updated 14 days ago • 1.15k • 4

liked a Space 9 months ago

1.86k

Voice Clone

🗣

Clone voice to say text

updated a model 9 months ago

zuijiang/llava-qwen1.5-14B-chat

Text2Text Generation • Updated Jul 1, 2024 • 8

updated a dataset 10 months ago

zuijiang/ocr_vqa

Viewer • Updated May 30, 2024 • 208k • 134

liked a dataset 11 months ago

danielz01/laion-5b

Updated Feb 14, 2024 • 20

upvoted 2 papers over 1 year ago

RAIN: Your Language Models Can Align Themselves without Finetuning

Paper • 2309.07124 • Published Sep 13, 2023 • 3

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 48