一万篇论文笔记

10Kpapers

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

deepseek-ai/DeepSeek-V2-Chat

liked a model about 1 month ago

mistralai/Mixtral-8x7B-v0.1

liked a model about 1 month ago

microsoft/Phi-4-multimodal-instruct

Organizations

None yet

10Kpapers's activity

upvoted an article about 1 month ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 84

upvoted a collection about 2 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 24 days ago • 78

upvoted an article 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted 2 collections 2 months ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 24 days ago • 96

DeepSeek-Math

DeepSeek Math series • 4 items • Updated Aug 16, 2024 • 20

upvoted 3 collections 3 months ago

DeepSeek-V2

8 items • Updated Jan 3 • 28

DeepSeek-MoE

DeepSeek MoE series • 3 items • Updated Aug 16, 2024 • 13

Qwen2

Qwen2 language models, pretrained and instruction-tuned, in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 361