Joshua Chris

KrisKale45

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

unsloth/Llama-4-Scout-17B-16E-Instruct

liked a model 23 days ago

erax-ai/EraX-WoW-Turbo-V1.0

upvoted an article 24 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Organizations

None yet

KrisKale45's activity

upvoted an article 24 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

26 days ago

• 373

upvoted 3 collections 2 months ago

upvoted an article 2 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 835

upvoted a collection 2 months ago

Llama 3.2

Collection

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated 1 day ago • 60

upvoted 3 papers 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published Dec 11, 2024 • 18

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 37

upvoted an article 6 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 187

upvoted 2 papers 7 months ago

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Paper • 2409.12139 • Published Sep 18, 2024 • 12

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50

upvoted 2 papers 8 months ago

FocusLLM: Scaling LLM's Context by Parallel Decoding

Paper • 2408.11745 • Published Aug 21, 2024 • 25

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 52

upvoted an article 8 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12, 2024

• 110