tc lin's picture

17 198

tc lin

stuser2023

·

https://github.com/stuser

stuser

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

liked a model 17 days ago

reducto/RolmOCR

liked a model 21 days ago

rasbt/llama-3.2-from-scratch

View all activity

Organizations

None yet

stuser2023's activity

upvoted an article about 2 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 73

upvoted a collection about 2 months ago

PaliGemma 2 Mix

13 items • Updated 19 days ago • 60

upvoted a collection 2 months ago

Breeze 2 Family

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated Feb 26 • 18

upvoted 2 collections 5 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated 8 days ago • 40

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 76

upvoted a collection 6 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 253

upvoted a paper 9 months ago

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 52

upvoted a collection 9 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 22 days ago • 223

upvoted an article 9 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 356

upvoted a collection 10 months ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 8 days ago • 162

upvoted a collection 12 months ago

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 8 days ago • 44

upvoted an article about 1 year ago

Article

CodeGemma - an official Google release for code LLMs

Apr 9, 2024

• 101

upvoted a paper about 1 year ago

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21, 2024 • 48

upvoted a collection about 1 year ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated 19 days ago • 331

upvoted 2 papers over 1 year ago

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Paper • 2309.16058 • Published Sep 27, 2023 • 55

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Paper • 2308.03526 • Published Aug 7, 2023 • 26