1 58 7

Swasti Sweker

Swekerr

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

s1: Simple test-time scaling

View all activity

Organizations

Swekerr's activity

upvoted a paper 5 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

upvoted an article 16 days ago

Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

Dec 16, 2024

• 124

upvoted an article 23 days ago

Article

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 21

upvoted a paper 28 days ago

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Paper • 2503.18908 • Published 29 days ago • 18

upvoted an article about 1 month ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 87

upvoted a paper 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 182

upvoted an article 2 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 988

upvoted a paper 2 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 49

upvoted an article 3 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 172

upvoted 2 papers 3 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 23

upvoted an article 3 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 235