Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2503.14456

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 27
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 13
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 43
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 22

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24, 2024 • 28
MoDE: CLIP Data Experts via Clustering

Paper • 2404.16030 • Published Apr 24, 2024 • 14
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Paper • 2405.12130 • Published May 20, 2024 • 50
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Paper • 2405.12981 • Published May 21, 2024 • 32

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 13 days ago • 131

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 26 days ago • 221
Transformers without Normalization

Paper • 2503.10622 • Published 18 days ago • 143
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 13 days ago • 131
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published 17 days ago • 126

about 4 hours ago

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published about 1 month ago • 126
Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 24 days ago • 111
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 24 days ago • 43
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 24 days ago • 25

RWKV-7 Goose related resources.

Goose-World/RWKV-World-v3

Viewer • Updated 12 days ago • 1.1M • 827 • 1
BlinkDL/rwkv-7-world

Text Generation • Updated Feb 10 • 89
BlinkDL/rwkv-7-pile

Updated Dec 19, 2024 • 15
Sleeping

2

2

RWKV 7

🌏

best foundation model for its size !

Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM

Paper • 2502.06635 • Published Feb 10 • 4
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 215
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 98
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 282

interesting papers

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 130
Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6 • 22
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 46
LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published Feb 12 • 28

Daily Research Papers

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 59
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 13 days ago • 131
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published 12 days ago • 44
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published 12 days ago • 43

fla-hub/rwkv7-2.9B-world

Text Generation • Updated 11 days ago • 305 • 4
fla-hub/rwkv7-1.5B-world

Text Generation • Updated 11 days ago • 300 • 9
fla-hub/rwkv7-191M-world

Text Generation • Updated 11 days ago • 317 • 1
fla-hub/rwkv7-168M-pile

Text Generation • Updated 11 days ago • 54 • 5

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs