Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2406.04093

Self-Refine: Iterative Refinement with Self-Feedback

Paper • 2303.17651 • Published Mar 30, 2023 • 2
Gorilla: Large Language Model Connected with Massive APIs

Paper • 2305.15334 • Published May 24, 2023 • 5
PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 57
Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 16

Papers - Training - Sparse Learning - k-Sparse Autoencoder

k-Sparse Autoencoders

Paper • 1312.5663 • Published Dec 19, 2013 • 1
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3

Papers - Text - SAE - Sparse Autoencoders

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 9
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 38
Disentangling Dense Embeddings with Sparse Autoencoders

Paper • 2408.00657 • Published Aug 1, 2024 • 1

Papers - Training - Scaling Properties

Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10
Scaling Laws for Precision

Paper • 2411.04330 • Published Nov 7, 2024 • 7
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3

mechanistic interpretability with sparse autoencoders

A collection of papers that I found useful for learning about using Sparse Autoencoders for finding interpretable features in language models

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 13
Scaling and evaluating sparse autoencoders

Paper • 2406.04093 • Published Jun 6, 2024 • 3
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Paper • 2403.19647 • Published Mar 28, 2024 • 3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 38

Papers - Training

SELF: Language-Driven Self-Evolution for Large Language Model

Paper • 2310.00533 • Published Oct 1, 2023 • 2
GrowLength: Accelerating LLMs Pretraining by Progressively Growing Training Length

Paper • 2310.00576 • Published Oct 1, 2023 • 2
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

Paper • 2305.13169 • Published May 22, 2023 • 3
Transformers Can Achieve Length Generalization But Not Robustly

Paper • 2402.09371 • Published Feb 14, 2024 • 13

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs