Monet: Mixture of Monosemantic Experts for Transformers Paper • 2412.04139 • Published 20 days ago • 10
Inference-Time Intervention (ITI) Models Collection • A collection of Llama models with Inference-Time Intervention (Li et al.) applied to them. Codebase: https://github.com/likenneth/honest_llama • 6 items • Updated Aug 24 • 3
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21 • 29
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19 • 47
ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction? Paper • 2411.06469 • Published Nov 10 • 17
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11 • 34
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published Nov 6 • 30
Survey of Cultural Awareness in Language Models: Text and Beyond Paper • 2411.00860 • Published Oct 30 • 23
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper • 2410.18533 • Published Oct 24 • 42
MedMobile: A mobile-sized language model with expert-level clinical capabilities Paper • 2410.09019 • Published Oct 11 • 8
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model Paper • 2410.13639 • Published Oct 17 • 16
BenTo: Benchmark Task Reduction with In-Context Transferability Paper • 2410.13804 • Published Oct 17 • 19
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts Paper • 2410.10626 • Published Oct 14 • 37
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning Paper • 2410.09754 • Published Oct 13 • 7
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14 • 17
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains Paper • 2410.09870 • Published Oct 13 • 7