Olivia S's picture

29 33

Olivia S

taygetea

·

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago

sphiratrioth666/Character_Generation_Templates

upvoted a paper about 1 month ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

upvoted a paper about 1 month ago

Almost Surely Safe Alignment of Large Language Models at Inference-Time

View all activity

Organizations

None yet

taygetea's activity

upvoted 20 papers about 1 month ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published Jan 31 • 7

Almost Surely Safe Alignment of Large Language Models at Inference-Time

Paper • 2502.01208 • Published Feb 3 • 11

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published Feb 3 • 7

Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations

Paper • 2501.19066 • Published Jan 31 • 12

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published Feb 4 • 15

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published Feb 4 • 22

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published Feb 3 • 33

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published Feb 6 • 13

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published Feb 6 • 11

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Paper • 2502.04306 • Published Feb 6 • 19

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 24

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6 • 31

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published Feb 6 • 35

SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs

Paper • 2502.02909 • Published Feb 5 • 2

Value-Based Deep RL Scales Predictably

Paper • 2502.04327 • Published Feb 6 • 6

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Paper • 2502.04350 • Published Feb 4 • 11

QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Paper • 2502.05178 • Published Feb 7 • 10

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Paper • 2502.03738 • Published Feb 6 • 11

CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference

Paper • 2502.04416 • Published Feb 6 • 12

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 18