hlzhang109

AI & ML interests

None yet

Recent Activity

liked a Space 22 days ago

nanotron/ultrascale-playbook

authored a paper 3 months ago

Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark

authored a paper 3 months ago

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Organizations

hlzhang109's activity

liked a Space 22 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

authored 5 papers 3 months ago

Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark

Paper • 2304.03279 • Published Apr 6, 2023 • 1

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Paper • 2406.10670 • Published Jun 15, 2024 • 4

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 50

Eliminating Position Bias of Language Models: A Mechanistic Approach

Paper • 2407.01100 • Published Jul 1, 2024 • 8

Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Paper • 2412.02674 • Published Dec 3, 2024

New activity in hlzhang109/CoLoR-filter 9 months ago

Update README.md

#2 opened 9 months ago by davidbrandfonbrener

updated a model 9 months ago

hlzhang109/CoLoR-filter

Updated Jun 15, 2024

New activity in hlzhang109/CoLoR-filter 9 months ago

Create README.md

#1 opened 9 months ago by davidbrandfonbrener

liked a dataset almost 2 years ago

togethercomputer/RedPajama-Data-1T

Viewer • Updated Jun 17, 2024 • 1.73M • 1.9k • 1.08k