Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated a dataset 4 days ago: hamishivi/GeneralThought-430K-filtered-thinker
updated a dataset 4 days ago: hamishivi/tulu-3-sft-t3-70b-thinker
models (34)
hamishivi/s1k_seq_orig_hyper__42__1740446762 • 163
hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt • 824
hamishivi/tulu-2-wildchat-326k-sft • 23
hamishivi/tulu-2-arena-hard-326k-sft • 26
hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft • 29
hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft • 15
hamishivi/tulu-2-multitask-rrmax-326k-sft • 45
hamishivi/qwen2_math_tokenizer_tweaked
hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350 • 4
hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021 • 3
datasets (43)
hamishivi/GeneralThought-430K-filtered-thinker • 296k • 45
hamishivi/tulu-3-sft-t3-70b-thinker • 932k • 66
hamishivi/SimpleQA-RLVR • 4.33k • 93
hamishivi/GPQA-RLVR • 546 • 51
hamishivi/GPQA-train-RLVR • 348 • 92
hamishivi/200k-tulu-2-unbalanced • 200k • 56
hamishivi/lsds_data • 136
hamishivi/rds-sels-tydiqa-shots-top326k • 326k • 110
hamishivi/rds-sels-squad-top326k • 326k • 121
hamishivi/rds-sels-mmlu-shots-top326k • 326k • 143