Data, embedding, and index of MassiveDS by "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore"
Rulin Shao
rulins
AI & ML interests
None yet
Recent Activity
updated
a dataset
2 days ago
rulins/SimpleQA-synthetic-datastore-Llama3.3-70B-Instruct
published
a dataset
2 days ago
rulins/SimpleQA-synthetic-datastore-Llama3.3-70B-Instruct
updated
a dataset
2 days ago
rulins/hotpotqa_query_rewriting_sft_data_it0_of5_llama3.2_3b_10fs_dspy
Organizations
Collections
1
datasets
31
rulins/SimpleQA-synthetic-datastore-Llama3.3-70B-Instruct
Viewer
•
Updated
•
4.33k
•
12
rulins/hotpotqa_query_rewriting_sft_data_it0_of5_llama3.2_3b_10fs_dspy
Viewer
•
Updated
•
30.4k
•
18
rulins/hotpotqa_query_preference_data_it0_of5_llama3.2_3b_10fs_dspy
Updated
•
8
rulins/dpr_wiki_nq_open_searched_results
Viewer
•
Updated
•
3.61k
•
31
rulins/reasonir-vl-short
Viewer
•
Updated
•
210k
•
23
rulins/reasonir-vl-long
Viewer
•
Updated
•
35.3k
•
23
rulins/gpqa_reasoning_queries_retrieved_results_contriever_rerank
Viewer
•
Updated
•
198
•
22
rulins/gpqa_original_queries_retrieved_results_reasonir
Viewer
•
Updated
•
198
•
29
rulins/gpqa_original_queries
Viewer
•
Updated
•
198
•
23
rulins/massiveds_gpqa_qwen2.5_7b_it_reasoning_queries_searched_results
Viewer
•
Updated
•
198
•
28