Models fine-tuned for multiple-choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
Updated a dataset 12 days ago: ricdomolm/lawma-instructions_llama3_8k_songer
Published a dataset 12 days ago: ricdomolm/lawma-instructions_llama3_8k_songer
Liked a dataset 23 days ago: nvidia/OpenMathInstruct-2
Organizations
None yet
Collections: 1
Models: 69
ricdomolm/lawma-8b • Text Generation • 2.88k • 8
ricdomolm/smollm
ricdomolm/llama-3.2-1b-it-test
ricdomolm/pythia-1.4b-sft-gsm8k-3e • Text Generation • 16
ricdomolm/pythia-1.4b-sft-gsm8k-1e • Text Generation • 16
ricdomolm/ml4331-reward-model • Text Generation • 10
ricdomolm/ml4331-reward-model2 • Text Generation • 5
ricdomolm/ml4331-dpo-model • Text Generation • 6
ricdomolm/ml4331-instruction-model • Text Generation • 9
ricdomolm/test-model
Datasets: 30
ricdomolm/lawma-instructions_llama3_8k_songer • 442k • 52
ricdomolm/lawma-instructions_llama3_4k • 554k • 107
ricdomolm/lawma-instructions_pythia_2k • 554k • 112
ricdomolm/lawma-instructions_llama3_2k • 554k • 91
ricdomolm/lawma-instructions_llama3_1k • 554k • 92
ricdomolm/caselawqa-subtasks-8k • 90.9k • 1.61k
ricdomolm/caselawqa-8k • 22k • 348 • 3
ricdomolm/caselawqa-8k-all • 11k • 182
ricdomolm/Big-Math-RL-Verified-Solve-Rate-0.5 • 135k • 67
ricdomolm/orz_math_57k • 56.9k • 57