Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated
a model
8 days ago
ricdomolm/ml4331-reward-model
updated
a model
8 days ago
ricdomolm/ml4331-reward-model2
updated
a model
8 days ago
ricdomolm/ml4331-dpo-model
Organizations
None yet
Collections
1
models
65
ricdomolm/ml4331-reward-model
Text Generation
•
Updated
•
97
ricdomolm/ml4331-reward-model2
Text Generation
•
Updated
•
4
ricdomolm/ml4331-dpo-model
Text Generation
•
Updated
•
76
ricdomolm/ml4331-instruction-model
Text Generation
•
Updated
•
164
ricdomolm/test-model
Updated
ricdomolm/SmolLM2-135M-SFT-Alpaca
Updated
ricdomolm/reward-model-exercise
Updated
ricdomolm/lawma-8b
Text Generation
•
Updated
•
1.94k
•
6
ricdomolm/ttt-mc-ziya2-13b-base
Updated
•
2
ricdomolm/ttt-mc-yi-6b
Updated
•
4
datasets
15
ricdomolm/caselawqa_leaderboard_results
Updated
•
1.02k
ricdomolm/caselawqa_leaderboard_requests
Viewer
•
Updated
•
29
•
975
ricdomolm/lawma-instructions_gemma2_8k
Viewer
•
Updated
•
554k
•
41
ricdomolm/lawma-instructions_llama3_16k
Viewer
•
Updated
•
554k
•
31
ricdomolm/lawma-instructions_llama3_8k
Viewer
•
Updated
•
554k
•
33
ricdomolm/lawma-instructions
Viewer
•
Updated
•
554k
•
33
ricdomolm/lawma-tasks
Viewer
•
Updated
•
692k
•
495
•
2
ricdomolm/lawma-task-files
Updated
•
32
ricdomolm/caselawqa-8k
Viewer
•
Updated
•
16.1k
•
34
•
2
ricdomolm/lawma-all-tasks
Viewer
•
Updated
•
575k
•
63