DPO fine-tuned models Family, high performance
-
jpacifico/Chocolatine-14B-Instruct-DPO-v1.2
Text Generation • Updated • 7.23k • 14 -
jpacifico/Chocolatine-3B-Instruct-DPO-v1.2
Text Generation • Updated • 2.88k • 9 -
jpacifico/Chocolatine-3B-Instruct-DPO-Revised
Text Generation • Updated • 394 • 26 -
jpacifico/Chocolatine-Admin-3B-SFT-v0.3b
Text Generation • Updated • 129 • 4