a simpo-like DPO method, trained on simpo data AlpacaEval:44.8(+2)

Downloads last month
12
Inference API
Unable to determine this model's library. Check the docs .

Model tree for zhou-xl/xpo-lla-3-8b-instruct

Finetuned
(523)
this model

Dataset used to train zhou-xl/xpo-lla-3-8b-instruct