Commit History

fixes for dpo and orpo template loading (#1424)
7803f09
unverified

winglian commited on

ORPO (#1419)
2ea70eb
unverified

winglian commited on

add handling for argilla dpo-mix (#1397)
8a82d2e
unverified

winglian commited on

Support user-defined prompt processing strategies for dpo (#1248)
1e3d530
unverified

nopperl winglian commited on

precompute dpo logprobs setting and fixes (#1199) [skip ci]
33e1170
unverified

winglian commited on

DPO cleanup (#1126)
7523d1f
unverified

winglian plaguss HF staff commited on