Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data
-
cluebbers/Llama-3.1-8B-paraphrase-type-generation-apty-ipo
Text Generation • Updated • 5 -
cluebbers/Llama-3.1-8B-paraphrase-type-generation-etpc
Text Generation • Updated • 3 -
cluebbers/Llama-3.1-8B-paraphrase-type-generation-etpc-apty-reward
Updated • 1 -
cluebbers/Llama-3.1-8B-paraphrase-type-generation-apty-sigmoid
Text Generation • Updated • 7