Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mayankagarwal
's Collections
RLHF + Code
RLHF + Code
updated
Nov 22, 2024
Upvote
-
Vezora/Code-Preference-Pairs
Viewer
•
Updated
Jul 28, 2024
•
54k
•
59
•
18
quangduc1112001/python-code-DPO-fine-tune
Viewer
•
Updated
Nov 4, 2024
•
2k
•
46
•
2
xinlai/Math-Step-DPO-10K
Viewer
•
Updated
Jul 4, 2024
•
10.8k
•
702
•
46
minfeng-ai/leetcode_preference
Viewer
•
Updated
Sep 6, 2023
•
457
•
21
•
6
Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1
Viewer
•
Updated
Aug 22, 2024
•
100k
•
105
•
5
openbmb/UltraInteract_pair
Viewer
•
Updated
Apr 5, 2024
•
220k
•
368
•
106
NextWealth/Python-DPO-Large
Viewer
•
Updated
Jul 2, 2024
•
957
•
35
interstellarninja/tool-calls-dpo
Viewer
•
Updated
Jan 23, 2024
•
235
•
44
•
9
Upvote
-
Share collection
View history
Collection guide
Browse collections