Hugging Face
ayoubkirouane/Mistral-SLERP-Merged7B-DPO
Text Generation
PEFT
Safetensors
HuggingFaceH4/ultrafeedback_binarized
ayoubkirouane/Orca-Direct-Preference-Optimization
trl
dpo
unsloth
conversational
License: apache-2.0
9993da0
Mistral-SLERP-Merged7B-DPO/README.md
Commit History
Update README.md
9993da0
verified
ayoubkirouane
committed on
Jan 24, 2024
Update README.md
5ec86cf
verified
ayoubkirouane
committed on
Jan 24, 2024
ayoubkirouane/Mistral-SLERP-Merged7B-DPO
4907099
verified
ayoubkirouane
committed on
Jan 24, 2024