yunconglong/7Bx4_DPO
Text Generation
Transformers
Safetensors
mixtral
text-generation-inference
Inference Endpoints
License: mit
yunconglong committed on Jan 20, 2024
Commit 6fb1fdb · verified · 1 parent: bb59d3d
Create README.md
Files changed (1)
README.md (added, +6 -0)
@@ -0,0 +1,6 @@
+---
+license: mit
+---
+
+
+* [DPO Trainer](https://huggingface.co/docs/trl/main/en/dpo_trainer) by jondurbin/truthy-dpo-v0.1 first 50 cases
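
The README line above says only that DPO training used the first 50 cases of jondurbin/truthy-dpo-v0.1. As a rough illustration of what that implies, the sketch below takes the first 50 records of any iterable and computes the standard per-example DPO loss from summed log-probabilities. It is not the author's training script: `first_n_cases`, `dpo_loss`, and the default `beta=0.1` are hypothetical names and values chosen for illustration, and no real dataset or model is loaded.

```python
import math
from itertools import islice


def first_n_cases(records, n=50):
    """Return the first n preference records (hypothetical helper)."""
    return list(islice(records, n))


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    Each argument is the summed log-probability of a full response
    under the policy or the frozen reference model. beta=0.1 is an
    assumed value, not one taken from the commit.
    """
    # How much more the policy prefers "chosen" over "rejected",
    # relative to the reference model.
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    # -log(sigmoid(z)) equals softplus(-z); this form avoids overflow.
    x = -beta * margin
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))
```

When the policy and the reference agree exactly, the margin is zero and the loss equals log 2; the loss drops below that as the policy favors the chosen response more strongly than the reference does.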