Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
zhou-xl
/
xpo-lla-3-8b-instruct
like
0
PyTorch
princeton-nlp/llama3-ultrafeedback
llama
License:
mit
Model card
Files
Files and versions
Community
Train
zhou-xl
commited on
Dec 25, 2024
Commit
8246343
·
verified
·
1 Parent(s):
a3b5fff
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+2
-2
README.md
CHANGED
Viewed
@@ -7,5 +7,5 @@ license: mit
7
---
8
9
10
-
a simpo-like DPO method
11
-
AlpacaEval:44.8
7
---
8
9
10
+
a simpo-like DPO method
, trained on simpo data
11
+
AlpacaEval:44.8
(+2)