Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jikaixuan
/
zephyr-7b
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
8cd461b
zephyr-7b
/
train_results.json
Commit History
Model save
8cd461b
verified
jikaixuan
commited on
Mar 30, 2024
Model save
346a175
verified
jikaixuan
commited on
Mar 28, 2024
Model save
3ec4a5c
verified
jikaixuan
commited on
Mar 28, 2024
Model save
6b1b603
verified
jikaixuan
commited on
Mar 21, 2024
Model save
16bc4cf
verified
jikaixuan
commited on
Mar 20, 2024