Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
allenai
/
tulu-v2.5-dpo-13b-hh-rlhf
like
1
Follow
Ai2
1.87k
Text Generation
Transformers
Safetensors
allenai/tulu-2.5-preference-data
allenai/tulu-v2-sft-mixture
English
llama
conversational
text-generation-inference
Inference Endpoints
arxiv:
2406.09279
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
d821006
tulu-v2.5-dpo-13b-hh-rlhf
/
README.md
Commit History
Update README.md
d821006
verified
hamishivi
commited on
Jun 14, 2024
Update README.md
8a8c08e
verified
hamishivi
commited on
Jun 12, 2024
Create README.md
c4a2500
verified
hamishivi
commited on
Jun 12, 2024