Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
allenai
/
tulu-v2.5-ppo-13b-hh-rlhf-60k
like
0
Follow
Ai2
1.94k
Text Generation
Transformers
Safetensors
allenai/tulu-2.5-preference-data
allenai/tulu-v2-sft-mixture
English
llama
conversational
text-generation-inference
Inference Endpoints
arxiv:
2406.09279
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
9172f4e
tulu-v2.5-ppo-13b-hh-rlhf-60k
Commit History
Update tokenizer_config.json
9172f4e
verified
hamishivi
commited on
Jun 12, 2024
Update README.md
49ec259
verified
hamishivi
commited on
Jun 12, 2024
Update config.json
fda9a16
verified
hamishivi
commited on
Jun 12, 2024
Create README.md
b72b74e
verified
hamishivi
commited on
Jun 12, 2024
Upload folder using huggingface_hub
370721a
verified
hamishivi
commited on
Jun 11, 2024
initial commit
f1deeb0
verified
hamishivi
commited on
Jun 11, 2024