berkeley-nest/Starling-RM-7B-alpha
Transformers
PyTorch
berkeley-nest/Nectar
English
llama
reward model
RLHF
RLAIF
text-generation-inference
Inference Endpoints
arxiv: 2203.02155
arxiv: 2301.11270
License: apache-2.0
Commit History
5a58bd5 fix link (evanfrick, committed on Nov 27, 2023)
a55a459 fix link (evanfrick, committed on Nov 27, 2023)
8a237bb Update README.md (banghua, committed on Nov 27, 2023)
d0844f8 Update README.md (banghua, committed on Nov 27, 2023)
687b937 Update README.md (banghua, committed on Nov 27, 2023)
73a3756 Update README.md (banghua, committed on Nov 27, 2023)
b5fc156 Update README.md (banghua, committed on Nov 27, 2023)
36eccc9 Update README.md (banghua, committed on Nov 27, 2023)
9dcf3eb Update README.md (banghua, committed on Nov 27, 2023)
de54e9f Delete global_step1400 (banghua, committed on Nov 26, 2023)
126c676 Create README.md (banghua, committed on Nov 26, 2023)
6f8f5dc Duplicate from banghua/n_rm (banghua, committed on Nov 25, 2023)