Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ContextualAI
/
Contextual_KTO_Mistral_PairRM
like
31
Follow
ContextualAI
65
Text Generation
Transformers
Safetensors
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
English
mistral
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.01306
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
d8380f4
Contextual_KTO_Mistral_PairRM
Commit History
Upload MistralForCausalLM
d8380f4
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
eb151d5
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
66b1fa9
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
e531f7b
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
0d81ff6
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
231bafb
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
dba0d32
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
257fdd0
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
c96f499
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
c47c194
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
0b9cba1
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
f652cba
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
cbf882a
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
45df619
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
2c7e3b1
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
8922fc2
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
927e33a
verified
Muennighoff
commited on
Mar 5, 2024
initial commit
4564abd
verified
xwinxu
commited on
Mar 5, 2024