See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Recent Activity
updated
a dataset
3 minutes ago
ZhangShenao/new-Mistral-7B-Instruct-v0.2-gsm
updated
a model
about 1 hour ago
ZhangShenao/math_metamath-gemma-1.1-7b-it-e-iter-1_sample_1000_tp
updated
a model
about 9 hours ago
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-m-iter-1_sample_7000_tp
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 69 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 31 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 33 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 15
models
62
ZhangShenao/math_metamath-gemma-1.1-7b-it-e-iter-1_sample_1000_tp
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-m-iter-1_sample_7000_tp
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-e-iter-1_sample_7000_tp
Updated
ZhangShenao/Mistral-7B-Instruct-v0.2-e-iter-1_gsm
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-m-iter-1_sample_1000_tp
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-e-iter-1_sample_1000_tp
Updated
ZhangShenao/math_metamath-Meta-Llama-3-8B-Instruct-m-iter-1_sample_1000_tp
Updated
•
2
ZhangShenao/math_metamath-Meta-Llama-3-8B-Instruct-e-iter-1_sample_1000_tp
Updated
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-sft-sample_7000_tp
Updated
•
2
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-m-iter-1_sample_7000_tp
Updated
•
2
datasets
37
ZhangShenao/new-Mistral-7B-Instruct-v0.2-gsm
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-iter1_sample_7000_tp
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-iter1_sample_1000_tp
Viewer
•
Updated
•
1k
ZhangShenao/math_metamath-Meta-Llama-3-8B-Instruct-iter1_sample_1000_tp
Viewer
•
Updated
•
1k
•
1
ZhangShenao/sft-math_gsm-Meta-Llama-3-8B-Instruct-iter_sample_7000_tp
Viewer
•
Updated
•
7k
•
1
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-iter1_sample_7000_tp
Viewer
•
Updated
•
7k
•
1
ZhangShenao/sft-math_metamath-Meta-Llama-3-8B-Instruct-iter_sample_1000_tp
Viewer
•
Updated
•
1k
•
2
ZhangShenao/sft-math_metamath-Meta-Llama-3-8B-Instruct-iter_sample_7000_tp
Viewer
•
Updated
•
7k
•
1
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-iter1_sample_1000_tp
Viewer
•
Updated
•
1k
•
2
ZhangShenao/new-Mistral-7B-Instruct-v0.2-iter1_sample_1000_tp
Viewer
•
Updated
•
1k
•
2