Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
7
1
xz
mxz
Follow
0 followers
·
3 following
AI & ML interests
NLP ML RL
Organizations
None yet
models
4
Sort: Recently updated
mxz/llama3-8b-dpo
Text Generation
•
Updated
Jul 28, 2024
•
1
mxz/llama3-8b-ppo
Text Generation
•
Updated
Jul 28, 2024
•
1
mxz/llama3-8b-sft
Text Generation
•
Updated
Jul 28, 2024
•
1
mxz/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jul 17, 2024
datasets
4
Sort: Recently updated
mxz/awesome-dpo
Viewer
•
Updated
Jul 28, 2024
•
302k
•
35
mxz/CValues
Viewer
•
Updated
Jul 26, 2024
•
146k
•
33
mxz/CValues_DPO
Viewer
•
Updated
Jul 26, 2024
•
146k
•
32
mxz/alpaca_en_zh_ruozhiba_gpt4-data
Viewer
•
Updated
Jul 26, 2024
•
190k
•
38