zhaolulu
zll666
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
13 days ago
B-STaR: Monitoring and Balancing Exploration and Exploitation in
Self-Taught Reasoners
Organizations
models
None public yet
datasets
None public yet