arxiv:2404.09937
Junxian He
jxhe
AI & ML interests: None yet
Recent Activity
liked a model 11 days ago: deepseek-ai/DeepSeek-V3
upvoted a paper 13 days ago: Diving into Self-Evolving Training for Multimodal Reasoning
upvoted a paper 13 days ago: B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
models: None public yet
datasets: None public yet