Jiaxin Huang

teapot123

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

upvoted a paper 3 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

commented on a paper 3 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

teapot123's activity

upvoted a paper 4 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 6 days ago • 35

upvoted a paper 3 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Paper • 2410.09724 • Published Oct 13, 2024 • 2