Jiaxin Huang's picture

1 2

Jiaxin Huang

teapot123

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

upvoted a paper 3 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

commented on a paper 3 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

View all activity

Organizations

Papers 5

arxiv:2410.10074

arxiv:2410.09724

arxiv:2405.04086

arxiv:2211.03044

models

None public yet

datasets

None public yet