arxiv:2410.10074
Jiaxin Huang
teapot123
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 months ago
Taming Overconfidence in LLMs: Reward Calibration in RLHF
commented on
a paper
3 months ago
Taming Overconfidence in LLMs: Reward Calibration in RLHF
Organizations
models
None public yet
datasets
None public yet