RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Paper • 2501.08617 • Published 6 days ago • 10
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse Paper • 2410.21333 • Published Oct 27, 2024 • 10
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse Paper • 2410.21333 • Published Oct 27, 2024 • 10 • 2
Large Language Models Assume People are More Rational than We Really are Paper • 2406.17055 • Published Jun 24, 2024 • 4 • 4
Large Language Models Assume People are More Rational than We Really are Paper • 2406.17055 • Published Jun 24, 2024 • 4 • 4
Large Language Models Assume People are More Rational than We Really are Paper • 2406.17055 • Published Jun 24, 2024 • 4 • 4
Large Language Models Assume People are More Rational than We Really are Paper • 2406.17055 • Published Jun 24, 2024 • 4 • 4
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs Paper • 2307.10168 • Published Jul 19, 2023 • 10
ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing Paper • 2306.00622 • Published Jun 1, 2023 • 1