osanseviero's Collections
Papers I've read
updated
Chain-of-Thought Reasoning Without Prompting
Paper • 2402.10200 • Published • 104
Large Language Models Cannot Self-Correct Reasoning Yet
Paper • 2310.01798 • Published • 33
Premise Order Matters in Reasoning with Large Language Models
Paper • 2402.08939 • Published • 27
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 15
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Paper • 2207.01206 • Published • 2
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs
Paper • 2406.11695 • Published • 1
Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together
Paper • 2407.10930 • Published
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Paper • 2405.15793 • Published • 2
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 68
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Paper • 2403.07718 • Published • 1
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Paper • 2407.05291 • Published • 2
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
Paper • 2410.09918 • Published • 3