World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models Paper • 2306.08685 • Published Jun 14, 2023
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Paper • 2402.19446 • Published Feb 29, 2024
DANLI: Deliberative Agent for Following Natural Language Instructions Paper • 2210.12485 • Published Oct 22, 2022
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Paper • 2405.10292 • Published May 16, 2024
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models Paper • 2409.04787 • Published Sep 7, 2024