Diving into Self-Evolving Training for Multimodal Reasoning • Paper • 2412.17451 • Published 14 days ago • 41
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought • Paper • 2412.17498 • Published 14 days ago • 21
Agent-SafetyBench: Evaluating the Safety of LLM Agents • Paper • 2412.14470 • Published 18 days ago • 11
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios • Paper • 2412.08972 • Published 25 days ago • 9
VisionArena: 230K Real World User-VLM Conversations with Preference Labels • Paper • 2412.08687 • Published 25 days ago • 13
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials • Paper • 2412.09605 • Published 24 days ago • 26
Apollo: An Exploration of Video Understanding in Large Multimodal Models • Paper • 2412.10360 • Published 23 days ago • 136
Finance Commons • Collection • A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17, 2024 • 7