Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 16 days ago • 36
CoTA Datasets Collection This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated 16 days ago • 5
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated 16 days ago • 4
Salesforce/xgen-mm-vid-phi3-mini-r-v1.5-128tokens-16frames Image-Text-to-Text • Updated 18 days ago • 1 • 2
XGen-MM-1 models and datasets Collection A collection of all XGen-MM (Foundation LMM) models! • 16 items • Updated 17 days ago • 38