arxiv:2407.12852
Tony Montes
t-montes
AI & ML interests
NLP, GenAI
Recent Activity
upvoted a paper 14 days ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
liked a model 19 days ago
impira/layoutlm-document-qa
upvoted a paper 26 days ago
RL Zero: Zero-Shot Language to Behaviors without any Supervision
Organizations
Papers: 2
Spaces: 5
Models: None public yet
Datasets: None public yet