rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iter2

No model card

Downloads last month: 126
