pipeline_tag: text-generation

This is the official checkpoint of the feedback model trained with COFFEE-GYM using the PPO strategy.

Given a piece of erroneous code, this model generates natural language feedback on it.
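As a rough illustration of that input/output behavior, the sketch below loads a checkpoint with the `transformers` library and asks it for feedback. It is a minimal sketch under stated assumptions, not the official usage: `MODEL_ID` is a placeholder to replace with this repository's ID, and the prompt template in `build_prompt` is hypothetical, since the exact format the model was trained on is not given here. It also assumes the checkpoint loads as a causal language model, which the `text-generation` pipeline tag suggests.

```python
# Hypothetical placeholder: replace with this repository's actual model ID.
MODEL_ID = "<this-repo-id>"


def build_prompt(problem: str, wrong_code: str) -> str:
    """Assemble a plain-text prompt.

    The template is an assumption for illustration, not the official format.
    """
    return (
        "Problem description:\n"
        f"{problem.strip()}\n\n"
        "Incorrect code:\n"
        f"{wrong_code.strip()}\n\n"
        "Feedback:"
    )


def generate_feedback(
    problem: str,
    wrong_code: str,
    model_id: str = MODEL_ID,
    max_new_tokens: int = 256,
) -> str:
    """Generate natural language feedback for a piece of erroneous code."""
    # Imported lazily so build_prompt can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    prompt = build_prompt(problem, wrong_code)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `generate_feedback("Print the sum of a and b.", "print(a - b)")` would download the checkpoint and return the generated feedback as a string. If the paper specifies a different prompt format, adjust `build_prompt` to match it.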

For further details, please see our paper.

https://huggingface.co/spaces/Coffee-Gym/Project-Coffee-Gym