---
pipeline_tag: text-generation
---
This is the official checkpoint of the feedback model trained with COFFEE-GYM using PPO.

This model generates natural language feedback for a given piece of erroneous code.
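
A minimal usage sketch with the `transformers` library might look like the following. The prompt template in `build_feedback_prompt`, the helper names, and the `model_id` argument are all assumptions made for illustration; this card does not specify the prompt format used in training, so check the paper or project page before relying on it.

```python
def build_feedback_prompt(problem: str, wrong_code: str) -> str:
    """Assemble a plain-text prompt.

    Hypothetical template: the format used during training is not
    documented in this card.
    """
    return (
        "Problem description:\n"
        f"{problem.strip()}\n\n"
        "Incorrect code:\n"
        f"{wrong_code.strip()}\n\n"
        "Feedback:"
    )


def generate_feedback(
    model_id: str, problem: str, wrong_code: str, max_new_tokens: int = 256
) -> str:
    """Generate feedback for `wrong_code` with the checkpoint at `model_id`."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    prompt = build_feedback_prompt(problem, wrong_code)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated feedback remains.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

To try it, pass this repository's id as `model_id` together with a problem statement and the incorrect solution; the function returns the generated feedback as a string.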

For further details, please see our paper.

https://huggingface.co/spaces/Coffee-Gym/Project-Coffee-Gym