---
pipeline_tag: text-generation
---

This is the official checkpoint of the feedback model trained using COFFEE-GYM with the PPO strategy. Given erroneous code, the model generates natural language feedback.

For further details, please see our paper.

https://huggingface.co/spaces/Coffee-Gym/Project-Coffee-Gym
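
Since the card's `pipeline_tag` is `text-generation`, the checkpoint can presumably be loaded with the standard `transformers` causal-LM classes. The following is a minimal sketch under that assumption, not the authors' reference usage. The prompt layout in `build_prompt` is hypothetical (the template used during training is not documented here and may differ), and `model_id` is a placeholder argument that you must set to this repository's Hub ID.

```python
def build_prompt(problem: str, wrong_code: str) -> str:
    """Assemble a prompt. Hypothetical layout; the training template may differ."""
    return (
        "Problem description:\n"
        f"{problem.strip()}\n\n"
        "Incorrect code:\n"
        f"{wrong_code.strip()}\n\n"
        "Feedback:"
    )


def generate_feedback(
    model_id: str,
    problem: str,
    wrong_code: str,
    max_new_tokens: int = 256,
) -> str:
    """Generate feedback for erroneous code with the checkpoint at `model_id`."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(problem, wrong_code), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated feedback is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

To try it, call `generate_feedback(model_id, problem, wrong_code)` with this repository's Hub ID as `model_id`. If the paper or project page specifies a prompt template, replace the body of `build_prompt` with it.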