Wiki GPT
Code to generate the GPT-wiki-intro dataset. This dataset can be used to train and evaluate models that classify text as human-written or GPT-generated.
To run the code:
- Install requirements from requirements.txt
- Create config.py and add your OPENAI_API_KEY
- Create a data folder
- Run process_wiki_data.ipynb
- Run download_openai.ipynb
- Run process_results.ipynb
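The config.py and data folder steps above could be scripted roughly as follows. This is a minimal sketch, not part of the repository: the helper name `prepare_workspace` is hypothetical, and it assumes the notebooks read the key from a module-level `OPENAI_API_KEY` variable in config.py, which the steps above imply but do not spell out.

```python
from pathlib import Path


def prepare_workspace(root=".", api_key="YOUR_OPENAI_API_KEY"):
    """Create the data folder and a config.py template under `root`.

    Hypothetical helper. It assumes config.py only needs a single
    module-level OPENAI_API_KEY string.
    """
    root = Path(root)

    # Create the data folder; do nothing if it already exists.
    data_dir = root / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    # Write config.py only if it is missing, so an existing key is kept.
    config_path = root / "config.py"
    if not config_path.exists():
        config_path.write_text(f'OPENAI_API_KEY = "{api_key}"\n')

    return data_dir, config_path
```

Calling `prepare_workspace()` from the repository root leaves a placeholder key in config.py, which should be replaced with a real key before running download_openai.ipynb. Keeping config.py out of version control avoids committing the key by accident.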
Citation
Please cite as follows:
@misc {aaditya_bhat_2023,
author = { {Aaditya Bhat} },
title = { GPT-wiki-intro (Revision 0e458f5) },
year = 2023,
url = { https://huggingface.co/datasets/aadityaubhat/GPT-wiki-intro },
doi = { 10.57967/hf/0326 },
publisher = { Hugging Face }
}