cristinae
/

roberta_politicalStanceClassifier

Text Classification

Model card Files Files and versions Community

roberta_politicalStanceClassifier / README.md

cristinae's picture

Update README.md

290eda6 verified 9 months ago

|

history blame contribute delete

2.24 kB

	---
	license: lgpl
	language:
	- en
	- de
	- es
	- ca
	metrics:
	- accuracy
	pipeline_tag: text-classification
	tags:
	- stance
	- classification
	- pytorch
	- multilingual
	---

	## Model Description

	<!-- Provide a longer summary of what this model is. -->

	- Developed by: Cristina España-Bonet
	- Model type: Binary stance classifier on top of XLM-RoBERTa
	- Language(s) (NLP): English, German and Spanish
	- License: LGPL
	- Finetuned from model: XLM-RoBERTa Large

	## Model Sources

	<!-- Provide the basic links for the model. -->

	- Repository: https://github.com/cristinae/docTransformer
	- Data: https://zenodo.org/records/8417761
	- Paper: https://aclanthology.org/2023.findings-emnlp.787/


	## Direct Use

	Determine the political stance of a (newspaper) article. Binary classification: left vs. right stance

	#### Evaluation

	```srun --ntasks 1 --gpus-per-task 1 python -u docClassifier.py --task evaluation -f ./ -o politicalStanceLvsR_en.bin --test_dataset your.test```

	#### Classification

	```srun --ntasks 1 --gpus-per-task 1 python -u docClassifier.py --task classification -f ./ -o politicalStanceLvsR_en.bin --test_dataset your.test```

	## Citation [optional]

	<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->

	BibTeX:
	```
	@inproceedings{espana-bonet:2023,
	title = "Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a {C}hat{GPT} and Bard Newspaper",
	author = "Espa{\~n}a-Bonet, Cristina",
	editor = "Bouamor, Houda and Pino, Juan and Bali, Kalika",
	booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2023",
	month = dec,
	year = "2023",
	address = "Singapore",
	publisher = "Association for Computational Linguistics",
	url = "https://aclanthology.org/2023.findings-emnlp.787",
	doi = "10.18653/v1/2023.findings-emnlp.787",
	pages = "11757--11777"
	}
	```

	APA:

	España-Bonet, Cristina. (2023, December). Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a ChatGPT and Bard Newspaper. In Findings of the Association for Computational Linguistics: EMNLP 2023 (pp. 11757-11777).