Creating a Domain-diverse Corpus for Theory-based Argument Quality Assessment

Lily Ng, Anne Lauscher, Joel Tetreault, Courtney Napoles


Abstract
Computational models of argument quality (AQ) have focused primarily on assessing the overall quality or just one specific characteristic of an argument, such as its convincingness or its clarity. Previous work has claimed that assessment based on theoretical dimensions of argumentation could benefit writers, but the development of such models has been limited by the lack of annotated data. In this work, we describe GAQCorpus, the first large, domain-diverse annotated corpus of theory-based AQ. We discuss how we designed the annotation task to reliably collect a large number of judgments with crowdsourcing, formulating theory-based guidelines that helped make subjective judgments of AQ more objective. We demonstrate how to identify arguments and adapt the annotation task for three diverse domains. Our work will inform research on theory-based argumentation annotation and enable the creation of more diverse corpora to support computational AQ assessment.
Anthology ID:
2020.argmining-1.13
Volume:
Proceedings of the 7th Workshop on Argument Mining
Month:
December
Year:
2020
Address:
Online
Editors:
Elena Cabrio, Serena Villata
Venue:
ArgMining
Publisher:
Association for Computational Linguistics
Pages:
117–126
URL:
https://aclanthology.org/2020.argmining-1.13
Cite (ACL):
Lily Ng, Anne Lauscher, Joel Tetreault, and Courtney Napoles. 2020. Creating a Domain-diverse Corpus for Theory-based Argument Quality Assessment. In Proceedings of the 7th Workshop on Argument Mining, pages 117–126, Online. Association for Computational Linguistics.
Cite (Informal):
Creating a Domain-diverse Corpus for Theory-based Argument Quality Assessment (Ng et al., ArgMining 2020)
PDF:
https://aclanthology.org/2020.argmining-1.13.pdf
Code:
grammarly/gaqcorpus