Automatic Classification of Tweets for Analyzing Communication Behavior of Museums

Nicolas Foucault, Antoine Courtin


Abstract
In this paper, we present a study on tweet classification which aims to define the communication behavior of the 103 French museums that participated in the 2014 Twitter operation MuseumWeek. The tweets were automatically classified into four communication categories: sharing experience, promoting participation, interacting with the community, and promoting-informing about the institution. Our classification is multi-class. It combines Support Vector Machines and Naive Bayes methods and is supported by a selection of eighteen subtypes of features of four different kinds: metadata information, punctuation marks, tweet-specific features, and lexical features. It was tested against a corpus of 1,095 tweets manually annotated by two experts in Natural Language Processing and Information Communication and twelve Community Managers of French museums. We obtained a state-of-the-art F1-score of 72% with 10-fold cross-validation. This result is very encouraging since it is even better than some state-of-the-art results found in the tweet classification literature.
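The four kinds of features named in the abstract can be illustrated with a minimal sketch. The abstract does not list the eighteen feature subtypes, so every feature name below (`meta_retweet_count`, `punct_question`, `tw_mentions`, and so on) is a hypothetical stand-in chosen for illustration, not the authors' feature set. The function only maps a tweet to a dictionary of counts; the resulting features could then be passed to a classifier such as a Support Vector Machine or Naive Bayes model, which is not shown here.

```python
import re

# Hypothetical patterns and feature names: the abstract names four kinds of
# features but does not list the eighteen subtypes used in the paper.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
HASHTAG_RE = re.compile(r"#\w+")
WORD_RE = re.compile(r"[^\W\d_]+")  # runs of letters, including accented ones


def extract_features(text, retweet_count=0):
    """Map one tweet to a flat dict of numeric features (illustrative only)."""
    # Drop URLs first so that their punctuation is not counted.
    stripped = URL_RE.sub(" ", text)
    # Lexical features ignore mentions and hashtags, which are counted separately.
    plain = MENTION_RE.sub(" ", HASHTAG_RE.sub(" ", stripped))
    features = {
        # metadata information (assumed to be supplied by the caller)
        "meta_retweet_count": retweet_count,
        # punctuation marks
        "punct_question": stripped.count("?"),
        "punct_exclamation": stripped.count("!"),
        # tweet-specific features
        "tw_mentions": len(MENTION_RE.findall(stripped)),
        "tw_hashtags": len(HASHTAG_RE.findall(stripped)),
        "tw_urls": len(URL_RE.findall(text)),
    }
    # lexical features: bag-of-words counts over lowercased tokens
    for word in WORD_RE.findall(plain):
        key = "lex_" + word.lower()
        features[key] = features.get(key, 0) + 1
    return features


if __name__ == "__main__":
    tweet = "Merci @louvre ! Qui vient demain ? #MuseumWeek https://t.co/abc"
    for name, value in sorted(extract_features(tweet, retweet_count=3).items()):
        print(name, value)
```

Prefixing each feature name with its kind (`meta_`, `punct_`, `tw_`, `lex_`) keeps the four groups separable, which makes it easy to switch one group off and observe how the classification score changes.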
Anthology ID:
L16-1480
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3006–3013
URL:
https://aclanthology.org/L16-1480
Cite (ACL):
Nicolas Foucault and Antoine Courtin. 2016. Automatic Classification of Tweets for Analyzing Communication Behavior of Museums. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3006–3013, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Automatic Classification of Tweets for Analyzing Communication Behavior of Museums (Foucault & Courtin, LREC 2016)
PDF:
https://aclanthology.org/L16-1480.pdf