Carlo Strapparava


2019

pdf bib
Proceedings of the Tenth Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
Alexandra Balahur | Roman Klinger | Veronique Hoste | Carlo Strapparava | Orphee De Clercq
Proceedings of the Tenth Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

pdf bib
Personality Traits Recognition in Literary Texts
Daniele Pizzolli | Carlo Strapparava
Proceedings of the Second Workshop on Storytelling

Interesting stories often are built around interesting characters. Finding and detailing what makes an interesting character is a real challenge, but certainly a significant cue is the character personality traits. Our exploratory work tests the adaptability of the current personality traits theories to literal characters, focusing on the analysis of utterances in theatre scripts. And, at the opposite, we try to find significant traits for interesting characters. The preliminary results demonstrate that our approach is reasonable. Using machine learning for gaining insight into the personality traits of fictional characters can make sense.

pdf bib
Anglicized Words and Misspelled Cognates in Native Language Identification
Ilia Markov | Vivi Nastase | Carlo Strapparava
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications

In this paper, we present experiments that estimate the impact of specific lexical choices of people writing in a second language (L2). In particular, we look at misspelled words that indicate lexical uncertainty on the part of the author, and separate them into three categories: misspelled cognates, “L2-ed” (in our case, anglicized) words, and all other spelling errors. We test the assumption that such errors contain clues about the native language of an essay’s author through the task of native language identification. The results of the experiments show that the information brought by each of these categories is complementary. We also note that while the distribution of such features changes with the proficiency level of the writer, their contribution towards native language identification remains significant at all levels.

2018

pdf bib
Punctuation as Native Language Interference
Ilia Markov | Vivi Nastase | Carlo Strapparava
Proceedings of the 27th International Conference on Computational Linguistics

In this paper, we describe experiments designed to explore and evaluate the impact of punctuation marks on the task of native language identification. Punctuation is specific to each language, and is part of the indicators that overtly represent the manner in which each language organizes and conveys information. Our experiments are organized in various set-ups: the usual multi-class classification for individual languages, also considering classification by language groups, across different proficiency levels, topics and even cross-corpus. The results support our hypothesis that punctuation marks are persistent and robust indicators of the native language of the author, which do not diminish in influence even when a high proficiency level in a non-native language is achieved.

pdf bib
A Computational Exploration of Exaggeration
Enrica Troiano | Carlo Strapparava | Gözde Özbal | Serra Sinem Tekiroğlu
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing

Several NLP studies address the problem of figurative language, but among non-literal phenomena, they have neglected exaggeration. This paper presents a first computational approach to this figure of speech. We explore the possibility to automatically detect exaggerated sentences. First, we introduce HYPO, a corpus containing overstatements (or hyperboles) collected on the web and validated via crowdsourcing. Then, we evaluate a number of models trained on HYPO, and bring evidence that the task of hyperbole identification can be successfully performed based on a small set of semantic features.

pdf bib
Metaphor: A Computational Perspective by Tony Veale, Ekaterina Shutova and Beata Beigman Klebanov
Carlo Strapparava
Computational Linguistics, Volume 44, Issue 1 - April 2018

pdf bib
The Role of Emotions in Native Language Identification
Ilia Markov | Vivi Nastase | Carlo Strapparava | Grigori Sidorov
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

We explore the hypothesis that emotion is one of the dimensions of language that surfaces from the native language into a second language. To check the role of emotions in native language identification (NLI), we model emotion information through polarity and emotion load features, and use document representations using these features to classify the native language of the author. The results indicate that emotion is relevant for NLI, even for high proficiency levels and across topics.

2017

pdf bib
Improving Native Language Identification by Using Spelling Errors
Lingzhen Chen | Carlo Strapparava | Vivi Nastase
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

In this paper, we explore spelling errors as a source of information for detecting the native language of a writer, a previously under-explored area. We note that character n-grams from misspelled words are very indicative of the native language of the author. In combination with other lexical features, spelling error features lead to 1.2% improvement in accuracy on classifying texts in the TOEFL11 corpus by the author’s native language, compared to systems participating in the NLI shared task.

pdf bib
Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism
Octavian Popescu | Carlo Strapparava
Proceedings of the 2017 EMNLP Workshop: Natural Language Processing meets Journalism

pdf bib
CIC-FBK Approach to Native Language Identification
Ilia Markov | Lingzhen Chen | Carlo Strapparava | Grigori Sidorov
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications

We present the CIC-FBK system, which took part in the Native Language Identification (NLI) Shared Task 2017. Our approach combines features commonly used in previous NLI research, i.e., word n-grams, lemma n-grams, part-of-speech n-grams, and function words, with recently introduced character n-grams from misspelled words, and features that are novel in this task, such as typed character n-grams, and syntactic n-grams of words and of syntactic relation tags. We use log-entropy weighting scheme and perform classification using the Support Vector Machines (SVM) algorithm. Our system achieved 0.8808 macro-averaged F1-score and shared the 1st rank in the NLI Shared Task 2017 scoring.

pdf bib
Word Etymology as Native Language Interference
Vivi Nastase | Carlo Strapparava
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

We present experiments that show the influence of native language on lexical choice when producing text in another language – in this particular case English. We start from the premise that non-native English speakers will choose lexical items that are close to words in their native language. This leads us to an etymology-based representation of documents written by people whose mother tongue is an Indo-European language. Based on this representation we grow a language family tree, that matches closely the Indo-European language tree.

pdf bib
A Computational Analysis of the Language of Drug Addiction
Carlo Strapparava | Rada Mihalcea
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers

We present a computational analysis of the language of drug users when talking about their drug experiences. We introduce a new dataset of over 4,000 descriptions of experiences reported by users of four main drug types, and show that we can predict with an F1-score of up to 88% the drug behind a certain experience. We also perform an analysis of the dominant psycholinguistic processes and dominant emotions associated with each drug type, which sheds light on the characteristics of drug users.

pdf bib
To Sing like a Mockingbird
Lorenzo Gatti | Gözde Özbal | Oliviero Stock | Carlo Strapparava
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers

Musical parody, i.e. the act of changing the lyrics of an existing and very well-known song, is a commonly used technique for creating catchy advertising tunes and for mocking people or events. Here we describe a system for automatically producing a musical parody, starting from a corpus of songs. The system can automatically identify characterizing words and concepts related to a novel text, which are taken from the daily news. These concepts are then used as seeds to appropriately replace part of the original lyrics of a song, using metrical, rhyming and lexical constraints. Finally, the parody can be sung with a singing speech synthesizer, with no intervention from the user.

2016

pdf bib
Emotions and NLP: Future Directions
Carlo Strapparava
Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

pdf bib
Innovative Semi-Automatic Methodology to Annotate Emotional Corpora
Lea Canales | Carlo Strapparava | Ester Boldrini | Patricio Martínez-Barco
Proceedings of the Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media (PEOPLES)

Detecting depression or personality traits, tutoring and student behaviour systems, or identifying cases of cyber-bulling are a few of the wide range of the applications, in which the automatic detection of emotion is a crucial element. Emotion detection has the potential of high impact by contributing the benefit of business, society, politics or education. Given this context, the main objective of our research is to contribute to the resolution of one of the most important challenges in textual emotion detection task: the problems of emotional corpora annotation. This will be tackled by proposing of a new semi-automatic methodology. Our innovative methodology consists in two main phases: (1) an automatic process to pre-annotate the unlabelled sentences with a reduced number of emotional categories; and (2) a refinement manual process where human annotators will determine which is the predominant emotion between the emotional categories selected in the phase 1. Our proposal in this paper is to show and evaluate the pre-annotation process to analyse the feasibility and the benefits by the methodology proposed. The results obtained are promising and allow obtaining a substantial improvement of annotation time and cost and confirm the usefulness of our pre-annotation process to improve the annotation task.

pdf bib
PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
Gözde Özbal | Carlo Strapparava | Serra Sinem Tekiroğlu
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

Proverbs are commonly metaphoric in nature and the mapping across domains is commonly established in proverbs. The abundance of proverbs in terms of metaphors makes them an extremely valuable linguistic resource since they can be utilized as a gold standard for various metaphor related linguistic tasks such as metaphor identification or interpretation. Besides, a collection of proverbs fromvarious languages annotated with metaphors would also be essential for social scientists to explore the cultural differences betweenthose languages. In this paper, we introduce PROMETHEUS, a dataset consisting of English proverbs and their equivalents in Italian.In addition to the word-level metaphor annotations for each proverb, PROMETHEUS contains other types of information such as the metaphoricity degree of the overall proverb, its meaning, the century that it was first recorded in and a pair of subjective questions responded by the annotators. To the best of our knowledge, this is the first multi-lingual and open-domain corpus of proverbs annotated with word-level metaphors.

pdf bib
Learning to Identify Metaphors from a Corpus of Proverbs
Gözde Özbal | Carlo Strapparava | Serra Sinem Tekiroğlu | Daniele Pighin
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing

2015

pdf bib
Echoes of Persuasion: The Effect of Euphony in Persuasive Communication
Marco Guerini | Gözde Özbal | Carlo Strapparava
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

pdf bib
Exploring Sensorial Features for Metaphor Identification
Serra Sinem Tekiroğlu | Gözde Özbal | Carlo Strapparava
Proceedings of the Third Workshop on Metaphor in NLP

pdf bib
SemEval-2015 Task 9: CLIPEval Implicit Polarity of Events
Irene Russo | Tommaso Caselli | Carlo Strapparava
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)

pdf bib
SemEval 2015, Task 7: Diachronic Text Evaluation
Octavian Popescu | Carlo Strapparava
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)

2014

pdf bib
Automation and Evaluation of the Keyword Method for Second Language Learning
Gözde Özbal | Daniele Pighin | Carlo Strapparava
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

pdf bib
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources
Spandana Gella | Carlo Strapparava | Vivi Nastase
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

pdf bib
Creative language explorations through a high-expressivity N-grams query language
Carlo Strapparava | Lorenzo Gatti | Marco Guerini | Oliviero Stock
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

pdf bib
Enriching the “Senso Comune” Platform with Automatically Acquired Data
Tommaso Caselli | Laure Vieu | Carlo Strapparava | Guido Vetere
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

pdf bib
Aligning an Italian WordNet with a Lexicographic Dictionary: Coping with limited data
Tommaso Caselli | Carlo Strapparava | Laure Vieu | Guido Vetere
Proceedings of the Seventh Global Wordnet Conference

pdf bib
A Computational Approach to Generate a Sensorial Lexicon
Serra Sinem Tekiroğlu | Gözde Özbal | Carlo Strapparava
Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex)

pdf bib
Automatic Domain Assignment for Word Sense Alignment
Tommaso Caselli | Carlo Strapparava
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

pdf bib
Sensicon: An Automatically Constructed Sensorial Lexicon
Serra Sinem Tekiroğlu | Gözde Özbal | Carlo Strapparava
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

2013

pdf bib
Bridging Languages through Etymology: The case of cross language text categorization
Vivi Nastase | Carlo Strapparava
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

pdf bib
BRAINSUP: Brainstorming Support for Creative Sentence Generation
Gözde Özbal | Daniele Pighin | Carlo Strapparava
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

pdf bib
Behind the Times: Detecting Epoch Changes using Large Corpora
Octavian Popescu | Carlo Strapparava
Proceedings of the Sixth International Joint Conference on Natural Language Processing

pdf bib
Aligning Verb Senses in Two Italian Lexical Semantic Resources
Tommaso Caselli | Carlo Strapparava | Laure Vieu | Guido Vetere
Proceedings of the Joint Symposium on Semantic Processing. Textual Inference and Structures in Corpora

pdf bib
Linguistic Linked Data for Sentiment Analysis
Paul Buitelaar | Mihael Arcan | Carlos Iglesias | Fernando Sánchez-Rada | Carlo Strapparava
Proceedings of the 2nd Workshop on Linked Data in Linguistics (LDL-2013): Representing and linking lexicons, terminologies and other language data

2012

pdf bib
A Computational Approach to the Automation of Creative Naming
Gözde Özbal | Carlo Strapparava
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

pdf bib
Ecological Evaluation of Persuasive Messages Using Google AdWords
Marco Guerini | Carlo Strapparava | Oliviero Stock
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

pdf bib
Brand Pitt: A Corpus to Explore the Art of Naming
Gözde Özbal | Carlo Strapparava | Marco Guerini
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

pdf bib
A Parallel Corpus of Music and Lyrics Annotated with Emotions
Carlo Strapparava | Rada Mihalcea | Alberto Battocchi
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

pdf bib
NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Martin Aleksandrov | Carlo Strapparava
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

pdf bib
Lyrics, Music, and Emotions
Rada Mihalcea | Carlo Strapparava
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

pdf bib
Corpus-based Explorations of Affective Load Differences in Arabic-Hebrew-English
Carlo Strapparava | Oliviero Stock | Ilai Alon
Proceedings of COLING 2012: Posters

2010

pdf bib
Proceedings of the 5th International Workshop on Semantic Evaluation
Katrin Erk | Carlo Strapparava
Proceedings of the 5th International Workshop on Semantic Evaluation

pdf bib
Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text
Diana Inkpen | Carlo Strapparava
Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text

pdf bib
The Color of Emotions in Texts
Carlo Strapparava | Gozde Ozbal
Proceedings of the 2nd Workshop on Cognitive Aspects of the Lexicon

pdf bib
Evaluation Metrics for Persuasive NLP with Google AdWords
Marco Guerini | Carlo Strapparava | Oliviero Stock
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

pdf bib
Studying the Lexicon of Dialogue Acts
Nicole Novielli | Carlo Strapparava
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

pdf bib
Predicting Persuasiveness in Political Discourses
Carlo Strapparava | Marco Guerini | Oliviero Stock
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

2009

pdf bib
Towards Unsupervised Recognition of Dialogue Acts
Nicole Novielli | Carlo Strapparava
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium

pdf bib
Kernel Methods for Minimally Supervised WSD
Claudio Giuliano | Alfio Massimiliano Gliozzo | Carlo Strapparava
Computational Linguistics, Volume 35, Number 4, December 2009

pdf bib
The Lie Detector: Explorations in the Automatic Recognition of Deceptive Language
Rada Mihalcea | Carlo Strapparava
Proceedings of the ACL-IJCNLP 2009 Conference Short Papers

2008

pdf bib
Resources for Persuasion
Marco Guerini | Carlo Strapparava | Oliviero Stock
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

pdf bib
Valentino: A Tool for Valence Shifting of Natural Language Texts
Marco Guerini | Carlo Strapparava | Oliviero Stock
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

2007

pdf bib
SemEval-2007 Task 14: Affective Text
Carlo Strapparava | Rada Mihalcea
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)

pdf bib
FBK-irst: Lexical Substitution Task Exploiting Domain and Syntagmatic Coherence
Claudio Giuliano | Alfio Gliozzo | Carlo Strapparava
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)

2006

pdf bib
Direct Word Sense Matching for Lexical Substitution
Ido Dagan | Oren Glickman | Alfio Gliozzo | Efrat Marmorshtein | Carlo Strapparava
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics

pdf bib
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization
Alfio Gliozzo | Carlo Strapparava
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics

pdf bib
The Affective Weight of Lexicon
Carlo Strapparava | Alessandro Valitutti | Oliviero Stock
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

pdf bib
Syntagmatic Kernels: a Word Sense Disambiguation Case Study
Claudio Giuliano | Alfio Gliozzo | Carlo Strapparava
Proceedings of the Workshop on Learning Structured Information in Natural Language Applications

2005

pdf bib
Domain Kernels for Word Sense Disambiguation
Alfio Gliozzo | Claudio Giuliano | Carlo Strapparava
Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05)

pdf bib
HAHAcronym: A Computational Humor System
Oliviero Stock | Carlo Strapparava
Proceedings of the ACL Interactive Poster and Demonstration Sessions

pdf bib
Investigating Unsupervised Learning for Text Categorization Bootstrapping
Alfio Gliozzo | Carlo Strapparava | Ido Dagan
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing

pdf bib
Making Computers Laugh: Investigations in Automatic Humor Recognition
Rada Mihalcea | Carlo Strapparava
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing

pdf bib
Domain Kernels for Text Categorization
Alfio Gliozzo | Carlo Strapparava
Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL-2005)

pdf bib
Cross Language Text Categorization by Acquiring Multilingual Domain Models from Comparable Corpora
Alfio Gliozzo | Carlo Strapparava
Proceedings of the ACL Workshop on Building and Using Parallel Texts

2004

pdf bib
Pattern abstraction and term similarity for Word Sense Disambiguation: IRST at Senseval-3
Carlo Strapparava | Alfio Gliozzo | Claudio Giuliano
Proceedings of SENSEVAL-3, the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text

pdf bib
The “Meaning” system on the English all-words task
Luís Villarejo | Lluis Màrquez | Eneko Agirre | David Martínez | Bernardo Magnini | Carlo Strapparava | Diana McCarthy | Andrés Montoyo | Armando Suárez
Proceedings of SENSEVAL-3, the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text

pdf bib
Unsupervised Domain Relevance Estimation for Word Sense Disambiguation
Alfio Gliozzo | Bernardo Magnini | Carlo Strapparava
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing

pdf bib
WordNet Affect: an Affective Extension of WordNet
Carlo Strapparava | Alessandro Valitutti
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2001

pdf bib
Using Domain Information for Word Sense Disambiguation
Bernardo Magnini | Carlo Strapparava | Giovanni Pezzulo | Alfio Gliozzo
Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems

2000

pdf bib
Experiments in Word Domain Disambiguation for Parallel Texts
Bernardo Magnini | Carlo Strapparava
ACL-2000 Workshop on Word Senses and Multi-linguality

1997

pdf bib
Lexical Discrimination with the Italian Version of WordNet
Alessandro Artale | Bernardo Magnini | Carlo Strapparava
Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications

1992

pdf bib
AN APPROACH TO MULTILEVEL SEMANTICS FOR APPLIED SYSTEMS
Alberto Lavelli | Bernardo Magnini | Carlo Strapparava
Third Conference on Applied Natural Language Processing