Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages

W17-0101 [bib]: Dustin Bowers; Antti Arppe; Jordan Lachler; Sjur Moshagen; Trond Trosterud
A Morphological Parser for Odawa

W17-0102 [bib]: Ghazaleh Kazeminejad; Andrew Cowell; Mans Hulden
Creating lexical resources for polysynthetic languages – the case of Arapaho

W17-0103 [bib]: Nick Thieberger; Conal Tuohy
From Small to Big Data: paper manuscripts to RDF triples of Australian Indigenous Vocabularies

W17-0104 [bib]: Emmanuel Ngué Um
Issues in digital text representation, on-line dissemination, sharing and re-use for African minority languages

W17-0105 [bib]: Gary Holton; Kavon Hooshiar; Nick Thieberger
Developing collection management tools to create more robust and reliable linguistic data

W17-0106 [bib]: Gina-Anne Levow; Emily M. Bender; Patrick Littell; Kristen Howell; Shobhana Chelliah; Joshua Crowgey; Dan Garrette; Jeff Good; Sharon Hargus; David Inman; Michael Maxwell; Michael Tjalve; Fei Xia
STREAMLInED Challenges: Aligning Research Interests with Shared Tasks

W17-0107 [bib]: Lucy Bell; Lawrence Bell
Work With What You've Got

W17-0108 [bib]: Antti Arppe; Marie-Odile Junker; Delasie Torkornoo
Converting a comprehensive lexical database into a computational model: The case of East Cree verb inflection

W17-0109 [bib]: Ciprian Gerstenberger; Niko Partanen; Michael Rießler
Instant annotations in ELAN corpora of spoken and written Komi, an endangered language of the Barents Sea region

W17-0110 [bib]: Kristen Howell; Emily M. Bender; Michel Lockwood; Fei Xia; Olga Zamaraeva
Inferring Case Systems from IGT: Enriching the Enrichment

W17-0111 [bib]: Jordan Kodner; Spencer Kaplan; Hongzhi Xu; Mitchell P. Marcus; Charles Yang
Case Studies in the Automatic Characterization of Grammars from Small Wordlists

W17-0112 [bib]: Michael Maxwell; Aric Bills
Endangered Data for Endangered Languages: Digitizing Print dictionaries

W17-0113 [bib]: David Meyer
A computationally-assisted procedure for discovering poetic organization within oral tradition

W17-0114 [bib]: Jeffrey Micher
Improving Coverage of an Inuktitut Morphological Analyzer Using a Segmental Recurrent Neural Network

W17-0115 [bib]: Amanda Miller; Micha Elsner
Click reduction in fluent speech: a semi-automated analysis of Mangetti Dune !Xung

W17-0116 [bib]: C. Anton Rytting; Julie Yelle
DECCA Repurposed: Detecting transcription inconsistencies without an orthographic standard

W17-0117 [bib]: Moira Saltzman
Jejueo talking dictionary: A collaborative online database for language revitalization

W17-0118 [bib]: Olga Zamaraeva; František Kratochvíl; Emily M. Bender; Fei Xia; Kristen Howell
Computational Support for Finding Word Classes: A Case Study of Abui

W17-0119 [bib]: Patrick Littell; Aidan Pine; Henry Davis
Waldayu and Waldayu Mobile: Modern digital dictionary interfaces for endangered languages

W17-0120 [bib]: Alexa N. Little
Connecting Documentation and Revitalization: A New Approach to Language Apps

W17-0121 [bib]: Mat Bettinson; Steven Bird
Developing a Suite of Mobile Applications for Collaborative Language Documentation

W17-0122 [bib]: Timothy Kempton
Cross-language forced alignment to assist community-based linguistics for low resource languages

W17-0123 [bib]: Antonios Anastasopoulos; David Chiang
A case study on using speech-to-translation alignments for language documentation

Proceedings of the 21st Nordic Conference on Computational Linguistics

W17-0201 [bib]: Erik Velldal; Lilja Øvrelid; Petter Hohle
Joint UD Parsing of Norwegian Bokmål and Nynorsk

W17-0202 [bib]: Prasanth Kolachina; Martin Riedl; Chris Biemann
Replacing OOV Words For Dependency Parsing With Distributional Semantics

W17-0203 [bib]: Ali Basirat; Joakim Nivre
Real-valued Syntactic Word Vectors (RSV) for Greedy Neural Dependency Parsing

W17-0204 [bib]: Kimmo Kettunen; Laura Löfberg
Tagging Named Entities in 19th Century and Modern Finnish Newspaper Material with a Finnish Semantic Tagger

W17-0205 [bib]: Marie Dubremetz; Joakim Nivre
Machine Learning for Rhetorical Figure Detection: More Chiasmus with Less Annotation

W17-0206 [bib]: Alexander Wallin; Pierre Nugues
Coreference Resolution for Swedish and German using Distant Supervision

W17-0207 [bib]: Kimmo Koskenniemi
Aligning phonemes using finite-state methods

W17-0208 [bib]: Katri Leino; Mikko Kurimo
Acoustic Model Compression with MAP adaptation

W17-0209 [bib]: Senka Drobac; Pekka Kauppinen; Krister Lindén
OCR and post-correction of historical Finnish texts

W17-0210 [bib]: Asbjørn Steinskog; Jonas Therkelsen; Björn Gambäck
Twitter Topic Modeling by Tweet Aggregation

W17-0211 [bib]: Anton Södergren; Pierre Nugues
A Multilingual Entity Linker Using PageRank and Semantic Graphs

W17-0212 [bib]: Avo Muromägi; Kairit Sirts; Sven Laur
Linear Ensembles of Word Embedding Models

W17-0213 [bib]: Flavio Massimiliano Cecchini; Chris Biemann; Martin Riedl
Using Pseudowords for Algorithm Comparison: An Evaluation Framework for Graph-based Word Sense Induction

W17-0214 [bib]: Tommi Pirinen; Francis M. Tyers; Trond Trosterud; Ryan Johnson; Kevin Unhammer; Tiina Puolakainen
North-Sámi to Finnish rule-based machine translation system

W17-0215 [bib]: Lene Antonsen; Ciprian Gerstenberger; Maja Kappfjell; Sandra Nystø Ráhka; Marja-Liisa Olthuis; Trond Trosterud; Francis M. Tyers
Machine translation with North Saami as a pivot language

W17-0216 [bib]: Jesper Näsman; Beáta Megyesi; Anne Palmér
SWEGRAM – A Web-Based Tool for Automatic Annotation and Analysis of Swedish Texts

W17-0217 [bib]: Petter Hohle; Lilja Øvrelid; Erik Velldal
Optimizing a PoS Tagset for Norwegian Dependency Parsing

W17-0218 [bib]: Veronika Laippala; Juhani Luotolahti; Aki-Juhani Kyröläinen; Tapio Salakoski; Filip Ginter
Creating register sub-corpora for the Finnish Internet Parsebank

W17-0219 [bib]: Simon Dobnik; Erik de Graaf
KILLE: a Framework for Situated Agents for Learning Language Through Interaction

W17-0220 [bib]: Dimitrios Kokkinakis; Kristina Lundholm Fors; Eva Björkner; Arto Nordlund
Data Collection from Persons with Mild Forms of Cognitive Impairment and Healthy Controls - Infrastructure for Classification and Prediction of Dementia

W17-0221 [bib]: Tommi Jauhiainen; Krister Lindén; Heidi Jauhiainen
Evaluation of language identification methods using 285 languages

W17-0222 [bib]: Siim Orasmaa; Heiki-Jaan Kaalep
Can We Create a Tool for General Domain Event Analysis?

W17-0223 [bib]: Eckhard Bick
From Treebank to Propbank: A Semantic-Role and VerbNet Corpus for Danish

W17-0224 [bib]: Johannes Bjerva; Robert Östling
Cross-lingual Learning of Semantic Textual Similarity with Multilingual Word Representations

W17-0225 [bib]: Johannes Bjerva
Will my auxiliary tagging task help? Estimating Auxiliary Tasks Effectivity in Multi-Task Learning

W17-0226 [bib]: Carl Börstell; Robert Östling
Iconic Locations in Swedish Sign Language: Mapping Form to Meaning with Lexical Databases

W17-0227 [bib]: Marcus Klang; Pierre Nugues
Docforia: A Multilayer Document Model

W17-0228 [bib]: Viljami Venekoski; Jouko Vankka
Finnish resources for evaluating language model semantics

W17-0229 [bib]: Steinþór Steingrímsson; Jón Guðnason; Sigrún Helgadóttir; Eiríkur Rögnvaldsson
Málrómur: A Manually Verified Corpus of Recorded Icelandic Speech

W17-0230 [bib]: Sara Stymne
The Effect of Translationese on Tuning for Statistical Machine Translation

W17-0231 [bib]: Johannes Graën; Dominique Sandoz; Martin Volk
Multilingwis2 – Explore Your Parallel Corpus

W17-0232 [bib]: Anders Nøklestad; Kristin Hagen; Janne Bondi Johannessen; Michał Kosek; Joel Priestley
A modernised version of the Glossa corpus search system

W17-0233 [bib]: Juhani Luotolahti; Jenna Kanerva; Filip Ginter
Dep_search: Efficient Search Tool for Large Dependency Parsebanks

W17-0234 [bib]: Jouna Pyysalo
Proto-Indo-European Lexicon: The Generative Etymological Dictionary of Indo-European Languages

W17-0235 [bib]: Roberts Rozis; Raivis Skadiņš
Tilde MODEL - Multilingual Open Data for EU Languages

W17-0236 [bib]: Adam Ek; Sofia Knuutinen
Mainstreaming August Strindberg with Text Normalization

W17-0237 [bib]: Murhaf Fares; Andrey Kutuzov; Stephan Oepen; Erik Velldal
Word vectors, reuse, and replicability: Towards a community repository of large-text resources

W17-0238 [bib]: Mika Koistinen; Kimmo Kettunen; Tuula Pääkkönen
Improving Optical Character Recognition of Finnish Historical Newspapers with a Combination of Fraktur & Antiqua Models and Image Preprocessing

W17-0239 [bib]: Pierre Lison; Andrey Kutuzov
Redefining Context Windows for Word Embedding Models: An Experimental Study

W17-0240 [bib]: Adam Persson
The Effect of Excluding Out of Domain Training Data from Supervised Named-Entity Recognition

W17-0241 [bib]: Andrew Salway; Paul Meurer; Knut Hofland; Øystein Reigem
Quote Extraction and Attribution from Norwegian Newspapers

W17-0242 [bib]: Heidi Sand; Erik Velldal; Lilja Øvrelid
Wordnet extension via word embeddings: Experiments on the Norwegian Wordnet

W17-0243 [bib]: Robert Östling; Carl Börstell; Moa Gärdenfors; Mats Wirén
Universal Dependencies for Swedish Sign Language

W17-0244 [bib]: Johan Falkenjack; Evelina Rennes; Daniel Fahlborg; Vida Johansson; Arne Jönsson
Services for text simplification and analysis

W17-0245 [bib]: Johannes Graën; Christof Bless
Exploring Properties of Intralingual and Interlingual Association Measures Visually

W17-0246 [bib]: Peter Juel Henrichsen
TALERUM - Learning Danish by Doing Danish

W17-0247 [bib]: Aarne Ranta; Prasanth Kolachina; Thomas Hallgren
Cross-Lingual Syntax: Relating Grammatical Framework with Universal Dependencies

W17-0248 [bib]: Victoria Rosén; Helge Dyvik; Paul Meurer; Koenraad De Smedt
Exploring Treebanks with INESS Search

W17-0249 [bib]: Aleksi Vesanto; Filip Ginter; Hannu Salmi; Asko Nivala; Tapio Salakoski
A System for Identifying and Exploring Text Repetition in Large Historical Document Corpora

Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017)

W17-0401 [bib]: Željko Agić
Cross-Lingual Parser Selection for Low-Resource Languages

W17-0402 [bib]: Lars Ahrenberg
Swedish Prepositions are not Pure Function Words

W17-0403 [bib]: Gosse Bouma; Gertjan Van Noord
Increasing Return on Annotation Investment: The Automatic Construction of a Universal Dependency Treebank for Dutch

W17-0404 [bib]: Çağrı Çöltekin; Ben Campbell; Erhard Hinrichs; Heike Telljohann
Converting the TüBa-D/Z Treebank of German to Universal Dependencies

W17-0405 [bib]: Peter Dirix; Liesbeth Augustinus; Daniel van Niekerk; Frank Van Eynde
Universal Dependencies for Afrikaans

W17-0406 [bib]: Kira Droganova; Daniel Zeman
Elliptic Constructions: Spotting Patterns in UD Treebanks

W17-0407 [bib]: Felix Hennig; Arne Köhn
Dependency Tree Transformation with Tree Transducers

W17-0408 [bib]: John Lee; Herman Leung; Keying Li
Towards Universal Dependencies for Learner Chinese

W17-0409 [bib]: Natalia Levshina
Does Syntactic Informativity Predict Word Length? A Cross-Linguistic Study Based on the Universal Dependencies Corpora

W17-0410 [bib]: Kadri Muischnek; Kaili Müürisep
Estonian Copular and Existential Constructions as an UD Annotation Problem

W17-0411 [bib]: Joakim Nivre; Chiao-Ting Fang
Universal Dependency Evaluation

W17-0412 [bib]: Martin Popel; Zdeněk Žabokrtský; Martin Vojtek
Udapi: Universal API for Universal Dependencies

W17-0413 [bib]: Prokopis Prokopidis; Haris Papageorgiou
Universal Dependencies for Greek

W17-0414 [bib]: Aarne Ranta; Prasanth Kolachina
From Universal Dependencies to Abstract Syntax

W17-0415 [bib]: Natalie Schluter; Željko Agić
Empirically Sampling Universal Dependencies

W17-0416 [bib]: Sebastian Schuster; Matthew Lamm; Christopher D. Manning
Gapping Constructions in Universal Dependencies v2

W17-0417 [bib]: Hajime Senuma; Akiko Aizawa
Toward Universal Dependencies for Ainu

W17-0418 [bib]: Miikka Silfverberg; Mans Hulden
Automatic Morpheme Segmentation and Labeling in Universal Dependencies Resources

W17-0419 [bib]: Guillaume Wisniewski; Ophélie Lacroix
A Systematic Comparison of Syntactic Representations of Dependency Parsing

Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages

W17-0601 [bib]: Jack Rueter; Mika Hämäläinen
Synchronized Mediawiki based analyzer dictionary development

W17-0602 [bib]: Jack Rueter
DEMO: Giellatekno Open-source click-in-text dictionaries for bringing closely related languages into contact

W17-0603 [bib]: Eszter Simon; Nikolett Mus
Languages under the influence: Building a database of Uralic languages

W17-0604 [bib]: Ciprian Gerstenberger; Niko Partanen; Michael Rießler; Joshua Wilbur
Instant Annotations – Applying NLP Methods to the Annotation of Spoken Language Documentation Corpora

W17-0605 [bib]: Guersande Chaminade; Thierry Poibeau
Preliminary Experiments concerning Verbal Predicative Structure Extraction from a Large Finnish Corpus

W17-0606 [bib]: Csilla Horváth; Norbert Szilágyi; Veronika Vincze; Ágoston Nagy
Language technology resources and tools for Mansi: an overview

W17-0607 [bib]: Francis M. Tyers; Mariya Sheyanova
Annotation schemes in North Sámi dependency parsing

W17-0608 [bib]: Sindre Reino Trosterud; Trond Trosterud; Anna-Kaisa Räisänen; Leena Niiranen; Mervi Haavisto; Kaisa Maliniemi
A morphological analyser for Kven

Proceedings of the 7th Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2017)

W17-0701 [bib]: Matthew Nelson; Stanislas Dehaene; Christophe Pallier; John Hale
Entropy Reduction correlates with temporal lobe activity

W17-0702 [bib]: Laurel Perkins; Naomi Feldman; Jeffrey Lidz
Learning an Input Filter for Argument Structure Acquisition

W17-0703 [bib]: Zachary Burchill; T. Florian Jaeger
Grounding sound change in ideal observer models of perception

W17-0704 [bib]: Rachael Tatman
“Oh, I've Heard That Before”: Modelling Own-Dialect Bias After Perceptual Learning by Weighting Training Data

W17-0705 [bib]: Amanda Doucette
Inherent Biases of Recurrent Neural Networks for Phonological Assimilation and Dissimilation

W17-0706 [bib]: Naho Orita
Predicting Japanese scrambling in the wild

Proceedings of the 11th Linguistic Annotation Workshop

W17-0801 [bib]: Sven Buechel; Udo Hahn
Readers vs. Writers vs. Texts: Coping with Different Perspectives of Text Understanding in Emotion Annotation

W17-0802 [bib]: Courtney Napoles; Joel Tetreault; Aasish Pappu; Enrica Rosato; Brian Provenzale
Finding Good Conversations Online: The Yahoo News Annotated Comments Corpus

W17-0803 [bib]: Merel Scholman; Vera Demberg
Crowdsourcing discourse interpretations: On the influence of context and the reliability of a connective insertion task

W17-0804 [bib]: Özlem Çetinoğlu
A Code-Switching Corpus of Turkish-German Conversations

W17-0805 [bib]: Héctor Martínez Alonso; Amaury Delamaire; Benoît Sagot
Annotating omission in statement pairs

W17-0806 [bib]: Corien Bary; Leopold Hess; Kees Thijs; Peter Berck; Iris Hendrickx
Annotating Speech, Attitude and Perception Reports

W17-0807 [bib]: Atsushi Fujita; Kikuko Tanabe; Chiho Toyoshima; Mayuka Yamamoto; Kyo Kageura; Anthony Hartley
Consistent Classification of Translation Revisions: A Case Study of English-Japanese Student Translations

W17-0808 [bib]: Richard Eckart de Castilho; Nancy Ide; Emanuele Lapponi; Stephan Oepen; Keith Suderman; Erik Velldal; Marc Verhagen
Representation and Interchange of Linguistic Annotation. An In-Depth, Side-by-Side Comparison of Three Designs

W17-0809 [bib]: Deniz Zeyrek; Murathan Kurfalı
TDB 1.1: Extensions on Turkish Discourse Bank

W17-0810 [bib]: Maria Pia di Buono; Martin Tutek; Jan Šnajder; Goran Glavaš; Bojana Dalbelo Bašić; Natasa Milic-Frayling
Two Layers of Annotation for Representing Event Mentions in News Stories

W17-0811 [bib]: Syed Sarfaraz Akhtar; Arihant Gupta; Avijit Vajpayee; Arjit Srivastava; Manish Shrivastava
Word Similarity Datasets for Indian Languages: Annotation and Baseline Systems

W17-0812 [bib]: Jesse Dunietz; Lori Levin; Jaime Carbonell
The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations

W17-0813 [bib]: Ines Rehbein; Josef Ruppenhofer
Catching the Common Cause: Extraction and Annotation of Causal Relations and their Participants

W17-0814 [bib]: Silvana Hartmann; Éva Mújdricza-Maydt; Ilia Kuznetsov; Iryna Gurevych; Anette Frank
Assessing SRL Frameworks with Automatic Training Data Expansion

Proceedings of the 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics

W17-0901 [bib]: Lilian Wanzare; Alessandra Zarcone; Stefan Thater; Manfred Pinkal
Inducing Script Structure from Crowdsourced Event Descriptions via Semi-Supervised Clustering

W17-0902 [bib]: Rachel Wities; Vered Shwartz; Gabriel Stanovsky; Meni Adler; Ori Shapira; Shyam Upadhyay; Dan Roth; Eugenio Martínez-Cámara; Iryna Gurevych; Ido Dagan
A Consolidated Open Knowledge Representation for Multiple Texts

W17-0903 [bib]: Edoardo Maria Ponti; Anna Korhonen
Event-Related Features in Feedforward Neural Networks Contribute to Identifying Causal Relations in Discourse

W17-0904 [bib]: Manfred Klenner; Don Tuggener; Simon Clematide
Stance Detection in Facebook Posts of a German Right-wing Party

W17-0905 [bib]: Nathanael Chambers
Behind the Scenes of an Evolving Event Cloze Test

W17-0906 [bib]: Nasrin Mostafazadeh; Michael Roth; Annie Louis; Nathanael Chambers; James Allen
LSDSem 2017 Shared Task: The Story Cloze Test

W17-0907 [bib]: Roy Schwartz; Maarten Sap; Ioannis Konstas; Leila Zilles; Yejin Choi; Noah A. Smith
Story Cloze Task: UW NLP System

W17-0908 [bib]: Michael Bugert; Yevgeniy Puzikov; Andreas Rücklé; Judith Eckle-Kohler; Teresa Martin; Eugenio Martínez-Cámara; Daniil Sorokin; Maxime Peyrard; Iryna Gurevych
LSDSem 2017: Exploring Data Generation Methods for the Story Cloze Test

W17-0909 [bib]: Michael Flor; Swapna Somasundaran
Sentiment Analysis and Lexical Cohesion for the Story Cloze Task

W17-0910 [bib]: Niko Schenk; Christian Chiarcos
Resource-Lean Modeling of Coherence in Commonsense Stories

W17-0911 [bib]: Melissa Roemmele; Sosuke Kobayashi; Naoya Inoue; Andrew Gordon
An RNN-based Binary Classifier for the Story Cloze Test

W17-0912 [bib]: Pranav Goel; Anil Kumar Singh
IIT (BHU): System Description for LSDSem'17 Shared Task

W17-0913 [bib]: Todor Mihaylov; Anette Frank
Story Cloze Ending Selection Baselines and Data Examination

Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres

W17-1001 [bib]: George Giannakopoulos; John Conroy; Jeff Kubina; Peter A. Rankel; Elena Lloret; Josef Steinberger; Marina Litvak; Benoit Favre
MultiLing 2017 Overview

W17-1002 [bib]: Ying Xu; Jey Han Lau; Timothy Baldwin; Trevor Cohn
Decoupling Encoder and Decoder Networks for Abstractive Document Summarization

W17-1003 [bib]: Gaetano Rossiello; Pierpaolo Basile; Giovanni Semeraro
Centroid-based Text Summarization through Compositionality of Word Embeddings

W17-1004 [bib]: Marina Litvak; Natalia Vanetik
Query-based summarization using MDL principle

W17-1005 [bib]: Lei Li; Liyuan Mao; Moye Chen
Word Embedding and Topic Modeling Enhanced Multiple Features for Content Linking and Argument / Sentiment Labeling in Online Forums

W17-1006 [bib]: Elena Lloret; Ester Boldrini; Patricio Martinez-Barco; Manuel Palomar
Ultra-Concise Multi-genre Summarisation of Web2.0: towards Intelligent Content Generation

W17-1007 [bib]: Samira Ellouze; Maher Jaoua; Lamia Hadrich Belguith
Machine Learning Approach to Evaluate MultiLingual Summaries

Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media

W17-1101 [bib]: Anna Schmidt; Michael Wiegand
A Survey on Hate Speech Detection using Natural Language Processing

W17-1102 [bib]: Ye Tian; Thiago Galery; Giulio Dulcinati; Emilia Molimpakis; Chao Sun
Facebook sentiment: Reactions and Emojis

W17-1103 [bib]: Jan Milan Deriu; Martin Weilenmann; Dirk Von Gruenigen; Mark Cieliebak
Potential and Limitations of Cross-Domain Sentiment Classification

W17-1104 [bib]: Kevin McKelvey; Peter Goutzounis; Stephen da Cruz; Nathanael Chambers
Aligning Entity Names with Online Aliases on Twitter

W17-1105 [bib]: Svitlana Vakulenko; Lyndon Nixon; Mihai Lupu
Character-based Neural Embeddings for Tweet Clustering

W17-1106 [bib]: Mark Cieliebak; Jan Milan Deriu; Dominic Egger; Fatih Uzdilli
A Twitter Corpus and Benchmark Resources for German Sentiment Analysis

Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)

W17-1201 [bib]: Marcos Zampieri; Shervin Malmasi; Nikola Ljubešić; Preslav Nakov; Ahmed Ali; Jörg Tiedemann; Yves Scherrer; Noëmi Aepli
Findings of the VarDial Evaluation Campaign 2017

W17-1202 [bib]: Gonzalo Donoso; David Sanchez
Dialectometric analysis of language variation in Twitter

W17-1203 [bib]: Taraka Rama; Çağrı Çöltekin; Pavel Sofroniev
Computational analysis of Gondi dialects

W17-1204 [bib]: Stefanie Dipper; Sandra Waldenberger
Investigating Diatopic Variation in a Historical Corpus

W17-1205 [bib]: Paolo Rosso
Author Profiling at PAN: from Age and Gender Identification to Language Variety Identification (invited talk)

W17-1206 [bib]: Tekabe Legesse Feleke
The similarity and Mutual Intelligibility between Amharic and Tigrigna Varieties

W17-1207 [bib]: Marta R. Costa-jussà
Why Catalan-Spanish Neural Machine Translation? Analysis, comparison and combination with standard Rule and Phrase-based technologies

W17-1208 [bib]: Hossein Hassani
Kurdish Interdialect Machine Translation

W17-1209 [bib]: Jennifer Williams; Charlie Dagli
Twitter Language Identification Of Similar Languages And Dialects Without Ground Truth

W17-1210 [bib]: Yves Scherrer; Achim Rabus
Multi-source morphosyntactic tagging for spoken Rusyn

W17-1211 [bib]: Abualsoud Hanani; Aziz Qaroush; Stephen Taylor
Identifying dialects with textual and acoustic cues

W17-1212 [bib]: Tommi Jauhiainen; Krister Lindén; Heidi Jauhiainen
Evaluating HeLI with Non-Linear Mappings

W17-1213 [bib]: Pablo Gamallo; Jose Ramom Pichel; Iñaki Alegria
A Perplexity-Based Method for Similar Languages Discrimination

W17-1214 [bib]: Yves Bestgen
Improving the Character Ngram Model for the DSL Task with BM25 Weighting and Less Frequently Used Feature Sets

W17-1215 [bib]: Marcelo Criscuolo; Sandra Maria Aluisio
Discriminating between Similar Languages with Word-level Convolutional Neural Networks

W17-1216 [bib]: Jörg Tiedemann
Cross-lingual dependency parsing for closely related languages - Helsinki's submission to VarDial 2017

W17-1217 [bib]: Helena Gomez; Ilia Markov; Jorge Baptista; Grigori Sidorov; David Pinto
Discriminating between Similar Languages Using a Combination of Typed and Untyped Character N-grams and Words

W17-1218 [bib]: Çağrı Çöltekin; Taraka Rama
Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing

W17-1219 [bib]: Maria Medvedeva; Martin Kroon; Barbara Plank
When Sparse Traditional Models Outperform Dense Neural Networks: the Curious Case of Discriminating between Similar Languages

W17-1220 [bib]: Shervin Malmasi; Marcos Zampieri
German Dialect Identification in Interview Transcriptions

W17-1221 [bib]: Simon Clematide; Peter Makarov
CLUZH at VarDial GDI 2017: Testing a Variety of Machine Learning Tools for the Classification of Swiss German Dialects

W17-1222 [bib]: Shervin Malmasi; Marcos Zampieri
Arabic Dialect Identification Using iVectors and ASR Transcripts

W17-1223 [bib]: Adrien Barbaresi
Discriminating between Similar Languages using Weighted Subword Features

W17-1224 [bib]: Chris van der Lee; Antal van den Bosch
Exploring Lexical and Syntactic Features for Language Variety Identification

W17-1225 [bib]: Radu Tudor Ionescu; Andrei Butnaru
Learning to Identify Arabic and German Dialects using Multiple Kernels

W17-1226 [bib]: Rudolf Rosa; Daniel Zeman; David Mareček; Zdeněk Žabokrtský
Slavic Forest, Norwegian Wood

Proceedings of the Third Arabic Natural Language Processing Workshop

W17-1301 [bib]: Wafia Adouane; Simon Dobnik
Identification of Languages in Algerian Arabic Multilingual Documents

W17-1302 [bib]: Kareem Darwish; Hamdy Mubarak; Ahmed Abdelali
Arabic Diacritization: Stats, Rules, and Hacks

W17-1303 [bib]: El Moatez Billah Nagoudi; Didier Schwab
Semantic Similarity of Arabic Sentences with Word Embeddings

W17-1304 [bib]: Claudia Borg; Albert Gatt
Morphological Analysis for the Maltese Language: The challenges of a hybrid system

W17-1305 [bib]: Salam Khalifa; Sara Hassan; Nizar Habash
A Morphological Analyzer for Gulf Arabic Verbs

W17-1306 [bib]: Younes Samih; Mohammed Attia; Mohamed Eldesouki; Ahmed Abdelali; Hamdy Mubarak; Laura Kallmeyer; Kareem Darwish
A Neural Architecture for Dialectal Arabic Segmentation

W17-1307 [bib]: Salima Medhaffar; Fethi Bougares; Yannick Estève; Lamia Hadrich-Belguith
Sentiment Analysis of Tunisian Dialects: Linguistic Ressources and Experiments

W17-1308 [bib]: Rim El Ballouli; Wassim El-Hajj; Ahmad Ghandour; Shady Elbassuoni; Hazem Hajj; Khaled Shaban
CAT: Credibility Analysis of Arabic Content on Twitter

W17-1309 [bib]: Maha Alamri; William J. Teahan
A New Error Annotation for Dyslexic texts in Arabic

W17-1310 [bib]: Hany Ahmed; Mohamed Elaraby; Abdullah M. Mousa; Mostafa Elhosiny; Sherif Abdou; Mohsen Rashwan
An Unsupervised Speaker Clustering Technique based on SOM and I-vectors for Speech Recognition Systems

W17-1311 [bib]: Amany Fashwan; Sameh Alansary
SHAKKIL: An Automatic Diacritization System for Modern Standard Arabic Texts

W17-1312 [bib]: Fahad Albogamy; Allan Ramsay; Hanady Ahmed
Arabic Tweets Treebanking and Parsing: A Bootstrapping Approach

W17-1313 [bib]: Ahmad Khwileh; Haithem Afli; Gareth Jones; Andy Way
Identifying Effective Translations for Cross-lingual Arabic-to-English User-generated Speech Search

W17-1314 [bib]: Ramy Baly; Gilbert Badaro; Georges El-Khoury; Rawan Moukalled; Rita Aoun; Hazem Hajj; Wassim El-Hajj; Nizar Habash; Khaled Shaban
A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models

W17-1315 [bib]: Lingliang Zhang; Nizar Habash; Godfried Toussaint
Robust Dictionary Lookup in Multiple Noisy Orthographies

W17-1316 [bib]: Kareem Darwish; Hamdy Mubarak; Ahmed Abdelali; Mohamed Eldesouki
Arabic POS Tagging: Don't Abandon Feature Engineering Just Yet

W17-1317 [bib]: Soumia Bougrine; Aicha Chorana; Abdallah Lakhdari; Hadda Cherroun
Toward a Web-based Speech Corpus for Algerian Dialectal Arabic Varieties

W17-1318 [bib]: Muhammad Abdul-Mageed
Not All Segments are Created Equal: Syntactically Motivated Sentiment Analysis in Lexical Space

W17-1319 [bib]: Mohamed Amine Menacer; Odile Mella; Dominique Fohr; Denis Jouvet; David Langlois; Kamel Smaili
An enhanced automatic speech recognition system for Arabic

W17-1320 [bib]: Dima Taji; Nizar Habash; Daniel Zeman
Universal Dependencies for Arabic

W17-1321 [bib]: Mohamed Al-Badrashiny; Abdelati Hawwari; Mona Diab
A Layered Language Model based Hybrid Approach to Automatic Full Diacritization of Arabic

W17-1322 [bib]: Nada Almarwani; Mona Diab
Arabic Textual Entailment with Word Embeddings

Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing

W17-1401 [bib]: Serge Sharoff
Toward Pan-Slavic NLP: Some Experiments with Language Adaptation

W17-1402 [bib]: Andrey Kutuzov; Elizaveta Kuzmenko; Lidia Pivovarova
Clustering of Russian Adjective-Noun Constructions using Word Embeddings

W17-1403 [bib]: Domagoj Alagić; Jan Šnajder
A Preliminary Study of Croatian Lexical Substitution

W17-1404 [bib]: Agata Savary; Jakub Waszczuk
Projecting Multiword Expression Resources on a Polish Treebank

W17-1405 [bib]: Achim Rabus; Yves Scherrer
Lexicon Induction for Spoken Rusyn – Challenges and Results

W17-1406 [bib]: Kaja Dobrovoljc; Tomaž Erjavec; Simon Krek
The Universal Dependencies Treebank for Slovenian

W17-1407 [bib]: Tanja Samardžić; Mirjana Starović; Željko Agić; Nikola Ljubešić
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages

W17-1408 [bib]: Alexey Sorokin
Spelling Correction for Morphologically Rich Language: a Case Study of Russian

W17-1409 [bib]: Paula Gombar; Zoran Medić; Domagoj Alagić; Jan Šnajder
Debunking Sentiment Lexicons: A Case of Domain-Specific Sentiment Classification for Croatian

W17-1410 [bib]: Nikola Ljubešić; Tomaž Erjavec; Darja Fišer
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text

W17-1411 [bib]: Leon Rotim; Jan Šnajder
Comparison of Short-Text Sentiment Analysis Methods for Croatian

W17-1412 [bib]: Jakub Piskorski; Lidia Pivovarova; Jan Šnajder; Josef Steinberger; Roman Yangarber
The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages

W17-1413 [bib]: Michał Marcińczuk; Jan Kocoń; Marcin Oleksy
Liner2 — a Generic Framework for Named Entity Recognition

W17-1414 [bib]: James Mayfield; Paul McNamee; Cash Costello
Language-Independent Named Entity Analysis Using Parallel Projection and Rule-Based Disambiguation

W17-1415 [bib]: Ekaterina Chernyak
Comparison of String Similarity Measures for Obscenity Filtering

W17-1416 [bib]: Justina Mandravickaite; Tomas Krilavičius
Stylometric Analysis of Parliamentary Speeches: Gender Dimension

W17-1417 [bib]: Kseniya Buraya; Lidia Pivovarova; Sergey Budkov; Andrey Filchenkov
Towards Never Ending Language Learning for Morphologically Rich Languages

W17-1418 [bib]: Ben Verhoeven; Iza Škrjanec; Senja Pollak
Gender Profiling for Slovene Twitter communication: the Influence of Gender Marking, Content and Style

Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017)

W17-1501 [bib]: Nafise Sadat Moosavi; Michael Strube
Use Generalized Representations, But Do Not Forget Surface Features

W17-1502 [bib]: Ander Soraluze; Olatz Arregi; Xabier Arregi; Arantza Díaz de Ilarraza
Enriching Basque Coreference Resolution System using Semantic Knowledge sources

W17-1503 [bib]: Maciej Ogrodniczuk; Bartłomiej Nitoń
Improving Polish Mention Detection with Valency Dictionary

W17-1504 [bib]: Pascal Amsili; Olga Seminck
A Google-Proof Collection of French Winograd Schemas

W17-1505 [bib]: Lesly Miculicich Werlen; Andrei Popescu-Belis
Using Coreference Links to Improve Spanish-to-English Machine Translation

W17-1506 [bib]: Yulia Grishina; Manfred Stede
Multi-source annotation projection of coreference chains: assessing strategies and testing opportunities

W17-1507 [bib]: Yulia Grishina
CORBON 2017 Shared Task: Projection-Based Coreference Resolution

W17-1508 [bib]: Michal Novák; Anna Nedoluzhko; Zdeněk Žabokrtský
Projection-based Coreference Resolution Using Deep Syntax

Proceedings of the First ACL Workshop on Ethics in Natural Language Processing

W17-1601 [bib]: Brian Larson
Gender as a Variable in Natural-Language Processing: Ethical Considerations

W17-1602 [bib]: Corina Koolen; Andreas van Cranenburgh
These are not the Stereotypes You are Looking For: Bias and Fairness in Authorial Gender Attribution

W17-1603 [bib]: Margot Mieskes
A Quantitative Study of Data in the NLP community

W17-1604 [bib]: Jochen L. Leidner; Vassilis Plachouras
Ethical by Design: Ethics Best Practices for Natural Language Processing

W17-1605 [bib]: Nitin Madnani; Anastassia Loukina; Alina von Davier; Jill Burstein; Aoife Cahill
Building Better Open-Source Tools to Support Fairness in Automated Scoring

W17-1606 [bib]: Rachael Tatman
Gender and Dialect Bias in YouTube's Automatic Captions

W17-1607 [bib]: Dave Lewis; Joss Moorkens; Kaniz Fatema
Integrating the Management of Personal Data Protection and Open Science with Research Ethics

W17-1608 [bib]: Carla Parra Escartín; Wessel Reijers; Teresa Lynn; Joss Moorkens; Andy Way; Chao-Hong Liu
Ethical Considerations in NLP Shared Tasks

W17-1609 [bib]: Rachel Rudinger; Chandler May; Benjamin Van Durme
Social Bias in Elicited Natural Language Inferences

W17-1610 [bib]: Simon Suster; Stephan Tulkens; Walter Daelemans
A Short Review of Ethical Challenges in Clinical Natural Language Processing

W17-1611 [bib]: Tyler Schnoebelen
Goal-Oriented Design for Ethical Machine Learning and NLP

W17-1612 [bib]: Adrian Benton; Glen Coppersmith; Mark Dredze
Ethical Research Protocols for Social Media Health Research

W17-1613 [bib]: Charese Smiley; Frank Schilder; Vassilis Plachouras; Jochen L. Leidner
Say the Right Thing Right: Ethics Issues in Natural Language Generation Systems

Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)

W17-1701 [bib]: Petra Barancikova; Václava Kettnerová
ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs

W17-1702 [bib]: Sophie Chesney; Guillaume Jacquet; Ralf Steinberger; Jakub Piskorski
Multi-word Entity Classification in a Highly Multilingual Environment

W17-1703 [bib]: Marcos Garcia; Marcos García-Salido; Margarita Alonso-Ramos
Using bilingual word-embeddings for multilingual collocation extraction

W17-1704 [bib]: Agata Savary; Carlos Ramisch; Silvio Cordeiro; Federico Sangati; Veronika Vincze; Behrang QasemiZadeh; Marie Candito; Fabienne Cap; Voula Giouli; Ivelina Stoyanova; Antoine Doucet
The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions

W17-1705 [bib]: Katalin Ilona Simkó; Viktória Kovács; Veronika Vincze
USzeged: Identifying Verbal Multiword Expressions with POS Tagging and Parsing Techniques

W17-1706 [bib]: Luka Nerima; Vasiliki Foufi; Eric Wehrli
Parsing and MWE Detection: Fips at the PARSEME Shared Task

W17-1707 [bib]: Natalia Klyueva; Antoine Doucet; Milan Straka
Neural Networks for Multi-Word Expression Detection

W17-1708 [bib]: Stefan Bott; Sabine Schulte im Walde
Factoring Ambiguity out of the Prediction of Compositionality for German Multi-Word Expressions

W17-1709 [bib]: Jamie Y. Findlay
Multiword expressions and lexicalism: the view from LFG

W17-1710 [bib]: Kristina Geeraert; R. Harald Baayen; John Newman
Understanding Idiomatic Variation

W17-1711 [bib]: Natalie Vargas; Carlos Ramisch; Helena Caseli
Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment

W17-1712 [bib]: Justina Mandravickaite; Tomas Krilavičius
Identification of Multiword Expressions for Latvian and Lithuanian: Hybrid Approach

W17-1713 [bib]: Fabienne Cap
Show Me Your Variance and I Tell You Who You Are - Deriving Compound Compositionality from Word Alignments

W17-1714 [bib]: Melania Cabezas-García; Antonio San Martín
Semantic annotation to characterize contextual variation in terminological noun compounds: a pilot study

W17-1715 [bib]: Alfredo Maldonado; Lifeng Han; Erwan Moreau; Ashjan Alsulaimani; Koel Dutta Chowdhury; Carl Vogel; Qun Liu
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with Syntactic Dependency Features and Semantic Re-Ranking

W17-1716 [bib]: Tiberiu Boroş; Sonia Pipa; Verginica Barbu Mititelu; Dan Tufiş
A data-driven approach to verbal multiword expression detection. PARSEME Shared Task system description paper

W17-1717 [bib]: Hazem Al Saied; Matthieu Constant; Marie Candito
The ATILF-LLF System for Parseme Shared Task: a Transition-based Verbal Multiword Expression Tagger

W17-1718 [bib]: Shiva Taslimipoor; Omid Rohanian; Ruslan Mitkov; Afsaneh Fazly
Investigating the Opacity of Verb-Noun Multiword Expression Usages in Context

W17-1719 [bib]: Archna Bhatia; Choh Man Teng; James Allen
Compositionality in Verb-Particle Constructions

W17-1720 [bib]: Uxoa Iñurrieta; Itziar Aduriz; Arantza Díaz de Ilarraza; Gorka Labaka; Kepa Sarasola
Rule-Based Translation of Spanish Verb-Noun Combinations into Basque

W17-1721 [bib]: Veronika Vincze
Verb-Particle Constructions in Questions

W17-1722 [bib]: Marion Weller-Di Marco
Simple Compound Splitting for German

W17-1723 [bib]: Manon Scholivet; Carlos Ramisch
Identification of Ambiguous Multiword Expressions Using Sequence Models and Lexical Resources

W17-1724 [bib]: Agnès Tutin; Olivier Kraif
Comparing Recurring Lexico-Syntactic Trees (RLTs) and Ngram Techniques for Extended Phraseology Extraction

W17-1725 [bib]: Matthieu Constant; Héctor Martínez Alonso
Benchmarking Joint Lexical and Syntactic Analysis on Multiword-Rich Data

W17-1726 [bib]: King Chan; Julian Brooke; Timothy Baldwin
Semi-Automated Resolution of Inconsistency for a Harmonized Multiword Expression and Dependency Parse Annotation

W17-1727 [bib]: Maja Buljan; Jan Šnajder
Combining Linguistic Features for the Detection of Croatian Multiword Expressions

W17-1728 [bib]: Maximilian Köper; Sabine Schulte im Walde
Complex Verbs are Different: Exploring the Visual Modality in Multi-Modal Models to Predict Compositionality

Proceedings of the Workshop Computational Semantics Beyond Events and Roles

W17-1801 [bib]: Alexander Calderwood; Elizabeth A. Pruett; Raymond Ptucha; Christopher Homan; Cecilia Ovesdotter Alm
Understanding the Semantics of Narratives of Interpersonal Violence through Reader Annotations and Physiological Reactions

W17-1802 [bib]: Gene Kim; Lenhart Schubert
Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

W17-1803 [bib]: Ingrid Falk; Fabienne Martin
Towards a lexicon of event-selecting predicates for a French FactBank

W17-1804 [bib]: Federico Fancellu; Siva Reddy; Adam Lopez; Bonnie Webber
Universal Dependencies to Logical Form with Negation Scope

W17-1805 [bib]: Johan Bos
Meaning Banking beyond Events and Roles

W17-1806 [bib]: Begoña Altuna; Anne-Lyse Minard; Manuela Speranza
The Scope and Focus of Negation: A Complete Annotation Framework for Italian

W17-1807 [bib]: Montserrat Marimon; Jorge Vivaldi; Núria Bel
Annotation of negation in the IULA Spanish Clinical Record Corpus

W17-1808 [bib]: Noa Cruz; Roser Morante; Manuel J. Maña López; Jacinto Mata Vázquez; Carlos L. Parra Calderón
Annotating Negation in Spanish Clinical Texts

W17-1809 [bib]: Hangfeng He; Federico Fancellu; Bonnie Webber
Neural Networks for Negation Cue Detection in Chinese

W17-1810 [bib]: Martine Enger; Erik Velldal; Lilja Øvrelid
An open-source tool for negation detection: a maximum-margin approach

Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications

W17-1901 [bib]: Pablo Gamallo; Martín Pereira-Fariña
Compositional Semantics using Feature-Based Models from WordNet

W17-1902 [bib]: Mikhail Khodak; Andrej Risteski; Christiane Fellbaum; Sanjeev Arora
Automated WordNet Construction Using Word Embeddings

W17-1903 [bib]: Maximilian Köper; Sabine Schulte im Walde
Improving Verb Metaphor Detection by Propagating Abstractness to Words, Phrases and Individual Senses

W17-1904 [bib]: Yuan Ling; Yuan An; Sadid Hasan
Improving Clinical Diagnosis Inference through Integration of Structured and Unstructured Knowledge

W17-1905 [bib]: Kentaro Kanada; Tetsunori Kobayashi; Yoshihiko Hayashi
Classifying Lexical-semantic Relationships by Exploiting Sense/Concept Representations

W17-1906 [bib]: Milton King; Paul Cook
Supervised and unsupervised approaches to measuring usage similarity

W17-1907 [bib]: Ignatius Ezeani; Mark Hepple; Ikechukwu Onyenwe
Lexical Disambiguation of Igbo using Diacritic Restoration

W17-1908 [bib]: Mahmoud El-Haj; Paul Rayson; Scott Piao; Stephen Wattam
Creating and Validating Multilingual Semantic Representations for Six Languages: Expert versus Non-Expert Crowds

W17-1909 [bib]: Alexander Panchenko; Stefano Faralli; Simone Paolo Ponzetto; Chris Biemann
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation

W17-1910 [bib]: Thomas Kober; Julie Weeds; John Wilkie; Jeremy Reffin; David Weir
One Representation per Word - Does it make Sense for Composition?

W17-1911 [bib]: Kyoung-Rok Jang; Sung-Hyon Myaeng
Elucidating Conceptual Properties from Word Embeddings

W17-1912 [bib]: Enrico Mensa; Daniele P. Radicioni; Antonio Lieto
TTCSe: a Vectorial Resource for Computing Conceptual Similarity

W17-1913 [bib]: Lorenzo Gregori; Alessandro Panunzi
Measuring the Italian-English lexical gap for action verbs and its impact on translation

W17-1914 [bib]: Anne Cocos; Marianna Apidianaki; Chris Callison-Burch
Word Sense Filtering Improves Embedding-Based Lexical Substitution

W17-1915 [bib]: Aleksander Wawer; Agnieszka Mykowiecka
Supervised and Unsupervised Word Sense Disambiguation on Word Embedding Vectors of Unambiguous Synonyms

Proceedings of the Sixth Workshop on Vision and Language

W17-2001 [bib]: Yanchao Yu; Arash Eshghi; Gregory Mills; Oliver Lemon
The BURCHAK corpus: a Challenge Data Set for Interactive Learning of Visually Grounded Word Meanings

W17-2002 [bib]: Brandon Birmingham; Adrian Muscat
The Use of Object Labels and Spatial Prepositions as Keywords in a Web-Retrieval-Based Image Caption Generation System

W17-2003 [bib]: Aparna Nurani Venkitasubramanian; Tinne Tuytelaars; Marie-Francine Moens
Learning to Recognize Animals by Watching Documentaries: Using Subtitles as Weak Supervision

W17-2004 [bib]: Iacer Calixto; Daniel Stein; Evgeny Matusov; Sheila Castilho; Andy Way
Human Evaluation of Multi-modal Neural Machine Translation: A Case-Study on E-Commerce Listing Titles

W17-2005 [bib]: Arnau Ramisa; Fei Yan; Francesc Moreno-Noguer; Krystian Mikolajczyk
The BreakingNews Dataset

W17-2006 [bib]: Patrizia Paggio; Costanza Navarretta; Bart Jongejan
Automatic identification of head movements in video-recorded conversations: can words help?

W17-2007 [bib]: Antonio Rubio Romano; LongLong Yu; Edgar Simo-Serra; Francesc Moreno-Noguer
Multi-Modal Fashion Product Retrieval