ACL Logo ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Google search the Anthology

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012)

L12-1001  [bib]: Kristiina Jokinen; Silvi Tenjes
Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data

L12-1002  [bib]: Felix Burkhardt
“You Seem Aggressive!” Monitoring Anger in a Practical Application

L12-1003  [bib]: Felix Burkhardt
Fast Labeling and Transcription with the Speechalyzer Toolkit

L12-1004  [bib]: Peter Spyns; Elisabeth D'Halleweyn
Smooth Sailing for STEVIN

L12-1005  [bib]: Daniel Stein; Bela Usabaev
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel

L12-1006  [bib]: Xabier Saralegi; Iker Manterola; Iñaki San Vicente
Building a Basque-Chinese Dictionary by Using English as Pivot

L12-1007  [bib]: Yi-jie Tang; Hsin-Hsi Chen
Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition

L12-1008  [bib]: Johanka Spoustová; Miroslav Spousta
A High-Quality Web Corpus of Czech

L12-1009  [bib]: Marc Luder
German Verb Patterns and Their Implementation in an Electronic Dictionary

L12-1010  [bib]: Diana Maynard; Mark A. Greenwood
Large Scale Semantic Annotation, Indexing and Search at The National Archives

L12-1011  [bib]: Abdul-Baquee Sharaf; Eric Atwell
QurAna: Corpus of the Quran annotated with Pronominal Anaphora

L12-1012  [bib]: Christian Scheible; Hinrich Schütze
Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank

L12-1013  [bib]: Simon Clematide; Stefan Gindl; Manfred Klenner; Stefanos Petrakis; Robert Remus; Josef Ruppenhofer; Ulli Waltinger; Michael Wiegand
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis

L12-1014  [bib]: Iñaki Sainz; Daniel Erro; Eva Navas; Inma Hernáez; Jon Sánchez; Ibon Saratxaga; Igor Odriozola
Versatile Speech Databases for High Quality Synthesis for Basque

L12-1015  [bib]: Hector Llorens; Leon Derczynski; Robert Gaizauskas; Estela Saquete
TIMEN: An Open Temporal Expression Normalisation Resource

L12-1016  [bib]: Julian Brooke; Graeme Hirst
Measuring Interlanguage: Native Language Identification with L1-influence Metrics

L12-1017  [bib]: Patrick Saint-Dizier
DISLOG: A logic-based language for processing discourse structures

L12-1018  [bib]: Michael Wiegand; Benjamin Roth; Eva Lasarcyk; Stephanie Köser; Dietrich Klakow
A Gold Standard for Relation Extraction in the Food Domain

L12-1019  [bib]: Sarah Bourse; Patrick Saint-Dizier
A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures

L12-1020  [bib]: Flore Barcellini; Camille Albert; Corinne Grosse; Patrick Saint-Dizier
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring

L12-1021  [bib]: Xavier Tannier
WebAnnotator, an Annotation Tool for Web Pages

L12-1022  [bib]: Maciej Ogrodniczuk; Piotr Pęzik; Adam Przepiórkowski
Towards a comprehensive open repository of Polish language resources

L12-1023  [bib]: Agnieszka Patejuk; Adam Przepiórkowski
Towards an LFG parser for Polish: An exercise in parasitic grammar development

L12-1024  [bib]: Kristiina Jokinen; Graham Wilcock
Constructive Interaction for Talking about Interesting Topics

L12-1025  [bib]: Hilder Pereira; Eder Novais; Andre Mariotti; Ivandre Paraboni
Corpus-based Referring Expressions Generation

L12-1026  [bib]: Eder Novais; Ivandre Paraboni; Douglas Silva
Portuguese Text Generation from Large Corpora

L12-1027  [bib]: Volha Petukhova; Rodrigo Agerri; Mark Fishel; Sergio Penkale; Arantza del Pozo; Mirjam Sepesy Maucec; Andy Way; Panayota Georgakopoulou; Martin Volk
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles

L12-1028  [bib]: Erik Cambria; Yunqing Xia; Amir Hussain
Affective Common Sense Knowledge Acquisition for Sentiment Analysis

L12-1029  [bib]: Panikos Heracleous; Carlos Ishi; Takahiro Miyashita; Norihiro Hagita
Body-conductive acoustic sensors in human-robot communication

L12-1030  [bib]: Lieve Macken; Veronique Hoste; Marielle Leijten; Luuk Van Waes
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information

L12-1031  [bib]: Verena Henrich; Erhard Hinrichs
A Comparative Evaluation of Word Sense Disambiguation Algorithms for German

L12-1032  [bib]: Sebastian Varges; Heike Bieler; Manfred Stede; Lukas C. Faulstich; Kristin Irsig; Malik Atalla
SemScribe: Natural Language Generation for Medical Reports

L12-1033  [bib]: Erhard Hinrichs; Thomas Zastrow
Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC

L12-1034  [bib]: Alain Joubert; Mathieu Lafourcade
A new dynamic approach for lexical networks evaluation

L12-1035  [bib]: Hongsuck Seo; Kyusong Lee; Gary Geunbae Lee; Soo-Ok Kweon; Hae-Ri Kim
Grammatical Error Annotation for Korean Learners of Spoken English

L12-1036  [bib]: Tobias Heinroth; Maximilian Grotz; Florian Nothdurft; Wolfgang Minker
Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues

L12-1037  [bib]: Marco Passarotti; Francesco Mambrini
First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin

L12-1038  [bib]: Stefanie Dipper; Melanie Seiss; Heike Zinsmeister
The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English

L12-1039  [bib]: Marta Tatu; Dan Moldovan
A Tool for Extracting Conversational Implicatures

L12-1040  [bib]: Dan Moldovan; Eduardo Blanco
Polaris: Lymba's Semantic Parser

L12-1041  [bib]: Veronika Vincze
Light Verb Constructions in the SzegedParalellFX English--Hungarian Parallel Corpus

L12-1042  [bib]: Dominique Fohr; Odile Mella
CoALT: A Software for Comparing Automatic Labelling Tools

L12-1043  [bib]: Lucie Válková; Martina Waclawičová; Michal Křen
Balanced data repository of spontaneous spoken Czech

L12-1044  [bib]: Volha Petukhova; Harry Bunt
The coding and annotation of multimodal dialogue acts

L12-1045  [bib]: Antje Schlaf; Robert Remus
Learning Categories and their Instances by Contextual Features

L12-1046  [bib]: Mathias Bank; Robert Remus; Martin Schierle
Textual Characteristics for Language Engineering

L12-1047  [bib]: Mathias Bank; Martin Schierle
A Survey of Text Mining Architectures and the UIMA Standard

L12-1048  [bib]: Melanie Seiss
A Rule-based Morphological Analyzer for Murrinh-Patha

L12-1049  [bib]: Piek Vossen; Attila Görög; Rubén Izquierdo; Antal Van den Bosch
DutchSemCor: Targeting the ideal sense-tagged corpus

L12-1050  [bib]: Bruno Cartoni; Thomas Meyer
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies

L12-1051  [bib]: Abdul-Baquee Sharaf; Eric Atwell
QurSim: A corpus for evaluation of relatedness in short texts

L12-1052  [bib]: Sara Stymne; Henrik Danielsson; Sofia Bremin; Hongzhan Hu; Johanna Karlsson; Anna Prytz Lillkull; Martin Wester
Eye Tracking as a Tool for Machine Translation Error Analysis

L12-1053  [bib]: Bart Jongejan
Automatic annotation of head velocity and acceleration in Anvil

L12-1054  [bib]: Jyrki Niemi; Krister Lindén
Representing the Translation Relation in a Bilingual Wordnet

L12-1055  [bib]: Marianna J. Martindale
Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?

L12-1056  [bib]: Gülşen Eryiğit
The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish

L12-1057  [bib]: Zoya Gavrilov; Stan Sclaroff; Carol Neidle; Sven Dickinson
Detecting Reduplication in Videos of American Sign Language

L12-1058  [bib]: Alexandr Rosen; Martin Vavřín
Building a multilingual parallel corpus for human users

L12-1059  [bib]: Kirk Roberts; Michael A. Roach; Joseph Johnson; Josh Guthrie; Sanda M. Harabagiu
EmpaTweet: Annotating and Detecting Emotions on Twitter

L12-1060  [bib]: Christina Feilmayr; Birgit Pröll; Elisabeth Linsmayr
EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems

L12-1061  [bib]: Nedelina Ivanova; Olle Eriksen
BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users

L12-1062  [bib]: Anton Leuski; Carsten Eickhoff; James Ganis; Victor Lavrenko
The BladeMistress Corpus: From Talk to Action in Virtual Worlds

L12-1063  [bib]: Béatrice Arnulphy; Xavier Tannier; Anne Vilnat
Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French

L12-1064  [bib]: Geneviève Caelen-Haumont; Sethserey Sam
Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu

L12-1065  [bib]: Cristina Bosco; Manuela Sanguinetti; Leonardo Lesmo
The Parallel-TUT: a multilingual and multiformat treebank

L12-1066  [bib]: Arno Scharl; Marta Sabou; Stefan Gindl; Walter Rafelsberger; Albert Weichselbraun
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources

L12-1067  [bib]: Florian Nothdurft; Wolfgang Minker
Using multimodal resources for explanation approaches in intelligent systems

L12-1068  [bib]: Markus Forsberg; Torbjörn Lager
Cloud Logic Programming for Integrating Language Technology Resources

L12-1069  [bib]: Fabio Tamburini; Matias Melandri
AnIta: a powerful morphological analyser for Italian

L12-1070  [bib]: Sylvia Springorum; Sabine Schulte im Walde; Antje Roßdeutscher
Automatic classification of German """"an"""" particle verbs

L12-1071  [bib]: Roberta Catizone; Louise Guthrie; Arthur Thomas; Yorick Wilks
LIE: Leadership, Influence and Expertise

L12-1072  [bib]: Valentina Bartalesi Lenzi; Giovanni Moretti; Rachele Sprugnoli
CAT: the CELCT Annotation Tool

L12-1073  [bib]: Yulan He; Hassan Saif; Zhongyu Wei; Kam-Fai Wong
Quantising Opinions for Political Tweets Analysis

L12-1074  [bib]: Radu Ion; Elena Irimia; Dan Ștefănescu; Dan Tufiș
ROMBAC: The Romanian Balanced Annotated Corpus

L12-1075  [bib]: Stasinos Konstantopoulos; Valia Kordoni; Nicola Cancedda; Vangelis Karkaletsis; Dietrich Klakow; Jean-Michel Renders
Task-Driven Linguistic Analysis based on an Underspecified Features Representation

L12-1076  [bib]: Ismaïl El Maarouf; Jeanne Villaneau
A French Fairy Tale Corpus syntactically and semantically annotated

L12-1077  [bib]: Roser Morante; Walter Daelemans
ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories

L12-1078  [bib]: Danuta Ploch; Leonhard Hennig; Angelina Duka; Ernesto William De Luca; Sahin Albayrak
GerNED: A German Corpus for Named Entity Disambiguation

L12-1079  [bib]: Silvia Vázquez; Núria Bel
A Classification of Adjectives for Polarity Lexicons Enhancement

L12-1080  [bib]: Héctor Martínez Alonso; Núria Bel; Bolette Sandford Pedersen
A voting scheme to detect semantic underspecification

L12-1081  [bib]: Aditi Sharma Grover; Annamart Nieman; Gerhard Van Huyssteen; Justus Roux
Aspects of a Legal Framework for Language Resource Management

L12-1082  [bib]: Lucie Poláková; Pavlína Jínová; Jiří Mírovský
Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component

L12-1083  [bib]: Heiki-Jaan Kaalep; Kadri Muischnek
Robust clause boundary identification for corpus annotation

L12-1084  [bib]: R.P. Clapham; L. van der Molen; R.J.JH. van Son; M. van den Brekel; F.J.M. Hilgers
NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy

L12-1085  [bib]: Iñaki San Vicente; Iker Manterola
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web

L12-1086  [bib]: Samuel Fernando; Mark Stevenson
Mapping WordNet synsets to Wikipedia articles

L12-1087  [bib]: Sanja Seljan; Marija Brkić; Tomislav Vičić
BLEU Evaluation of Machine-Translated English-Croatian Legislation

L12-1088  [bib]: Wolfgang Seeker; Jonas Kuhn
Making Ellipses Explicit in Dependency Conversion for a German Treebank

L12-1089  [bib]: Jorge Carrillo de Albornoz; Laura Plaza; Pablo Gervás
SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis

L12-1090  [bib]: John Noecker Jr; Michael Ryan
Distractorless Authorship Verification

L12-1091  [bib]: Majdi Sawalha; Claire Brierley; Eric Atwell
Predicting Phrase Breaks in Classical and Modern Standard Arabic Text

L12-1092  [bib]: Claire Brierley; Majdi Sawalha; Eric Atwell
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing

L12-1093  [bib]: Hitokazu Matsushita; Deryle Lonsdale
Item Development and Scoring for Japanese Oral Proficiency Testing

L12-1094  [bib]: Ziqi Zhang; Philip Webster; Victoria Uren; Andrea Varga; Fabio Ciravegna
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing

L12-1095  [bib]: Janine Pimentel
Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology

L12-1096  [bib]: Francisco Costa; António Branco
TimeBankPT: A TimeML Annotated Corpus of Portuguese

L12-1097  [bib]: Sanni Nimb; Bolette Sandford Pedersen
Towards a richer wordnet representation of properties

L12-1098  [bib]: Lars Borin; Markus Forsberg; Johan Roxendal
Korp ― the corpus infrastructure of Spräkbanken

L12-1099  [bib]: Lars Borin; Markus Forsberg; Leif-Jöran Olsson; Jonatan Uppström
The open lexical infrastructure of Spräkbanken

L12-1100  [bib]: Seth Kulick; Ann Bies; Justin Mott
Further Developments in Treebank Error Detection Using Derivation Trees

L12-1101  [bib]: Chris Biemann
Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution

L12-1102  [bib]: Charlotte Alazard; Corine Astésano; Michel Billières
MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.

L12-1103  [bib]: Andrei Popescu-Belis; Thomas Meyer; Jeevanthi Liyanapathirana; Bruno Cartoni; Sandrine Zufferey
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns

L12-1104  [bib]: Bonan Min; Ralph Grishman
Challenges in the Knowledge Base Population Slot Filling Task

L12-1105  [bib]: Alessandra Zarcone; Stefan Rued
Logical metonymies and qualia structures: an annotated database of logical metonymies for German

L12-1106  [bib]: Martina Katalin Szabó; Veronika Vincze; István Nagy T.
HunOr: A Hungarian―Russian Parallel Corpus

L12-1107  [bib]: Marcin Woliński; Marcin Miłkowski; Maciej Ogrodniczuk; Adam Przepiórkowski
PoliMorf: a (not so) new open morphological dictionary for Polish

L12-1108  [bib]: Elena Volodina; Sofie Johansson Kokkinakis
Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish

L12-1109  [bib]: Valentin I. Spitkovsky; Angel X. Chang
A Cross-Lingual Dictionary for English Wikipedia Concepts

L12-1110  [bib]: Martin Majliš; Zdeněk Žabokrtský
Language Richness of the Web

L12-1111  [bib]: Stefania Degaetano-Ortlieb; Ekaterina Lapshinova-Koltunski; Elke Teich
Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach

L12-1112  [bib]: Will Roberts; Valia Kordoni
Using Verb Subcategorization for Word Sense Disambiguation

L12-1113  [bib]: Sigrid Klerke; Anders Søgaard
DSim, a Danish Parallel Corpus for Text Simplification

L12-1114  [bib]: Magali Sanches Duran; Sandra Maria Aluísio
Propbank-Br: a Brazilian Treebank annotated with semantic role labels

L12-1115  [bib]: Slav Petrov; Dipanjan Das; Ryan McDonald
A Universal Part-of-Speech Tagset

L12-1116  [bib]: Aline Villavicencio; Beracah Yankama; Marco Idiart; Robert Berwick
A large scale annotated child language construction database

L12-1117  [bib]: Xuansong Li; Stephanie Strassel; Stephen Grimes; Safa Ismael; Mohamed Maamouri; Ann Bies; Nianwen Xue
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures

L12-1118  [bib]: Xuansong Li; Stephanie Strassel; Heng Ji; Kira Griffitt; Joe Ellis
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual

L12-1119  [bib]: Kugatsu Sadamitsu; Kuniko Saito; Kenji Imamura; Yoshihiro Matsuo
Constructing a Class-Based Lexical Dictionary using Interactive Topic Models

L12-1120  [bib]: Shota Yamasaki; Hirohisa Furukawa; Masafumi Nishida; Kristiina Jokinen; Seiichi Yamamoto
Multimodal Corpus of Multi-party Conversations in Second Language

L12-1121  [bib]: Satoshi Sato
Dictionary Look-up with Katakana Variant Recognition

L12-1122  [bib]: Angel X. Chang; Christopher Manning
SUTime: A library for recognizing and normalizing time expressions

L12-1123  [bib]: Dimitris Metaxas; Bo Liu; Fei Yang; Peng Yang; Nicholas Michael; Carol Neidle
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking

L12-1124  [bib]: Eleanor Clark; Kenji Araki
Two Database Resources for Processing Social Media English Text

L12-1125  [bib]: Maristella Agosti; Birgit Alber; Giorgio Maria Di Nunzio; Marco Dussin; Stefan Rabanus; Alessandra Tomaselli
A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties

L12-1126  [bib]: Alexandros Papangelis; Vangelis Karkaletsis; Fillia Makedon
Evaluation of Online Dialogue Policy Learning Techniques

L12-1127  [bib]: Anoop Kunchukuttan; Shourya Roy; Pratik Patel; Kushal Ladha; Somya Gupta; Mitesh M. Khapra; Pushpak Bhattacharyya
Experiences in Resource Generation for Machine Translation through Crowdsourcing

L12-1128  [bib]: Aitor Gonzalez-Agirre; Egoitz Laparra; German Rigau
Multilingual Central Repository version 3.0

L12-1129  [bib]: Eleftherios Avramidis; Aljoscha Burchardt; Christian Federmann; Maja Popović; Cindy Tscherwinka; David Vilar
Involving Language Professionals in the Evaluation of Machine Translation

L12-1130  [bib]: Paola Velardi; Roberto Navigli; Stefano Faralli; Juana Maria Ruiz-Martinez
A New Method for Evaluating Automatically Learned Terminological Taxonomies

L12-1131  [bib]: Gloria Gagliardi; Edoardo Lombardi Vallauri; Fabio Tamburini
A topologic view of Topic and Focus marking in Italian

L12-1132  [bib]: Sucheta Ghosh; Richard Johansson; Giuseppe Riccardi; Sara Tonelli
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing

L12-1133  [bib]: Verginica Barbu Mititelu
Adding Morpho-semantic Relations to the Romanian Wordnet

L12-1134  [bib]: Ioana Vasilescu; Martine Adda-Decker; Lori Lamel
Cross-lingual studies of ASR errors: paradigms for perceptual evaluations

L12-1135  [bib]: Karin Friberg Heppin; Maria Toporowska Gronostaj
The Rocky Road towards a Swedish FrameNet - Creating SweFN

L12-1136  [bib]: Dietmar Schabus; Michael Pucher; Gregor Hofer
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis

L12-1137  [bib]: Przemyslaw Lenkiewicz; Binyam Gebrekidan Gebre; Oliver Schreer; Stefano Masneri; Daniel Schneider; Sebastian Tschöpel
AVATecH ― automated annotation through audio and video analysis

L12-1138  [bib]: Violeta Seretan
Acquisition of Syntactic Simplification Rules for French

L12-1139  [bib]: K Saravanan; Monojit Choudhury; Raghavendra Udupa; A Kumaran
An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora

L12-1140  [bib]: Chenhui Chu; Toshiaki Nakazawa; Sadao Kurohashi
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese

L12-1141  [bib]: Carlos Morell; Jorge Vivaldi; Núria Bel
Iula2Standoff: a tool for creating standoff documents for the IULACT

L12-1142  [bib]: André Bittar; Caroline Hagège; Véronique Moriceau; Xavier Tannier; Charles Teissèdre
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement

L12-1143  [bib]: Michel Généreux; Iris Hendrickx; Amália Mendes
Introducing the Reference Corpus of Contemporary Portuguese Online

L12-1144  [bib]: Patrick Ziering; Sina Zarrieß; Jonas Kuhn
A Corpus-based Study of the German Recipient Passive

L12-1145  [bib]: Tom De Smedt; Walter Daelemans
“Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.

L12-1146  [bib]: Kristiina Muhonen; Tanja Purtonen
Rule-Based Detection of Clausal Coordinate Ellipsis

L12-1147  [bib]: Xavier Tannier; Véronique Moriceau; Béatrice Arnulphy; Ruixin He
Evolution of Event Designation in Media: Preliminary Study

L12-1148  [bib]: Anselmo Peñas; Eduard Hovy; Pamela Forner; Álvaro Rodrigo; Richard Sutcliffe; Corina Forascu; Caroline Sporleder
Evaluating Machine Reading Systems through Comprehension Tests

L12-1149  [bib]: Xinkai Wang; Paul Thompson; Jun'ichi Tsujii; Sophia Ananiadou
Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries

L12-1150  [bib]: Aitor Gonzalez-Agirre; Mauro Castillo; German Rigau
A proposal for improving WordNet Domains

L12-1151  [bib]: Henk van den Heuvel; Eric Sanders; Robin Rutten; Stef Scagliola; Paula Witkamp
An Oral History Annotation Tool for INTER-VIEWs

L12-1152  [bib]: Matti Karppa; Tommi Jantunen; Ville Viitaniemi; Jorma Laaksonen; Birgitta Burger; Danny De Weerdt
Comparing computer vision analysis of signed language video with motion capture recordings

L12-1153  [bib]: Juan Pablo Martínez Cortés; Jim O'Regan; Francis Tyers
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese

L12-1154  [bib]: Dirk Goldhahn; Thomas Eckart; Uwe Quasthoff
Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages

L12-1155  [bib]: Thomas Ulrich Christiansen; Peter Juel Henrichsen
Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies

L12-1156  [bib]: Lluis-F. Hurtado; Fernando Garcia; Emilio Sanchis; Encarna Segarra
The acquisition and dialog act labeling of the EDECAN-SPORTS corpus

L12-1157  [bib]: Alexander Schmitt; Stefan Ultes; Wolfgang Minker
A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System

L12-1158  [bib]: Livio Robaldo; Jakub Szymanik
Pragmatic identification of the witness sets

L12-1159  [bib]: Christian Smith; Henrik Danielsson; Arne Jönsson
A good space: Lexical predictors in word space evaluation

L12-1160  [bib]: Jan Berka; Ondřej Bojar; Mark Fishel; Maja Popović; Daniel Zeman
Automatic MT Error Analysis: Hjerson Helping Addicter

L12-1161  [bib]: Daniele Pighin; Lluís Màrquez; Jonathan May
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output

L12-1162  [bib]: Mojgan Seraji; Beáta Megyesi; Joakim Nivre
A Basic Language Resource Kit for Persian

L12-1163  [bib]: Amit Sangodkar; Om Damani
Re-ordering Source Sentences for SMT

L12-1164  [bib]: Alex Judea; Vivi Nastase; Michael Strube
Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles

L12-1165  [bib]: Behrang QasemiZadeh; Paul Buitelaar; Tianqi Chen; Georgeta Bordea
Semi-Supervised Technical Term Tagging With Minimal User Feedback

L12-1166  [bib]: Olivier Galibert; Sophie Rosset; Cyril Grouin; Pierre Zweigenbaum; Ludovic Quintard
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign

L12-1167  [bib]: Yi Zhang; Rui Wang; Yu Chen
Joint Grammar and Treebank Development for Mandarin Chinese with HPSG

L12-1168  [bib]: Zygmunt Vetulani
Wordnet Based Lexicon Grammar for Polish

L12-1169  [bib]: Annette Rios; Anne Göhring
A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank

L12-1170  [bib]: Kseniya Zablotskaya; Fernando Fernández Martínez; Wolfgang Minker
Investigating Verbal Intelligence Using the TF-IDF Approach

L12-1171  [bib]: Kseniya Zablotskaya; Umair Rahim; Fernando Fernández Martínez; Wolfgang Minker
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores

L12-1172  [bib]: Sanja Štajner; Ruslan Mitkov
Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach

L12-1173  [bib]: Ângela Costa; Tiago Luís; Joana Ribeiro; Ana Cristina Mendes; Luísa Coheur
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT

L12-1174  [bib]: Peter Juel Henrichsen; Marcus Uneson
SMALLWorlds -- Multilingual Content-Controlled Monologues

L12-1175  [bib]: Silvie Cinková; Martin Holub; Adam Rambousek; Lenka Smejkalová
A database of semantic clusters of verb usages

L12-1176  [bib]: Malin Ahlberg; Ramona Enache
Combining Language Resources Into A Grammar-Driven Swedish Parser

L12-1177  [bib]: Elizabeth Baran; Yaqin Yang; Nianwen Xue
Annotating dropped pronouns in Chinese newswire text

L12-1178  [bib]: Maria Aloni; Andreas van Cranenburgh; Raquel Fernandez; Marta Sznajder
Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions

L12-1179  [bib]: Kanika Gupta; Monojit Choudhury; Kalika Bali
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics

L12-1180  [bib]: Marie-Claude L'Homme; Janine Pimentel
Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology

L12-1181  [bib]: Daniele Pighin; Lluís Màrquez; Lluís Formiga
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output

L12-1182  [bib]: Steven Bethard; Oleksandr Kolomiyets; Marie-Francine Moens
Annotating Story Timelines as Temporal Dependency Structures

L12-1183  [bib]: Julien Seinturier; Elisabeth Murisasco; Emmanuel Bruno; Philippe Blache
An ontological approach to model and query multimodal concurrent linguistic annotations

L12-1184  [bib]: António Branco; Catarina Carvalheiro; Sílvia Pereira; Sara Silveira; João Silva; Sérgio Castro; João Graça
A PropBank for Portuguese: the CINTIL-PropBank

L12-1185  [bib]: Montse Cuadros; Lluís Padró; German Rigau
Highlighting relevant concepts from Topic Signatures

L12-1186  [bib]: Ranka Stanković; Cvetana Krstev; Ivan Obradović; Aleksandra Trtovac; Miloš Utvić
A tool for enhanced search of multilingual digital libraries of e-journals

L12-1187  [bib]: Pedro Fialho; Sérgio Curto; Ana Cristina Mendes; Luísa Coheur
Extending a wordnet framework for simplicity and scalability

L12-1188  [bib]: Tommaso Fornaciari; Massimo Poesio
DeCour: a corpus of DEceptive statements in Italian COURts

L12-1189  [bib]: Teresa Lynn; Ozlem Cetinoglu; Jennifer Foster; Elaine Uí Dhonnchadha; Mark Dras; Josef van Genabith
Irish Treebanking and Parsing: A Preliminary Evaluation

L12-1190  [bib]: Georgeta Bordea; Sabrina Kirrane; Paul Buitelaar; Bianca Pereira
Expertise Mining for Enterprise Content Management

L12-1191  [bib]: Mehmet Talha Çakmak; Süleyman Acar; Gülşen Eryiğit
Word Alignment for English-Turkish Language Pair

L12-1192  [bib]: Nelly Barbot; Olivier Boeffard; Arnaud Delhay
Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora

L12-1193  [bib]: Radu Ion
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora

L12-1194  [bib]: Mohammad Hoseyn Sheykholeslam; Behrouz Minaei-Bidgoli; Hossein Juzi
A Framework for Spelling Correction in Persian Language Using Noisy Channel Model

L12-1195  [bib]: Gilles Sérasset
Dbnary: Wiktionary as a LMF based Multilingual RDF network

L12-1196  [bib]: Dae-Lim Choi; Bong-Wan Kim; Yeon-Whoa Kim; Yong-Ju Lee; Yongnam Um; Minhwa Chung
Dysarthric Speech Database for Development of QoLT Software Technology

L12-1197  [bib]: Yunqing Xia; Guoyu Tang; Peng Jin; Xia Yang
CLTC: A Chinese-English Cross-lingual Topic Corpus

L12-1198  [bib]: Tommaso Caselli; Francesco Rubino; Francesca Frontini; Irene Russo; Valeria Quochi
Customizable SCF Acquisition in Italian

L12-1199  [bib]: Iris Merkus; Florian Schiel
Statistical Evaluation of Pronunciation Encoding

L12-1200  [bib]: Atro Voutilainen
Improving corpus annotation productivity: a method and experiment with interactive tagging

L12-1201  [bib]: Nava Maroto; Marie-Claude L'Homme; Amparo Alcina
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation

L12-1202  [bib]: Riccardo Del Gratta; Francesca Frontini; Francesco Rubino; Irene Russo; Nicoletta Calzolari
The Language Library: supporting community effort for collective resource production

L12-1203  [bib]: Benoît Weber; Geneviève Caelen-Haumont; Binh Hai Pham; Do-Dat Tran
MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels

L12-1204  [bib]: Miriam Kaeshammer; Vera Demberg
German and English Treebanks and Lexica for Tree-Adjoining Grammars

L12-1205  [bib]: Helen Kaiyun Chen
Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency

L12-1206  [bib]: Steve Cassidy; Michael Haugh; Pam Peters; Mark Fallu
The Australian National Corpus: National Infrastructure for Language Resources

L12-1207  [bib]: Hongzhi Xu; Helen Kaiyun Chen; Chu-Ren Huang; Qin Lu; Dingxu Shi; Tin-Shing Chiu
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies

L12-1208  [bib]: Han Sloetjes; Aarthy Somasundaram
ELAN development, keeping pace with communities' needs

L12-1209  [bib]: Ching-Sheng Lin; Zumrut Akcam; Samira Shaikh; Sharon Small; Ken Stahl; Tomek Strzalkowski; Nick Webb
Revealing Contentious Concepts Across Social Groups

L12-1210  [bib]: Fabrizio Borgia; Claudia S. Bianchini; Patrice Dalle; Maria De Marsico
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)

L12-1211  [bib]: Balamuraliar; Aditya Joshi; Pushpak Bhattacharyya
Cost and Benefit of Using WordNet Senses for Sentiment Analysis

L12-1212  [bib]: Nuno Cardoso
Rembrandt - a named-entity recognition framework

L12-1213  [bib]: Bogdan Sacaleanu; Günter Neumann
An Adaptive Framework for Named Entity Combination

L12-1214  [bib]: Philippe Langlais; Patrick Drouin; Amélie Paulus; Eugénie Rompré Brodeur; Florent Cottin
Texto4Science: a Quebec French Database of Annotated Short Text Messages

L12-1215  [bib]: Eric Sanders
Collecting and Analysing Chats and Tweets in SoNaR

L12-1216  [bib]: Magdalena Rysova
Alternative Lexicalizations of Discourse Connectives in Czech

L12-1217  [bib]: Kikuo Maekawa
Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system

L12-1218  [bib]: Maria Teresa Pazienza; Armando Stellato; Andrea Turbati
PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases

L12-1219  [bib]: Jannik Strötgen; Michael Gertz
Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards

L12-1220  [bib]: Monica Lestari Paramita; Paul Clough; Ahmet Aker; Robert Gaizauskas
Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles

L12-1221  [bib]: Miriam Buendía-Castro; Beatriz Sánchez-Cárdenas
Linguistic knowledge for specialized text production

L12-1222  [bib]: Massimo Moneglia; Monica Monachini; Omar Calabrese; Alessandro Panunzi; Francesca Frontini; Gloria Gagliardi; Irene Russo
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation

L12-1223  [bib]: Daniel Zeman; David Mareček; Martin Popel; Loganathan Ramasamy; Jan Štěpánek; Zdeněk Žabokrtský; Jan Hajič
HamleDT: To Parse or Not to Parse?

L12-1224  [bib]: Lluís Padró; Evgeny Stanilovsky
FreeLing 3.0: Towards Wider Multilinguality

L12-1225  [bib]: Manny Rayner; Pierrette Bouillon; Johanna Gerlach
Evaluating Appropriateness Of System Responses In A Spoken CALL Game

L12-1226  [bib]: Matthew Fuchs; Nikos Tsourakis; Manny Rayner
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems

L12-1227  [bib]: Dieter van Uytvanck; Herman Stehouwer; Lari Lampen
Semantic metadata mapping in practice: the Virtual Language Observatory

L12-1228  [bib]: Eiríkur Rögnvaldsson; Anton Karl Ingason; Einar Freyr Sigurðsson; Joel Wallenberg
The Icelandic Parsed Historical Corpus (IcePaHC)

L12-1229  [bib]: Ashwini Vaidya; Jinho D. Choi; Martha Palmer; Bhuvana Narasimhan
Empty Argument Insertion in the Hindi PropBank

L12-1230  [bib]: Patrick Paroubek; Xavier Tannier
A Rough Set Formalization of Quantitative Evaluation with Ambiguity

L12-1231  [bib]: Eleftherios Avramidis; Marta R. Costa-Jussà; Christian Federmann; Josef van Genabith; Maite Melero; Pavel Pecina
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation

L12-1232  [bib]: Tomaž Erjavec
The goo300k corpus of historical Slovene

L12-1233  [bib]: Michał Marcińczuk; Jan Kocoń; Bartosz Broda
Inforex -- a web-based tool for text corpus management and semantic annotation

L12-1234  [bib]: Myriam Rakho; Éric Laporte; Matthieu Constant
A new semantically annotated corpus with syntactic-semantic and cross-lingual senses

L12-1235  [bib]: Jan Odijk
Recent Developments in CLARIN-NL

L12-1236  [bib]: Lionel Nicolas; Jacques Farré; Cécile Darme
Unsupervised acquisition of concatenative morphology

L12-1237  [bib]: Leon Derczynski; Hector Llorens; Estela Saquete
Massively Increasing TIMEX3 Resources: A Transduction Approach

L12-1238  [bib]: Peter Exner; Pierre Nugues
Constructing Large Proposition Databases

L12-1239  [bib]: Rasmus Sundberg; Anders Eriksson; Johan Bini; Pierre Nugues
Visualizing Sentiment Analysis on a User Forum

L12-1240  [bib]: Binyam Gebrekidan Gebre; Peter Wittenburg; Przemyslaw Lenkiewicz
Towards Automatic Gesture Stroke Detection

L12-1241  [bib]: Richard Johansson; Karin Friberg Heppin; Dimitrios Kokkinakis
Semantic Role Labeling with the Swedish FrameNet

L12-1242  [bib]: Loganathan Ramasamy; Zdeněk Žabokrtský
Prague Dependency Style Treebank for Tamil

L12-1243  [bib]: Jörg Tiedemann; Dorte Haltrup Hansen; Lene Offersgaard; Sussi Olsen; Matthias Zumpe
A Distributed Resource Repository for Cloud-Based Machine Translation

L12-1244  [bib]: Patricia Gonçalves; Rita Santos; António Branco
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese

L12-1245  [bib]: David Graff; Mohamed Maamouri
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects

L12-1246  [bib]: Jörg Tiedemann
Parallel Data, Tools and Interfaces in OPUS

L12-1247  [bib]: Elias Iosif; Alexandros Potamianos
SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks

L12-1248  [bib]: Emad Mohamed; Behrang Mohit; Kemal Oflazer
Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic

L12-1249  [bib]: Julia Maria Schulz; Daniela Becks; Christa Womser-Hacker; Thomas Mandl
A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content

L12-1250  [bib]: Mathieu-Henri Falco; Véronique Moriceau; Anne Vilnat
Kitten: a tool for normalizing HTML and extracting its textual content

L12-1251  [bib]: Emanuel Dima; Christina Hoppermann; Erhard Hinrichs; Thorsten Trippel; Claus Zinn
A Metadata Editor to Support the Description of Linguistic Resources

L12-1252  [bib]: Luz Rello; Iria Gayo
A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality

L12-1253  [bib]: Emanuel Dima; Verena Henrich; Erhard Hinrichs; Marie Hinrichs; Christina Hoppermann; Thorsten Trippel; Thomas Zastrow; Claus Zinn
A Repository for the Sustainable Management of Research Data

L12-1254  [bib]: Montserrat Arza; José M. García-Miguel; Francisco Campillo; Miguel Cuevas - Alonso
A Galician Syntactic Corpus with Application to Intonation Modeling

L12-1255  [bib]: Tafseer Ahmed; Miriam Butt; Annette Hautli; Sebastian Sulger
A Reference Dependency Bank for Analyzing Complex Predicates

L12-1256  [bib]: Judith Eckle-Kohler; Iryna Gurevych; Silvana Hartmann; Michael Matuschek; Christian M. Meyer
UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF

L12-1257  [bib]: Thomas Eckart; Uwe Quasthoff; Dirk Goldhahn
The Influence of Corpus Quality on Statistical Measurements on Language Resources

L12-1258  [bib]: Marianna Apidianaki; Benoît Sagot
Applying cross-lingual WSD to wordnet development

L12-1259  [bib]: Pierrette Bouillon; Elisabetta Jezek; Chiara Melloni; Aurélie Picton
Annotating Qualia Relations in Italian and French Complex Nominals

L12-1260  [bib]: Mark Fishel; Ondřej Bojar; Maja Popović
Terra: a Collection of Translation Error-Annotated Corpora

L12-1261  [bib]: Olga Babko-Malaya; Greg Milette; Michael Schneider; Sarah Scogin
Identifying Nuggets of Information in GALE Distillation Evaluation

L12-1262  [bib]: Ibrahim Saygin Topkaya; Hakan Erdogan
SUTAV: A Turkish Audio-Visual Database

L12-1263  [bib]: Sergey Zablotskiy; Alexander Shvets; Maxim Sidorov; Eugene Semenkin; Wolfgang Minker
Speech and Language Resources for LVCSR of Russian

L12-1264  [bib]: Luis Javier Rodriguez-Fuentes; Mikel Penagarikano; Amparo Varona; Mireia Diez; German Bordel
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments

L12-1265  [bib]: Stephen Grimes; Katherine Peterson; Xuansong Li
Automatic word alignment tools to scale production of manually aligned parallel texts

L12-1266  [bib]: Jolanta Bachan
Developing and evaluating an emergency scenario dialogue corpus

L12-1267  [bib]: Robert Dale; George Narroway
A Framework for Evaluating Text Correction

L12-1268  [bib]: Gregor Thurmair; Vera Aleksic; Christoph Schwarz
Large Scale Lexical Analysis

L12-1269  [bib]: Chieh-Jen Wang; Shuk-Man Cheng; Lung-Hao Lee; Hsin-Hsi Chen; Wen-shen Liu; Pei-Wen Huang; Shih-Peng Lin
NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks

L12-1270  [bib]: Guillaume Gravier; Gilles Adda; Niklas Paulsson; Matthieu Carré; Aude Giraudel; Olivier Galibert
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language

L12-1271  [bib]: Núria Bel; Lauren Romeo; Muntsa Padró
Automatic lexical semantic classification of nouns

L12-1272  [bib]: Ernesto William De Luca
Is it Useful to Support Users with Lexical Resources? A User Study.

L12-1273  [bib]: Rita Marinelli; Laura Cignoni
In the same boat and other idiomatic seafaring expressions

L12-1274  [bib]: Ulrich Andersen; Anna Braasch; Lina Henriksen; Csaba Huszka; Anders Johannsen; Lars Kayser; Bente Maegaard; Ole Norgaard; Stefan Schulz; Jürgen Wedekind
Creation and use of Language Resources in a Question-Answering eHealth System

L12-1275  [bib]: Lina M. Rojas-Barahona; Alejandra Lorenzo; Claire Gardent
Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents

L12-1276  [bib]: Marion Potet; Emmanuelle Esperança-Rodier; Laurent Besacier; Hervé Blanchon
Collection of a Large Database of French-English SMT Output Corrections

L12-1277  [bib]: Boris Haselbach; Wolfgang Seeker; Kerstin Eckart
German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data

L12-1278  [bib]: Els Lefever; Veronique Hoste; Martine De Cock
Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation

L12-1279  [bib]: Saab Mansour; Hermann Ney
Arabic-Segmentation Combination Strategies for Statistical Machine Translation

L12-1280  [bib]: Jan Hajič; Eva Hajičová; Jarmila Panevová; Petr Sgall; Ondřej Bojar; Silvie Cinková; Eva Fučíková; Marie Mikulová; Petr Pajas; Jan Popelka; Jiří Semecký; Jana Šindlerová; Jan Štěpánek; Josef Toman; Zdeňka Urešová; Zdeněk Žabokrtský
Announcing Prague Czech-English Dependency Treebank 2.0

L12-1281  [bib]: Paul Felt; Eric Ringger; Kevin Seppi; Kristian Heal; Robbie Haertel; Deryle Lonsdale
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis

L12-1282  [bib]: Emina Kurtic; Bill Wells; Guy J. Brown; Timothy Kempton; Ahmet Aker
A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English

L12-1283  [bib]: Antton Gurrutxaga; Iñaki Alegria
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques

L12-1284  [bib]: Jorge Vivaldi; Luis Adrián Cabrera-Diego; Gerardo Sierra; María Pozzi
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks

L12-1285  [bib]: Javier Caminero; Mari Carmen Rodríguez; Jean Vanderdonckt; Fabio Paternò; Joerg Rett; Dave Raggett; Jean-Loup Comeliau; Ignacio Marín
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends

L12-1286  [bib]: Amalia Todirascu; Sebastian Pado; Jennifer Krisch; Max Kisselew; Ulrich Heid
French and German Corpora for Audience-based Text Type Classification

L12-1287  [bib]: Montserrat Marimon; Beatríz Fisas; Núria Bel; Marta Villegas; Jorge Vivaldi; Sergi Torner; Mercè Lorente; Silvia Vázquez; Marta Villegas
The IULA Treebank

L12-1288  [bib]: Iris Hendrickx; Amália Mendes; Silvia Mencarelli
Modality in Text: a Proposal for Corpus Annotation

L12-1289  [bib]: Maria Skeppstedt; Maria Kvist; Hercules Dalianis
Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text

L12-1290  [bib]: Md. Faisal Mahbub Chowdhury; Alberto Lavelli
An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction

L12-1291  [bib]: Herman Stehouwer; Matej Durco; Eric Auer; Daan Broeder
Federated Search: Towards a Common Search Infrastructure

L12-1292  [bib]: Elsa Tolone; Benoît Sagot; Éric Villemonte de la Clergerie
Evaluating and improving syntactic lexica by plugging them within a parser

L12-1293  [bib]: Jing Guang Han; Emer Gilmartin; Celine DeLooze; Brian Vaughan; Nick Campbell
The Herme Database of Spontaneous Multimodal Human-Robot Dialogues

L12-1294  [bib]: Víctor M. Sánchez-Cartagena; Miquel Esplà-Gomis; Juan Antonio Pérez-Ortiz
Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation

L12-1295  [bib]: Thomas Schmidt
EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language

L12-1296  [bib]: Harry Bunt; Jan Alexandersson; Jae-Woong Choe; Alex Chengyu Fang; Koiti Hasida; Volha Petukhova; Andrei Popescu-Belis; David Traum
ISO 24617-2: A semantically-based standard for dialogue annotation

L12-1297  [bib]: Inga Gheorghita; Jean-Marie Pierrel
Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary

L12-1298  [bib]: Natalia Konstantinova; Sheila C.M. de Sousa; Noa P. Cruz; Manuel J. Maña; Maite Taboada; Ruslan Mitkov
A review corpus annotated for negation, speculation and their scope

L12-1299  [bib]: Valerio Basile; Johan Bos; Kilian Evang; Noortje Venhuizen
Developing a large semantically annotated corpus

L12-1300  [bib]: Marc Kemps-Snijders; Matthijs Brouwer; Jan Pieter Kunst; Tom Visser
Dynamic web service deployment in a cloud environment

L12-1301  [bib]: Elias Iosif; Maria Giannoudaki; Eric Fosler-Lussier; Alexandros Potamianos
Associative and Semantic Features Extracted From Web-Harvested Corpora

L12-1302  [bib]: Maaske Treurniet; Orphée De Clercq; Henk van den Heuvel; Nelleke Oostdijk
Collection of a corpus of Dutch SMS

L12-1303  [bib]: Claudia Marzi; Marcello Ferro; Claudia Caudai; Vito Pirrelli
Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access

L12-1304  [bib]: Nikos Tsourakis; Manny Rayner
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System

L12-1305  [bib]: Marc Poch; Antonio Toral; Olivier Hamon; Valeria Quochi; Núria Bel
Towards a User-Friendly Platform for Building Language Resources based on Web Services

L12-1306  [bib]: John McCrae; Elena Montiel-Ponsoda; Philipp Cimiano
Collaborative semantic editing of linked data lexica

L12-1307  [bib]: Pablo Mendes; Joachim Daiber; Rohana Rajapakse; Felix Sasaki; Christian Bizer
Evaluating the Impact of Phrase Recognition on Concept Tagging

L12-1308  [bib]: Willem Elbers; Daan Broeder; Dieter van Uytvanck
Proper Language Resource Centers

L12-1309  [bib]: Alessandro Panunzi; Marco Fabbri; Massimo Moneglia; Lorenzo Gregori; Samuele Paladini
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building

L12-1310  [bib]: Karën Fort; Claire François; Olivier Galibert; Maha Ghribi
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign

L12-1311  [bib]: Dietmar Rösner; Jörg Frommer; Rafael Friesen; Matthias Haase; Julia Lange; Mirko Otto
LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions

L12-1312  [bib]: Juliette Thuilier; Laurence Danlos
Semantic annotation of French corpora: animacy and verb semantic classes

L12-1313  [bib]: Wei Wang; Romaric Besançon; Olivier Ferret; Brigitte Grau
Evaluation of Unsupervised Information Extraction

L12-1314  [bib]: Houda Bouamor; Aurélien Max; Gabriel Illouz; Anne Vilnat
A contrastive review of paraphrase acquisition techniques

L12-1315  [bib]: Mohamed Maamouri; Ann Bies; Seth Kulick
Expanding Arabic Treebank to Speech: Results from Broadcast News

L12-1316  [bib]: Paul Rodrigues; C. Anton Rytting
Typing Race Games as a Method to Create Spelling Error Corpora

L12-1317  [bib]: Anita Alicante; Cristina Bosco; Anna Corazza; Alberto Lavelli
A treebank-based study on the influence of Italian word order on parsing performance

L12-1318  [bib]: Kallirroi Georgila; Alan Black; Kenji Sagae; David Traum
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems

L12-1319  [bib]: Hiroaki Sato
A Search Tool for FrameNet Constructicon

L12-1320  [bib]: Stéphanie Weiser; Patrick Watrin
Extraction of unmarked quotations in Newspapers

L12-1321  [bib]: Christophe Roche
Ontoterminology: How to unify terminology and ontology into a single paradigm

L12-1322  [bib]: Donia Scott; Rossano Barone; Rob Koeling
Corpus Annotation as a Scientific Task

L12-1323  [bib]: Pablo Mendes; Max Jakob; Christian Bizer
DBpedia: A Multilingual Cross-domain Knowledge Base

L12-1324  [bib]: Cheikh M. Bamba Dione
A Morphological Analyzer For Wolof Using Finite-State Techniques

L12-1325  [bib]: Leonardo Campillos Llanos
Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation

L12-1326  [bib]: Carla Parra Escartín
Design and compilation of a specialized Spanish-German parallel corpus

L12-1327  [bib]: Minoru Sasaki; Hiroyuki Shinnou
Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples

L12-1328  [bib]: Nizar Habash; Mona Diab; Owen Rambow
Conventional Orthography for Dialectal Arabic

L12-1329  [bib]: Daan Broeder; Dieter van Uytvanck; Maria Gavrilidou; Thorsten Trippel; Menzo Windhouwer
Standardizing a Component Metadata Infrastructure

L12-1330  [bib]: Ahmet Aker; Mahmoud El-Haj; M-Dyaa Albakour; Udo Kruschwitz
Assessing Crowdsourcing Quality through Objective Tasks

L12-1331  [bib]: Sabine Schulte im Walde; Susanne Borgwaldt; Ronny Jauch
Association Norms of German Noun Compounds

L12-1332  [bib]: Bharat Ram Ambati; Siva Reddy; Adam Kilgarriff
Word Sketches for Turkish

L12-1333  [bib]: Eckhard Bick; Heliana Mello; Alessandro Panunzi; Tommaso Raso
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser

L12-1334  [bib]: Svetla Koeva; Ivelina Stoyanova; Rositsa Dekova; Borislav Rizov; Angel Genov
Bulgarian X-language Parallel Corpus

L12-1335  [bib]: Rebecca J. Passonneau; Collin F. Baker; Christiane Fellbaum; Nancy Ide
The MASC Word Sense Corpus

L12-1336  [bib]: Cristina Mota; Alberto Simões; Cláudia Freitas; Luís Costa; Diana Santos
Págico: Evaluating Wikipedia-based information retrieval in Portuguese

L12-1337  [bib]: Alexandre Denis; Ingrid Falk; Claire Gardent; Laura Perez-Beltrachini
Representation of linguistic and domain knowledge for second language learning in virtual worlds

L12-1338  [bib]: Xin Zuo; Tian Li; Pascale Fung
A Multilingual Natural Stress Emotion Database

L12-1339  [bib]: Priti Aggarwal; Ron Artstein; Jillian Gerten; Athanasios Katsamanis; Shrikanth Narayanan; Angela Nazarian; David Traum
The Twins Corpus of Museum Visitor Questions

L12-1340  [bib]: Chi-Hsin Yu; Yi-jie Tang; Hsin-Hsi Chen
Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information

L12-1341  [bib]: Doaa Samy; Antonio Moreno-Sandoval; Conchi Bueno-Díaz; Marta Garrote-Salazar; José M. Guirao
Medical Term Extraction in an Arabic Medical Corpus

L12-1342  [bib]: Michael Kipp
Annotation Facilities for the Reliable Analysis of Human Motion

L12-1343  [bib]: Darja Fišer; Nikola Ljubešić; Ozren Kubelka
Addressing polysemy in bilingual lexicon extraction from comparable corpora

L12-1344  [bib]: Khaled Shaalan; Mohammed Attia; Pavel Pecina; Younes Samih; Josef van Genabith
Arabic Word Generation and Modelling for Spell Checking

L12-1345  [bib]: Yasuharu Den; Hanae Koiso; Katsuya Takanashi; Nao Yoshida
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations

L12-1346  [bib]: Enikő Héja; Dávid Takács
Automatically Generated Online Dictionaries

L12-1347  [bib]: Takahiro Miyajima; Hideaki Kikuchi; Katsuhiko Shirai; Shigeki Okawa
Method for Collection of Acted Speech Using Various Situation Scripts

L12-1348  [bib]: Daan Broeder; Dieter van Uytvanck; Gunter Senft
Citing on-line Language Resources

L12-1349  [bib]: Mohammed Attia; Khaled Shaalan; Lamia Tounsi; Josef van Genabith
Automatic Extraction and Evaluation of Arabic LFG Resources

L12-1350  [bib]: Matthieu Constant; Isabelle Tellier
Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger

L12-1351  [bib]: Brett Drury; José João Almeida
The Minho Quotation Resource

L12-1352  [bib]: Michael Carl
Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research

L12-1353  [bib]: Brian MacWhinney
Morphosyntactic Analysis of the CHILDES and TalkBank Corpora

L12-1354  [bib]: Jan Rygl; Aleš Horák
Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification

L12-1355  [bib]: Ritesh Kumar
Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi

L12-1356  [bib]: Alessandro Lenci; Gabriella Lapesa; Giulia Bonansinga
LexIt: A Computational Resource on Italian Argument Structure

L12-1357  [bib]: Karën Fort; Vincent Claveau
Annotating Football Matches: Influence of the Source Medium on Manual Annotation

L12-1358  [bib]: Tommaso Raso; Heliana Mello; Maryualê Malvessi Mittmann
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese

L12-1359  [bib]: Ahmet Aker; Evangelos Kanoulas; Robert Gaizauskas
A light way to collect comparable corpora from the Web

L12-1360  [bib]: Maite Melero; Marta R. Costa-Jussà; Judith Domingo; Montse Marquina; Martí Quixal
Holaaa!! writin like u talk is kewl but kinda hard 4 NLP

L12-1361  [bib]: Danica Damljanovic; Udo Kruschwitz; M-Dyaa Albakour; Johann Petrak; Mihai Lupu
Applying Random Indexing to Structured Data to Find Contextually Similar Words

L12-1362  [bib]: Marilisa Amoia; Kerstin Kunz; Ekaterina Lapshinova-Koltunski
Coreference in Spoken vs. Written Texts: a Corpus-based Analysis

L12-1363  [bib]: Olivier Boeffard; Laure Charonnat; Sébastien Le Maguer; Damien Lolive
Towards Fully Automatic Annotation of Audio Books for TTS

L12-1364  [bib]: Ian Lewin; Şenay Kafkas; Dietrich Rebholz-Schuhmann
Centroids: Gold standards with distributional variation

L12-1365  [bib]: Costanza Navarretta; Patrizia Paggio
Multimodal Behaviour and Feedback in Different Types of Interaction

L12-1366  [bib]: David Lewis; Alexander O'Connor; Andrzej Zydroń; Gerd Sjögren; Rahzeb Choudhury
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market

L12-1367  [bib]: Ivana Tanasijević; Biljana Sikimić; Gordana Pavlović-Lažetić
Multimedia database of the cultural heritage of the Balkans

L12-1368  [bib]: Frederic Landragin; Thierry Poibeau; Bernard Victorri
ANALEC: a New Tool for the Dynamic Annotation of Textual Data

L12-1369  [bib]: Costanza Navarretta; Elisabeth Ahlsén; Jens Allwood; Kristiina Jokinen; Patrizia Paggio
Feedback in Nordic First-Encounters: a Comparative Study

L12-1370  [bib]: Hong Li; Xiwen Cheng; Kristina Adson; Tal Kirshboim; Feiyu Xu
Annotating Opinions in German Political News

L12-1371  [bib]: Yu Chen; Andreas Eisele
MultiUN v2: UN Documents with Multilingual Alignments

L12-1372  [bib]: Horacio Saggion; Sandra Szasz
The CONCISUS Corpus of Event Summaries

L12-1373  [bib]: Gracinda Carvalho; David Martins de Matos; Vitor Rocio
Building and Exploring Semantic Equivalences Resources

L12-1374  [bib]: Septina Dian Larasati
IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus

L12-1375  [bib]: Ondřej Bojar; Zdeněk Žabokrtský; Ondřej Dušek; Petra Galuščáková; Martin Majliš; David Mareček; Jiří Maršík; Michal Novák; Martin Popel; Aleš Tamchyna
The Joy of Parallelism with CzEng 1.0

L12-1376  [bib]: Kais Dukes; Eric Atwell
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis

L12-1377  [bib]: Maciej Ogrodniczuk; Michał Lenart
Web Service integration platform for Polish linguistic resources

L12-1378  [bib]: Casey Redd Kennington; Martin Kay; Annemarie Friedrich
Suffix Trees as Language Models

L12-1379  [bib]: Liviu P. Dinu; Vlad Niculae; Octavia-Maria Şulea
The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System

L12-1380  [bib]: Soojeong Eom; Markus Dickinson; Graham Katz
Using semi-experts to derive judgments on word sense alignment: a pilot study

L12-1381  [bib]: Maciej Ogrodniczuk
The Polish Sejm Corpus

L12-1382  [bib]: Dawn Lawrie; James Mayfield; Paul McNamee; Douglas Oard
Creating and Curating a Cross-Language Person-Entity Linking Collection

L12-1383  [bib]: Marc Verhagen; James Pustejovsky
The TARSQI Toolkit

L12-1384  [bib]: Annie Louis; Ani Nenkova
A corpus of general and specific sentences from news

L12-1385  [bib]: Jonathan Wright; Kira Griffitt; Joe Ellis; Stephanie Strassel; Brendan Callahan
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure

L12-1386  [bib]: Elena Filatova
Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing

L12-1387  [bib]: Rania Al-Sabbagh; Roxana Girju
YADAC: Yet another Dialectal Arabic Corpus

L12-1388  [bib]: James Clarke; Vivek Srikumar; Mark Sammons; Dan Roth
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)

L12-1389  [bib]: Luís Marujo; Anatole Gershman; Jaime Carbonell; Robert Frederking; João P. Neto
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization

L12-1390  [bib]: Akshat Bakliwal; Piyush Arora; Vasudeva Varma
Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification

L12-1391  [bib]: Marta Recasens; M. Antònia Martí; Constantin Orasan
Annotating Near-Identity from Coreference Disagreements

L12-1392  [bib]: Takenobu Tokunaga; Ryu Iida; Asuka Terai; Naoko Kuriyama
The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues

L12-1393  [bib]: Takanori Kusumoto; Tomoyosi Akiba
Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension

L12-1394  [bib]: Anne-Kathrin Schumann
Knowledge-Rich Context Extraction and Ranking with KnowPipe

L12-1395  [bib]: Gozde Ozbal; Carlo Strapparava; Marco Guerini
Brand Pitt: A Corpus to Explore the Art of Naming

L12-1396  [bib]: Orphée De Clercq; Veronique Hoste; Paola Monachesi
Evaluating automatic cross-domain Dutch semantic role annotation

L12-1397  [bib]: Thierry Bazillon; Melanie Deplano; Frederic Bechet; Alexis Nasr; Benoit Favre
Syntactic annotation of spontaneous speech: application to call-center conversation data

L12-1398  [bib]: Hyejin Hong; Sunhee Kim; Minhwa Chung
Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability

L12-1399  [bib]: Frederic Bechet; Benjamin Maza; Nicolas Bigouroux; Thierry Bazillon; Marc El-Beze; Renato De Mori; Eric Arbillot
DECODA: a call-centre human-human spoken conversation corpus

L12-1400  [bib]: Yves Scherrer; Bruno Cartoni
The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction

L12-1401  [bib]: Giovanni Costantini; Andrea Paoloni; Massimiliano Todisco
Intelligibility assessment in forensic applications

L12-1402  [bib]: Gabriella Pardelli; Manuela Sassi; Sara Goggi; Stefania Biagioni
From medical language processing to BioNLP domain

L12-1403  [bib]: Tomoyosi Akiba; Hiromitsu Nishizaki; Kiyoaki Aikawa; Tatsuya Kawahara; Tomoko Matsui
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task

L12-1404  [bib]: Antonio Moreno-Sandoval; Leonardo Campillos Llanos; Yang Dong; Emi Takamori; José M. Guirao; Paula Gozalo; Chieko Kimura; Kengo Matsui; Marta Garrote-Salazar
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese

L12-1405  [bib]: Anthony Rousseau; Paul Deléglise; Yannick Estève
TED-LIUM: an Automatic Speech Recognition dedicated corpus

L12-1406  [bib]: Georgios Petasis
The SYNC3 Collaborative Annotation Tool

L12-1407  [bib]: Valérie Mapelli; Victoria Arranz; Matthieu Carré; Hélène Mazo; Djamel Mostefa; Khalid Choukri
ELRA in the heart of a cooperative HLT world

L12-1408  [bib]: Lambert Patrik; Holger Schwenk; Frédéric Blain
Automatic Translation of Scientific Documents in the HAL Archive

L12-1409  [bib]: Borut Sluban; Senja Pollak; Roel Coesemans; Nada Lavrac
Irregularity Detection in Categorized Document Corpora

L12-1410  [bib]: Aude Giraudel; Matthieu Carré; Valérie Mapelli; Juliette Kahn; Olivier Galibert; Ludovic Quintard
The REPERE Corpus : a multimodal corpus for person recognition

L12-1411  [bib]: Andrea Gesmundo; Tanja Samardzic
Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification

L12-1412  [bib]: Thomas Proisl; Peter Uhrig
Efficient Dependency Graph Matching with the IMS Open Corpus Workbench

L12-1413  [bib]: David Tavarez; Eva Navas; Daniel Erro; Ibon Saratxaga
Strategies to Improve a Speaker Diarisation Tool

L12-1414  [bib]: Atsushi Fujii; Yuya Fujii; Takenobu Tokunaga
Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions

L12-1415  [bib]: Miguel Ballesteros; Joakim Nivre
MaltOptimizer: A System for MaltParser Optimization

L12-1416  [bib]: Alistair Conkie; Thomas Okken; Yeon-Jun Kim; Giuseppe Di Fabbrizio
Building Text-To-Speech Voices in the Cloud

L12-1417  [bib]: Sara Stymne; Lars Ahrenberg
On the practice of error analysis for machine translation evaluation

L12-1418  [bib]: Dasa Berovic; Zeljko Agic; Marko Tadić
Croatian Dependency Treebank: Recent Development and Initial Experiments

L12-1419  [bib]: Tina Kluewer; Feiyu Xu; Peter Adophs; Hans Uszkoreit
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World

L12-1420  [bib]: Marta Villegas; Núria Bel; Carlos Gonzalo; Amparo Moreno; Nuria Simelio
Using Language Resources in Humanities research

L12-1421  [bib]: Petya Osenova; Kiril Simov; Laska Laskova; Stanislava Kancheva
A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian

L12-1422  [bib]: Francesco Rubino; Francesca Frontini; Valeria Quochi
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser

L12-1423  [bib]: Shaohua Yang; Hai Zhao; Xiaolin Wang; Bao-liang Lu
Spell Checking for Chinese

L12-1424  [bib]: Zahurul Islam; Alexander Mehler
Customization of the Europarl Corpus for Translation Studies

L12-1425  [bib]: Carlo Strapparava; Rada Mihalcea; Alberto Battocchi
A Parallel Corpus of Music and Lyrics Annotated with Emotions

L12-1426  [bib]: Elisa Bianchi; Mirko Tavosanis; Emiliano Giovannetti
Creation of a bottom-up corpus-based ontology for Italian Linguistics

L12-1427  [bib]: Sebastian Nordhoff; Harald Hammarström
Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages

L12-1428  [bib]: Carmen Dayrell; Arnaldo Candido Jr.; Gabriel Lima; Danilo Machado Jr.; Ann Copestake; Valéria Feltrim; Stella Tagnin; Sandra Aluisio
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora

L12-1429  [bib]: Martin Aleksandrov; Carlo Strapparava
NgramQuery - Smart Information Extraction from Google N-gram using External Resources

L12-1430  [bib]: Sandra Weiss; Lars Ahrenberg
Error profiling for evaluation of machine-translated text: a Polish-English case study

L12-1431  [bib]: Magdalena Lis
Polish Multimodal Corpus ― a collection of referential gestures

L12-1432  [bib]: Annelies Braffort; Leïla Boutora
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures

L12-1433  [bib]: Matilde Gonzalez; Michael Filhol; Christophe Collet
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs

L12-1434  [bib]: Georgi Iliev; Angel Genov
Expanding Parallel Resources for Medium-Density Languages for Free

L12-1435  [bib]: Andrejs Vasiljevs; Markus Forsberg; Tatiana Gornostay; Dorte Haltrup Hansen; Kristín Jóhannsdóttir; Gunn Lyse; Krister Lindén; Lene Offersgaard; Sussi Olsen; Bolette Pedersen; Eiríkur Rögnvaldsson; Inguna Skadiņa; Koenraad De Smedt; Ville Oksanen; Roberts Rozis
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries

L12-1436  [bib]: Anita Gojun; Ulrich Heid; Bernd Weißbach; Carola Loth; Insa Mingers
Adapting and evaluating a generic term extraction tool

L12-1437  [bib]: Martin Reynaert; Ineke Schuurman; Veronique Hoste; Nelleke Oostdijk; Maarten van Gompel
Beyond SoNaR: towards the facilitation of large corpus building efforts

L12-1438  [bib]: Fabrice Lefèvre; Djamel Mostefa; Laurent Besacier; Yannick Estève; Matthieu Quignard; Nathalie Camelin; Benoit Favre; Bassam Jabaian; Lina M. Rojas-Barahona
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora

L12-1439  [bib]: Rahul Agarwal; Bharat Ram Ambati; Anil Kumar Singh
A GUI to Detect and Correct Errors in Hindi Dependency Treebank

L12-1440  [bib]: Jordi Atserias; Maria Fuentes; Rogelio Nazar; Irene Renau
Spell Checking in Spanish: The Case of Diacritic Accents

L12-1441  [bib]: Udo Hahn; Elena Beisswanger; Ekaterina Buyko; Erik Faessler; Jenny Traumüller; Susann Schröder; Kerstin Hornbostel
Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena

L12-1442  [bib]: Liesbeth Augustinus; Vincent Vandeghinste; Frank Van Eynde
Example-Based Treebank Querying

L12-1443  [bib]: Alvin Grissom II; Yusuke Miyao
Annotating Factive Verbs

L12-1444  [bib]: Markus Dickinson; Scott Ledbetter
Annotating Errors in a Hungarian Learner Corpus

L12-1445  [bib]: Romaric Besançon; Olivier Ferret; Ludovic Jean-Louis
Evaluation of a Complex Information Extraction Application in Specific Domain

L12-1446  [bib]: Stefan Bott; Horacio Saggion; Simon Mille
Text Simplification Tools for Spanish

L12-1447  [bib]: Elisabet Comelles; Jordi Atserias; Victoria Arranz; Irene Castellón
VERTa: Linguistic features in MT evaluation

L12-1448  [bib]: Atro Voutilainen; Kristiina Muhonen; Tanja Purtonen; Krister Lindén
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3

L12-1449  [bib]: Francesco Cutugno; Vincenza Anna Leano; Antonio Origlia
W-PhAMT: A web tool for phonetic multilevel timeline visualization

L12-1450  [bib]: Nicoletta Calzolari; Riccardo Del Gratta; Gil Francopoulo; Joseph Mariani; Francesco Rubino; Irene Russo; Claudia Soria
The LRE Map. Harmonising Community Descriptions of Resources

L12-1451  [bib]: Corina Forascu; Dan Tufiș
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information

L12-1452  [bib]: Matteo Negri; Yashar Mehdad; Alessandro Marchetti; Danilo Giampiccolo; Luisa Bentivogli
Chinese Whispers: Cooperative Paraphrase Acquisition

L12-1453  [bib]: Janne Bondi Johannessen; Joel Priestley; Kristin Hagen; Anders Nøklestad; André Lynum
The Nordic Dialect Corpus

L12-1454  [bib]: Jonathon Read; Dan Flickinger; Rebecca Dridan; Stephan Oepen; Lilja Øvrelid
The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content

L12-1455  [bib]: František Cvrček; Karel Pala; Pavel Rychlý
Legal electronic dictionary for Czech

L12-1456  [bib]: Thomas Kaspersson; Christian Smith; Henrik Danielsson; Arne Jönsson
This also affects the context - Errors in extraction based summaries

L12-1457  [bib]: Claudia Soria; Núria Bel; Khalid Choukri; Joseph Mariani; Monica Monachini; Jan Odijk; Stelios Piperidis; Valeria Quochi; Nicoletta Calzolari
The FLaReNet Strategic Language Resource Agenda

L12-1458  [bib]: Josef Ruppenhofer; Ines Rehbein
Yes we can!? Annotating English modal verbs

L12-1459  [bib]: Merlin Teodosia Suarez; Jocelynn Cu; Madelene Sta. Maria
Building a Multimodal Laughter Database for Emotion Recognition

L12-1460  [bib]: Jörg Frommer; Bernd Michaelis; Dietmar Rösner; Andreas Wendemuth; Rafael Friesen; Matthias Haase; Manuela Kunze; Rico Andrich; Julia Lange; Axel Panning; Ingo Siegert
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus

L12-1461  [bib]: Attila Zséder; Gábor Recski; Dániel Varga; András Kornai
Rapid creation of large-scale corpora and frequency dictionaries

L12-1462  [bib]: Amir Hazem; Emmanuel Morin
Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora

L12-1463  [bib]: Zhiyi Song; Safa Ismael; Stephen Grimes; David Doermann; Stephanie Strassel
Linguistic Resources for Handwriting Recognition and Translation Evaluation

L12-1464  [bib]: Antonio Origlia; Iolanda Alfano
Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication

L12-1465  [bib]: Helmer Strik; Jozef Colpaert; Joost Van Doremalen; Catia Cucchiarini
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond

L12-1466  [bib]: Utku Şirin; Ruket Çakıcı; Deniz Zeyrek
METU Turkish Discourse Bank Browser

L12-1467  [bib]: Piotr Bański; Peter M. Fischer; Elena Frick; Erik Ketzan; Marc Kupietz; Carsten Schnober; Oliver Schonefeld; Andreas Witt
The New IDS Corpus Analysis Platform: Challenges and Prospects

L12-1468  [bib]: Silvia Quarteroni; Vincenzo Guerrisi; Pietro La Torre
Evaluating Multi-focus Natural Language Queries over Data Services

L12-1469  [bib]: Maria Teresa Pazienza; Noemi Scarpato; Armando Stellato
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation

L12-1470  [bib]: Mladen Karan; Jan Šnajder; Bojana Dalbelo Bašić
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian

L12-1471  [bib]: Elena Frick; Carsten Schnober; Piotr Bański
Evaluating Query Languages for a Corpus Processing System

L12-1472  [bib]: Chunqi Shi; Donghui Lin; Masahiko Shimada; Toru Ishida
Two Phase Evaluation for Selecting Machine Translation Services

L12-1473  [bib]: Fangzhong Su; Bogdan Babych
Development and Application of a Cross-language Document Comparability Metric

L12-1474  [bib]: Benjamin Weitz; Ulrich Schäfer
A Graphical Citation Browser for the ACL Anthology

L12-1475  [bib]: Stephen Wattam; Paul Rayson; Damon Berridge
Document Attrition in Web Corpora: an Exploration

L12-1476  [bib]: Anja Belz; Albert Gatt
A Repository of Data and Evaluation Resources for Natural Language Generation

L12-1477  [bib]: Chunqi Shi; Donghui Lin; Toru Ishida
Service Composition Scenarios for Task-Oriented Translation

L12-1478  [bib]: Yoshinobu Kano
Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako

L12-1479  [bib]: Thibault Mondary; Adeline Nazarenko; Haïfa Zargayouna; Sabine Barreaux
The Quaero Evaluation Initiative on Term Extraction

L12-1480  [bib]: Lorenza Russo; Sharid Loáiciga; Asheesh Gulati
Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.

L12-1481  [bib]: Ralf Steinberger; Andreas Eisele; Szymon Klocek; Spyridon Pilos; Patrick Schlüter
DGT-TM: A freely available Translation Memory in 22 languages

L12-1482  [bib]: Heba Elfardy; Mona Diab
Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations

L12-1483  [bib]: Christian Federmann; Ioanna Giannopoulou; Christian Girardi; Olivier Hamon; Dimitris Mavroeidis; Salvatore Minutoli; Marc Schröder
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools

L12-1484  [bib]: Marion Weller; Ulrich Heid
Analyzing and Aligning German compound nouns

L12-1485  [bib]: Raheel Nawaz; Paul Thompson; Sophia Ananiadou
Identification of Manner in Bio-Events

L12-1486  [bib]: Kurt Eberle; Kerstin Eckart; Ulrich Heid; Boris Haselbach
A Tool/Database Interface for Multi-Level Analyses

L12-1487  [bib]: Thierry Declerck; Karlheinz Mörth; Piroska Lendvai
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies

L12-1488  [bib]: Igor Odriozola; Eva Navas; Inma Hernáez; Iñaki Sainz; Ibon Saratxaga; Jon Sánchez; Daniel Erro
Using an ASR database to design a pronunciation evaluation system in Basque

L12-1489  [bib]: Ramona Bongelli; Carla Canestrari; Ilaria Riccioni; Andrzej Zuczkowski; Cinzia Buldorini; Ricardo Pietrobon; Alberto Lavelli; Bernardo Magnini
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty

L12-1490  [bib]: Djamel Mostefa; Khalid Choukri; Sylvie Brunessaux; Karim Boudahmane
New language resources for the Pashto language

L12-1491  [bib]: Ting Liu; Samira Shaikh; Tomek Strzalkowski; Aaron Broadwell; Jennifer Stromer-Galley; Sarah Taylor; Umit Boz; Xiaoai Ren; Jingsi Wu
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language

L12-1492  [bib]: Şenay Kafkas; Ian Lewin; David Milward; Erik van Mulligen; Jan Kors; Udo Hahn; Dietrich Rebholz-Schuhmann
CALBC: Releasing the Final Corpora

L12-1493  [bib]: Jordi Adell; Antonio Bonafonte; Antonio Cardenal; Marta R. Costa-Jussà; José A. R. Fonollosa; Asunción Moreno; Eva Navas; Eduardo R. Banga
BUCEADOR, a multi-language search engine for digital libraries

L12-1494  [bib]: Aleksandar Savkov; Laska Laskova; Stanislava Kancheva; Petya Osenova; Kiril Simov
Linguistic Analysis Processing Line for Bulgarian

L12-1495  [bib]: Jirka Hana; Barbora Hladka
Getting more data -- Schoolkids as annotators

L12-1496  [bib]: Matteo Abrate; Clara Bacciu
Visualizing word senses in WordNet Atlas

L12-1497  [bib]: Roland Schäfer; Felix Bildhauer
Building Large Corpora from the Web Using a New Efficient Tool Chain

L12-1498  [bib]: Stergos Afantenos; Nicholas Asher; Farah Benamara; Myriam Bras; Cecile Fabre; Mai Ho-Dac; Anne Le Draoulec; Philippe Muller; Marie-Paul Pery-Woodley; Laurent Prevot; Josette Rebeyrolles; Ludovic Tanguy; Marianne Vergez-Couret; Laure Vieu
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus

L12-1499  [bib]: Amalia Zahra; Julie Carson-Berndsen
English to Indonesian Transliteration to Support English Pronunciation Practice

L12-1500  [bib]: Kata Gábor; Marianna Apidianaki; Benoît Sagot; Éric Villemonte de la Clergerie
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations

L12-1501  [bib]: Khalid Choukri; Victoria Arranz; Olivier Hamon; Jungyeul Park
Using the International Standard Language Resource Number: Practical and Technical Aspects

L12-1502  [bib]: Claire Jaja; Douglas Briesch; Jamal Laoudi; Clare Voss
Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System

L12-1503  [bib]: Jens Forster; Christoph Schmidt; Thomas Hoyoux; Oscar Koller; Uwe Zelle; Justus Piater; Hermann Ney
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus

L12-1504  [bib]: Roldano Cattoni; Francesco Corcoglioniti; Christian Girardi; Bernardo Magnini; Luciano Serafini; Roberto Zanoli
The KnowledgeStore: an Entity-Based Storage System

L12-1505  [bib]: Khalid Choukri; Victoria Arranz
An Analytical Model of Language Resource Sustainability

L12-1506  [bib]: Antske Fokkens; Tania Avgustinova; Yi Zhang
CLIMB grammars: three projects using metagrammar engineering

L12-1507  [bib]: Young-Min Kim; Patrice Bellot; Elodie Faath; Marin Dacos
Annotated Bibliographical Reference Corpora in Digital Humanities

L12-1508  [bib]: Marie Tahon; Agnes Delaborde; Laurence Devillers
Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis

L12-1509  [bib]: Souhir Gahbiche-Braham; Hélène Bonneau-Maynard; Thomas Lavergne; François Yvon
Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier

L12-1510  [bib]: Hanno Biber; Evelyn Breiteneder
Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies.

L12-1511  [bib]: Natsuko Nakagawa; Yasuharu Den
Annotation of anaphoric relations and topic continuity in Japanese conversation

L12-1512  [bib]: Yoshihiko Hayashi; Chiharu Narawa
Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types

L12-1513  [bib]: Eva Szekely; Joao Paulo Cabral; Mohamed Abou-Zleikha; Peter Cahill; Julie Carson-Berndsen
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases

L12-1514  [bib]: Juan María Garrido; Yesika Laplaza; Montse Marquina; Andrea Pearman; José Gregorio Escalada; Miguel Ángel Rodríguez; Ana Armenta
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech

L12-1515  [bib]: David Elson
DramaBank: Annotating Agency in Narrative Discourse

L12-1516  [bib]: Alessio Bosca; Luca Dini; Milen Kouylekov; Marco Trevisan
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.

L12-1517  [bib]: Shiva Taslimipoor; Afsaneh Fazly; Ali Hamzeh
Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions

L12-1518  [bib]: Victoria Arranz; Olivier Hamon
On the Way to a Legal Sharing of Web Applications in NLP

L12-1519  [bib]: Ralf Steinberger; Mohamed Ebrahim; Marco Turchi
JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool

L12-1520  [bib]: David Doukhan; Sophie Rosset; Albert Rilliard; Christophe d'Alessandro; Martine Adda-Decker
Designing French Tale Corpora for Entertaining Text To Speech Synthesis

L12-1521  [bib]: Helen Aristar-Dry; Sebastian Drude; Menzo Windhouwer; Jost Gippert; Irina Nevskaya
”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project

L12-1522  [bib]: Ronaldo Martins
Le Petit Prince in UNL

L12-1523  [bib]: Andrea Varga; Daniel Preotiuc-Pietro; Fabio Ciravegna
Unsupervised document zone identification using probabilistic graphical models

L12-1524  [bib]: Maria Fuentes; Horacio Rodríguez; Jordi Turmo
Summarizing a multimodal set of documents in a Smart Room

L12-1525  [bib]: Gerard de Melo; Collin F. Baker; Nancy Ide; Rebecca J. Passonneau; Christiane Fellbaum
Empirical Comparisons of MASC Word Sense Annotations

L12-1526  [bib]: Stephanie Strassel; Amanda Morris; Jonathan Fiscus; Christopher Caruso; Haejoong Lee; Paul Over; James Fiumara; Barbara Shaw; Brian Antonishek; Martial Michel
Creating HAVIC: Heterogeneous Audio Visual Internet Collection

L12-1527  [bib]: Dhouha Bouamor; Nasredine Semmar; Pierre Zweigenbaum
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation

L12-1528  [bib]: Gisela Redeker; Ildikó Berzlánovich; Nynke van der Vliet; Gosse Bouma; Markus Egg
Multi-Layer Discourse Annotation of a Dutch Text Corpus

L12-1529  [bib]: Reinhard Rapp; Serge Sharoff; Bogdan Babych
Identifying Word Translations from Comparable Documents Without a Seed Lexicon

L12-1530  [bib]: Sebastian Drude; Daan Broeder; Paul Trilsbeek; Peter Wittenburg
The Language Archive ― a new hub for language resources

L12-1531  [bib]: Mahdi Khademian; Kaveh Taghipour; Saab Mansour; Shahram Khadivi
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora

L12-1532  [bib]: Natalia Loukachevitch
Automatic Term Recognition Needs Multiple Evidence

L12-1533  [bib]: Sudheer Kolachina; Rashmi Prasad; Dipti Misra Sharma; Aravind Joshi
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank

L12-1534  [bib]: Hannah Kermes
A methodology for the extraction of information about the usage of formulaic expressions in scientific texts

L12-1535  [bib]: Irina Temnikova; Constantin Orasan; Ruslan Mitkov
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations

L12-1536  [bib]: Masood Ghayoomi
From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach

L12-1537  [bib]: Takafumi Suzuki; Yusuke Abe; Itsuki Toyota; Takehito Utsuro; Suguru Matsuyoshi; Masatoshi Tsuchiya
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation

L12-1538  [bib]: Mohammad Hossein Elahimanesh; Behrouz Minaei; Hossein Malekinezhad
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification

L12-1539  [bib]: Dafydd Gibbon
ULex: new data models and a mobile environment for corpus enrichment.

L12-1540  [bib]: Toshinobu Ogiso; Mamoru Komachi; Yasuharu Den; Yuji Matsumoto
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese

L12-1541  [bib]: Umar Shoaib; Nadeem Ahmad; Paolo Prinetto; Gabriele Tiotto
A platform-independent user-friendly dictionary from Italian to LIS

L12-1542  [bib]: Egoitz Laparra; German Rigau; Piek Vossen
Mapping WordNet to the Kyoto ontology

L12-1543  [bib]: Bartosz Broda; Marek Maziarz; Maciej Piasecki
Tools for plWordNet Development. Presentation and Perspectives

L12-1544  [bib]: Maria Eskevich; Gareth J.F. Jones; Martha Larson; Roeland Ordelman
Creating a Data Collection for Evaluating Rich Speech Retrieval

L12-1545  [bib]: Christian Chiarcos
Ontologies of Linguistic Annotation: Survey and perspectives

L12-1546  [bib]: Christian Chiarcos; Sebastian Hellmann; Sebastian Nordhoff; Steven Moran; Richard Littauer; Judith Eckle-Kohler; Iryna Gurevych; Silvana Hartmann; Michael Matuschek; Christian M. Meyer
The Open Linguistics Working Group

L12-1547  [bib]: Mehdi Manshadi; James Allen; Mary Swift
An Annotation Scheme for Quantifier Scope Disambiguation

L12-1548  [bib]: Christian Chiarcos
A generic formalism to represent linguistic corpora in RDF and OWL/DL

L12-1549  [bib]: Eleftheria Ahtaridis; Christopher Cieri; Denise DiPersio
LDC Language Resource Database: Building a Bibliographic Database

L12-1550  [bib]: Panagiotis Giannoulis; Gerasimos Potamianos
A hierarchical approach with feature selection for emotion recognition from speech

L12-1551  [bib]: Anil Kumar Singh
A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation

L12-1552  [bib]: Ananthakrishnan Ramanathan; Karthik Visweswariah
A Study of Word-Classing for MT Reordering

L12-1553  [bib]: Gideon Kotzé; Vincent Vandeghinste; Scott Martens; Jörg Tiedemann
Large aligned treebanks for syntax-based machine translation

L12-1554  [bib]: Inguna Skadiņa; Ahmet Aker; Nikos Mastropavlos; Fangzhong Su; Dan Tufiș; Mateja Verlic; Andrejs Vasiļjevs; Bogdan Babych; Paul Clough; Robert Gaizauskas; Nikos Glaros; Monica Lestari Paramita; Mārcis Pinnis
Collecting and Using Comparable Corpora for Statistical Machine Translation

L12-1555  [bib]: Maciej Piasecki; Radoslaw Ramocki; Marek Maziarz
Recognition of Polish Derivational Relations Based on Supervised Learning Scheme

L12-1556  [bib]: Silvia Moraes; Vera Lima
Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study

L12-1557  [bib]: Hideki Shima; Teruko Mitamura
Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource

L12-1558  [bib]: Mike Kestemont; Claudia Peersman; Benny De Decker; Guy De Pauw; Kim Luyckx; Roser Morante; Frederik Vaassen; Janneke van de Loo; Walter Daelemans
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language

L12-1559  [bib]: Iskandar Keskes; Farah Benamara; Lamia Hadrich Belguith
Clause-based Discourse Segmentation of Arabic Texts

L12-1560  [bib]: Yuichiroh Matsubayashi; Yusuke Miyao; Akiko Aizawa
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure

L12-1561  [bib]: Mohammad Fazleh Elahi; Paola Monachesi
An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use

L12-1562  [bib]: Olga Uryupina; Massimo Poesio
Domain-specific vs. Uniform Modeling for Coreference Resolution

L12-1563  [bib]: Alexandra Balahur; Jesús M. Hermida
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text

L12-1564  [bib]: Elsa Tolone; Stavroula Voyatzi; Claude Martineau; Matthieu Constant
Extending the adverbial coverage of a French morphological lexicon

L12-1565  [bib]: Dan Cristea; Radu Simionescu; Gabriela Haja
Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations

L12-1566  [bib]: Mārcis Pinnis
Latvian and Lithuanian Named Entity Recognition with TildeNER

L12-1567  [bib]: Masashi Inoue; Toshiki Akagi
Collecting humorous expressions from a community-based question-answering-service corpus

L12-1568  [bib]: Menzo Windhouwer
RELcat: a Relation Registry for ISOcat data categories

L12-1569  [bib]: Petya Osenova; Kiril Simov
The Political Speech Corpus of Bulgarian

L12-1570  [bib]: Eric Kow; Anja Belz
LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments

L12-1571  [bib]: Silvia Pareti
A Database of Attribution Relations

L12-1572  [bib]: Rafal Rak; Andrew Rowley; Sophia Ananiadou
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench

L12-1573  [bib]: Ying Li; Yue Yu; Pascale Fung
A Mandarin-English Code-Switching Corpus

L12-1574  [bib]: Bartosz Broda; Michał Marcińczuk; Marek Maziarz; Adam Radziszewski; Adam Wardyński
KPWr: Towards a Free Corpus of Polish

L12-1575  [bib]: Paulo Fernandes; Lucelene Lopes; Carlos A. Prolo; Afonso Sales; Renata Vieira
A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever

L12-1576  [bib]: André Santos; José João Almeida; Nuno Carvalho
Structural alignment of plain text books

L12-1577  [bib]: Seniz Demir; Ilknur Durgar El-Kahlout; Erdem Unal; Hamza Kaya
Turkish Paraphrase Corpus

L12-1578  [bib]: Keith J. Miller; Elizabeth Schroeder Richerson; Sarah McLeod; James Finley; Aaron Schein
International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned

L12-1579  [bib]: Ryan Georgi; Fei Xia; William Lewis
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms

L12-1580  [bib]: Yan Song; Fei Xia
Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation

L12-1581  [bib]: Joao Paulo Cabral; Mark Kane; Zeeshan Ahmed; Mohamed Abou-Zleikha; Eva Szekely; Amalia Zahra; Kalu Ogbureke; Peter Cahill; Julie Carson-Berndsen; Stephan Schlogl
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz

L12-1582  [bib]: Peteris Paikens; Normunds Gruzitis
An implementation of a Latvian resource grammar in Grammatical Framework

L12-1583  [bib]: Pepi Stavropoulou; Dimitris Spiliotopoulos; Georgios Kouroupetroglou
Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue

L12-1584  [bib]: Silke Scheible; Richard J. Whitt; Martin Durrell; Paul Bennett
GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German

L12-1585  [bib]: João Silva; Luísa Coheur; Ângela Costa; Isabel Trancoso
Dealing with unknown words in statistical machine translation

L12-1586  [bib]: Eric Charton; Michel Gagnon
A disambiguation resource extracted from Wikipedia for semantic annotation

L12-1587  [bib]: Wilker Aziz; Sheila Castilho; Lucia Specia
PET: a Tool for Post-editing and Assessing Machine Translation

L12-1588  [bib]: Alessandro Lenci; Simonetta Montemagni; Giulia Venturi; Maria Grazia Cutrullà
Enriching the ISST-TANL Corpus with Semantic Frames

L12-1589  [bib]: Kengo Ohta; Masatoshi Tsuchiya; Seiichi Nakagawa
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions

L12-1590  [bib]: Yeşim Aksan; Mustafa Aksan; Ahmet Koltuksuz; Taner Sezer; Ümit Mersinli; Umut Ufuk Demirhan; Hakan Yılmazer; Gülsüm Atasoy; Seda Öz; İpek Yıldız; Özlem Kurtoğlu
Construction of the Turkish National Corpus (TNC)

L12-1591  [bib]: Jirka Hana; Alexandr Rosen; Barbora Štindlová; Petr Jäger
Building a learner corpus

L12-1592  [bib]: Christian Federmann; Eleftherios Avramidis; Marta R. Costa-Jussà; Josef van Genabith; Maite Melero; Pavel Pecina
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation

L12-1593  [bib]: Maria Gavrilidou; Penny Labropoulou; Elina Desipri; Stelios Piperidis; Haris Papageorgiou; Monica Monachini; Francesca Frontini; Thierry Declerck; Gil Francopoulo; Victoria Arranz; Valérie Mapelli
The META-SHARE Metadata Schema for the Description of Language Resources

L12-1594  [bib]: Saeedeh Momtazi
Fine-grained German Sentiment Analysis on Social Media

L12-1595  [bib]: Maria Holmqvist; Sara Stymne; Lars Ahrenberg; Magnus Merkel
Alignment-based reordering for SMT

L12-1596  [bib]: Monica Gavrila; Walther v. Hahn; Cristina Vertan
Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation

L12-1597  [bib]: Vinodkumar Prabhakaran; Huzaifa Neralwala; Owen Rambow; Mona Diab
Annotations for Power Relations on Email Threads

L12-1598  [bib]: William J. Corvey; Sudha Verma; Sarah Vieweg; Martha Palmer; James H. Martin
Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events

L12-1599  [bib]: Isabella Poggi; Francesca D'Errico; Giovanna Leone
Pedagogical stances and their multimodal signals.

L12-1600  [bib]: Stefan Scherer; Georg Layher; John Kane; Heiko Neumann; Nick Campbell
An audiovisual political speech analysis incorporating eye-tracking and perception data

L12-1601  [bib]: Chris Irwin Davis
Tajik-Farsi Persian Transliteration Using Statistical Machine Translation

L12-1602  [bib]: Somayeh Bagherbeygi; Mehrnoush Shamsfard
Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations

L12-1603  [bib]: Anna Rumshisky; Nick Botchan; Sophie Kushkuley; James Pustejovsky
Word Sense Inventories by Non-Experts.

L12-1604  [bib]: Bernhard Brüning; Christian Schnier; Karola Pitsch; Sven Wasmuth
PAMOCAT: Automatic retrieval of specified postures

L12-1605  [bib]: Michael Tepper; Daniel Capurro; Fei Xia; Lucy Vanderwende; Meliha Yetisgen-Yildiz
Statistical Section Segmentation in Free-Text Clinical Records

L12-1606  [bib]: Rui Wang; Shuguang Li
Constructing a Question Corpus for Textual Semantic Relations

L12-1607  [bib]: Isa Maks; Piek Vossen
Building a fine-grained subjectivity lexicon from a web corpus

L12-1608  [bib]: Aude Grezka; Céline Poudat
Building a database of French frozen adverbial phrases

L12-1609  [bib]: Eneko Agirre; Ander Barrena; Oier Lopez de Lacalle; Aitor Soroa; Samuel Fernando; Mark Stevenson
Matching Cultural Heritage items to Wikipedia

L12-1610  [bib]: John Vogel; Marc Verhagen; James Pustejovsky
ATLIS: Identifying Locational Information in Text Automatically

L12-1611  [bib]: Dimitra Anastasiou
A Speech and Gesture Spatial Corpus in Assisted Living

L12-1612  [bib]: Jens Edlund; Simon Alexandersson; Jonas Beskow; Lisa Gustavsson; Mattias Heldner; Anna Hjalmarsson; Petter Kallionen; Ellen Marklund
3rd party observer gaze as a continuous measure of dialogue flow

L12-1613  [bib]: Roman Kurc; Maciej Piasecki; Bartosz Broda
Constraint Based Description of Polish Multiword Expressions

L12-1614  [bib]: Shafqat Mumtaz Virk; Elnaz Abolahrar
An Open Source Persian Computational Grammar

L12-1615  [bib]: Mariana Gomes; Ana Guilherme; Leonor Tavares; Rita Marquilhas
Project FLY: a multidisciplinary project within Linguistics

L12-1616  [bib]: José Pedro Ferreira; Maarten Janssen; Gladis Barcellos de Oliveira; Margarita Correia; Gilvan Müller de Oliveira
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language

L12-1617  [bib]: Guido Boella; Luigi di Caro; Llio Humphreys; Livio Robaldo; Leon van der Torre
NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge

L12-1618  [bib]: Marc Tomlinson; David Bracewell; Mary Draper; Zewar Almissour; Ying Shi; Jeremy Bensley
Pursing power in Arabic on-line discussion forums

L12-1619  [bib]: Daniel Bauer; Hagen Fürstenau; Owen Rambow
The Dependency-Parsed FrameNet Corpus

L12-1620  [bib]: Michael Rosner; Albert Gatt; Andrew Attard; Jan Joachimsen
Incorporating an Error Corpus into a Spellchecker for Maltese

L12-1621  [bib]: Emília Garcia Casademont; Antonio Bonafonte; Asunción Moreno
Building Synthetic Voices in the META-NET Framework

L12-1622  [bib]: Hidetsugu Nanba; Toshiyuki Takezawa; Kiyoko Uchiyama; Akiko Aizawa
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques

L12-1623  [bib]: Marco Dinarelli; Sophie Rosset
Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results

L12-1624  [bib]: Jan Pomikálek; Miloš Jakubíček; Pavel Rychlý
Building a 70 billion word corpus of English from ClueWeb

L12-1625  [bib]: Dong Wang; Fei Xia
Effort of Genre Variation and Prediction of System Performance

L12-1626  [bib]: Gerold Schneider; Fabio Rinaldi; Simon Clematide
Dependency parsing for interaction detection in pharmacogenomics

L12-1627  [bib]: Kyoko Ohara
Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English

L12-1628  [bib]: Octavian Popescu
Buildind a Resource of Patterns Using Semantic Types

L12-1629  [bib]: William Black; Rob Procter; Steven Gray; Sophia Ananiadou
A data and analysis resource for an experiment in text mining a collection of micro-­blogs on a political topic.

L12-1630  [bib]: Muhammad Abdul-Mageed; Mona Diab
AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis

L12-1631  [bib]: Sunao Hara; Norihide Kitaoka; Kazuya Takeda
Causal analysis of task completion errors in spoken music retrieval interactions

L12-1632  [bib]: Krešimir Šojat; Nives Mikelić Preradović; Marko Tadić
Generation of Verbal Stems in Derivationally Rich Language

L12-1633  [bib]: Fernando Castilho; Roger Granada; Breno Meneghetti; Leonardo Carvalho; Renata Vieira
Corpus+WordNet thesaurus generation for ontology enriching

L12-1634  [bib]: Paula Buttery; Andrew Caines
Reclassifying subcategorization frames for experimental analysis and stimulus generation

L12-1635  [bib]: Mateusz Kopeć; Maciej Ogrodniczuk
Creating a Coreference Resolution System for Polish

L12-1636  [bib]: Xiaoyi Ma
LDC Forced Aligner

L12-1637  [bib]: Emma Barker; Robert Gaizauskas
Assessing the Comparability of News Texts

L12-1638  [bib]: Nur-Hana Samsudin; Mark Lee
Building Text-to-Speech Systems for Resource Poor Languages

L12-1639  [bib]: Robert Speer; Catherine Havasi
Representing General Relational Knowledge in ConceptNet 5

L12-1640  [bib]: Mans Hulden; Jerid Francom
Boosting statistical tagger accuracy with simple rule-based grammars

L12-1641  [bib]: Jennifer Williams; Graham Katz
A New Twitter Verb Lexicon for Natural Language Processing

L12-1642  [bib]: Jonathan Washington; Mirlan Ipasov; Francis Tyers
A finite-state morphological transducer for Kyrgyz

L12-1643  [bib]: Marilyn Walker; Jean Fox Tree; Pranav Anand; Rob Abbott; Joseph King
A Corpus for Research on Deliberation and Debate

L12-1644  [bib]: Alexandra Roshchina; John Cardiff; Paolo Rosso
Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System

L12-1645  [bib]: Veronica Perez-Rosas; Carmen Banea; Rada Mihalcea
Learning Sentiment Lexicons in Spanish

L12-1646  [bib]: Erwin Fernandez-Ordonez; Rada Mihalcea; Samer Hassan
Unsupervised Word Sense Disambiguation with Multilingual Representations

L12-1647  [bib]: Stelios Piperidis
The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions

L12-1648  [bib]: Andrew Caines; Paula Buttery
Annotating progressive aspect constructions in the spoken section of the British National Corpus

L12-1649  [bib]: Kirk Roberts; Travis Goodwin; Sanda M. Harabagiu
Annotating Spatial Containment Relations Between Events

L12-1650  [bib]: Jacob Andreas; Sara Rosenthal; Kathleen McKeown
Annotating Agreement and Disagreement in Threaded Discussion

L12-1651  [bib]: Benoît Robichaud
Logic Based Methods for Terminological Assessment

L12-1652  [bib]: Sudheer Kolachina; Prasanth Kolachina
Parsing Any Domain English text to CoNLL dependencies

L12-1653  [bib]: Maarten Janssen
NeoTag: a POS Tagger for Grammatical Neologism Detection

L12-1654  [bib]: Harry Bunt; Michael Kipp; Volha Petukhova
Using DiAML and ANVIL for multimodal dialogue annotations

L12-1655  [bib]: Tsuyoshi Okita
Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner

L12-1656  [bib]: Tommaso Caselli; Irene Russo; Francesco Rubino
Assigning Connotation Values to Events

L12-1657  [bib]: Marilyn Walker; Grace Lin; Jennifer Sawyer
An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style

L12-1658  [bib]: Brigitte Bigi
SPPAS: a tool for the phonetic segmentation of speech

L12-1659  [bib]: Christopher Cieri; Marian Reed; Denise DiPersio; Mark Liberman
Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities

L12-1660  [bib]: Luc Boruta; Justyna Jastrzebska
A Phonemic Corpus of Polish Child-Directed Speech

L12-1661  [bib]: Sebastian Stüker; Florian Kraft; Christian Mohr; Teresa Herrmann; Eunah Cho; Alex Waibel
The KIT Lecture Corpus for Speech Translation

L12-1662  [bib]: Brigitte Bigi; Pauline Péri; Roxane Bertrand
Orthographic Transcription: which enrichment is required for phonetization?

L12-1663  [bib]: James Pustejovsky; Jessica Moszkowicz
The Role of Model Testing in Standards Development: The Case of ISO-Space

L12-1664  [bib]: Benoît Sagot; Rosa Stern
Aleda, a free large-scale entity database for French

L12-1665  [bib]: Marcello Federico; Sebastian Stüker; Luisa Bentivogli; Michael Paul; Mauro Cettolo; Teresa Herrmann; Jan Niehues; Giovanni Moretti
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation

L12-1666  [bib]: Benoît Sagot; Darja Fišer
Cleaning noisy wordnets

L12-1667  [bib]: Nicolas Hernandez
Tackling interoperability issues within UIMA work flows

L12-1668  [bib]: Djamé Seddah; Marie Candito; Benoit Crabbé; Enrique Henestroza Anguiano
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus

L12-1669  [bib]: Valérie Hanoka; Benoît Sagot
Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources

L12-1670  [bib]: Shyam Agrawal; Shweta Sinha; Pooja Singh; Jesper Olson
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment