:: MAIN CONFERENCE ::

Chairs
Area chairs
Invited speakers
Instructions for Authors
List of accepted papers

:: Chairs

General Conference Chair: Donia Scott (University of Brighton, UK)

Local Organization Chair: Toni Badia (Universitat Pompeu Fabra, Spain)

Program co-chairs:

Walter Daelemans
     CNTS - Linguistics Department
     University of Antwerp
     Universiteitsplein 1 (A)
     B-2610 Antwerpen
     Belgium
     +32 38202766 (office)
     +32 38202761 (fax)
     

Marilyn Walker
     Department of Computer Science
     University of Sheffield
     Regent Court, 211 Portobello Street
     Sheffield S1 4DP, U.K.
     +44 (0)114 222 1806 (office)
     +44 (0)114 222 1810 (fax)
     

:: Area Chairs

Elisabeth Andre, University of Augsburg, Germany
Jill Burstein, Educational Testing Service, USA
Claire Cardie, Cornell University, USA
Pascale Fung, University of Science and Technology, Hong Kong
Hitoshi Isahara, National Institute of Information and Communications Tecnology, NICT
Michael Johnston, AT&T, USA
Rada Mihalcea, University of North Texas, USA
Jon Oberlander, University of Edinburgh, UK
Kemal Oflazer, Sabanci University, Turkey
Kees van Deemter, University of Brighton, UK
Antal van den Bosch, University of Tilburg, The Netherlands

:: Invited speakers

The flexibility of human speech recognition and the seeds of language change

ANNE CUTLER, Max Planck Institute for Psycholinguistics

Phonetic categories of the mother tongue are learned early, and are notoriously well-anchored, in the sense that they can interfere with speech perception in second languages. However, phonetic categories of the mother tongue are not immutable. Category boundaries change over time within language communities and within individuals. A series of experiments (beginning with Norris, McQueen & Cutler, Cog. Psych., 2003) has explored the factors which determine how adult listeners retune phonetic contrasts. Dutch subjects made lexical decisions on words and nonwords, where a final phoneme in some words had been replaced by an ambiguous sound. This experience led them to shift their category boundary for a subsequent phonetic categorisation task. The boundary shift only occurred when lexical information was available to train the categorisation; exposure to the ambiguous sound in nonwords did not induce retuning. Explicit attention to the lexical information is, however, not required. Retuning can be initially speaker-specific and thus enables adjustment for individual speakers without affecting perception of other speakers. Finally, retuning generalises across words. These results suggest that feedback links in speech perception are used for learning but not for online processing, and they further argue against speech perception models with no abstract prelexical or lexical representations (such as radical episodic models).

If I Have a Hammer:
Computational Linguistics in a Reading Tutor that Listens

Jack Mostow, Research Professor
Robotics, Language Technologies, Human-Computer Interaction, Automated Learning and Discovery
Director, Project LISTEN
Carnegie Mellon University
RI-NSH 4213, 5000 Forbes Avenue, Pittsburgh, PA 15213-3890
www.cs.cmu.edu/~listen

Project LISTEN’s Reading Tutor uses speech recognition to listen to children read aloud, and helps them learn to read, as evidenced by rigorous evaluations of pre- to posttest gains compared to various controls. In the 2003-2004 school year, children ages 5-14 used the Reading Tutor daily at school on over 200 computers, logging over 50,000 sessions, 1.5 million tutorial responses, and 10 million words.
This talk uses the Reading Tutor to illustrate the diverse roles that computational linguistics can play in an intelligent tutor:

    · A domain model describes a skill to learn, such as mapping from spelling to pronunciation.
    · A production model predicts student behavior, such as likely oral reading mistakes.
    · A language model predicts likely word sequences for a given task, such as oral reading.
    · A student model estimates a student’s skills, such as mastery of grapheme-to-phoneme mappings.
    · A pedagogical model guides tutorial decisions, such as choosing words a student is ready to try.
A recurring theme is the use of “big data” to train such models automatically.

Acknowledgements

This work was supported in part by the National Science Foundation under ITR/IERI Grant No. REC-0326153. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the author and do not necessarily reflect the views of the National Science Foundation or the official policies, either expressed or implied, of the sponsors or of the United States Government.
I thank the students and educators who generated our data, and the current and past members of Project LISTEN for contributions to this work, including their publications posted at www.cs.cmu.edu/~listen.

::Instructions for Authors

For camera-ready versions of papers, use the instructions in the following page:
http://www1.cs.columbia.edu/~rambow/acl/camera-ready.html

:: List of accepted papers

A DISTRIBUTIONAL MODEL OF SEMANTIC CONTEXT EFFECTS IN LEXICAL PROCESSING
Scott Macdonald Chris Brew

A GEOMETRIC VIEW ON BILINGUAL LEXICON EXTRACTION FROM COMPARABLE CORPORA
E. Gaussier, J.-M. Renders, I. Matveeva, C. Goutte, H. Dejean

A JOINT SOURCE-CHANNEL MODEL FOR MACHINE TRANSLITERATION
Haizhou LI, Min ZHANG, Jian SU

A KERNEL PCA METHOD FOR SUPERIOR WORD SENSE DISAMBIGUATION
Dekai WU, Weifeng SU, Marine CARPUAT

A MENTION-SYNCHRONOUS COREFERENCE RESOLUTION ALGORITHM BASED ON THE BELL TREE
Xiaoqiang Luo, Abe Ittycheriah, Hongyan Jing, Nanda Kambhatla, Salim Roukos

A SENTIMENTAL EDUCATION: SENTIMENT ANALYSIS USING SUBJECTIVITY SUMMARIZATION
Bo Pang, Lillian Lee

A STUDY ON CONVOLUTION KERNELS FOR SHALLOW SEMANTIC PARSING
Alessandro Moschitti

A TAG-BASED NOISY CHANNELMODEL OF SPEECH REPAIRS
Mark Johnson Eugene Charniak

A UNIFIED FRAMEWORK FOR AUTOMATIC EVALUATION USING N-GRAM CO-OCCURRENCE STATISTICS
Radu Soricut, Eric Brill

ACQUIRING THE MEANING OF DISCOURSE MARKERS
Ben Hutchinson

ADAPTIVE CHINESE WORD SEGMENTATION
Jianfeng Gao, Andi Wu, Mu Li, Chang-Ning Huang, Hongqiao Li, Xinsong Xia, Haowei Qin

ALIGNING WORDS USING MATRIX FACTORISATION
Cyril Goutte, Kenji Yamada, Eric Gaussier

ALTERNATIVE APPROACHES FOR GENERATING BODIES OF GRAMMAR RULES
Gabriel Infante-Lopez Maarten de Rijke

AN ALTERNATIVE METHOD OF TRAINING PROBABILISTIC LR PARSERS
Mark-Jan Nederhof, Giorgio Satta

AN EMPIRICAL STUDY OF INFORMATION SYNTHESIS TASKS
Enrique Amigo, Julio Gonzalo, Victor Peinado, Anselmo Peñas, Felisa Verdejo

ANALYSIS OF MIXED NATURAL AND SYMBOLIC LANGUAGE INPUT IN MATHEMATICAL DIALOGS
Magdalena Wolska, Ivana Kruijff-Korbayova

ANNEALING TECHNIQUES FOR UNSUPERVISED STATISTICAL LANGUAGE LEARNING
Noah A. Smith, Jason Eisner

APPLYING MACHINE LEARNING TO CHINESE TEMPORAL RELATION RESOLUTION
Wenjie Li, Guihong Cao, Kam-Fai Wong, Chunfa Yuan

ATTENTION SHIFTING FOR PARSING SPEECH
Keith Hall and Mark Johnson

AUTOMATIC EVALUATION OF MACHINE TRANSLATION QUALITY USING LONGEST COMMON SUBSEQUENCE AND SKIP-BIGRAM STATISTICS
Chin-Yew Lin and Franz Josef Och

BALANCING CLARITY AND EFFICIENCY IN TYPED FEATURE LOGIC THROUGH DELAYING
Gerald Penn

BUILDING VERB PREDICATES: A COMPUTATIONAL VIEW
Fernando Gomez

CHINESE VERB SENSE DISCRIMINATION USING AN EM CLUSTERING MODEL WITH RICH LINGUISTIC FEATURES
Jinying Chen, Martha Palmer

CLASSIFYING SEMANTIC RELATIONS IN BIOSCIENCE TEXTS
Barbara Rosario, Marti A. Hearst

COLLECTIVE INFORMATION EXTRACTION WITH RELATIONAL MARKOV NETWORKS
Razvan Bunescu and Raymond J. Mooney

COLLOCATION TRANSLATION ACQUISITION USING MONOLINGUAL CORPORA
Yajuan Lv, Ming Zhou

COMBINING ACOUSTIC AND PRAGMATIC FEATURES TO PREDICT RECOGNITION PERFORMANCE IN SPOKEN DIALOGUE SYSTEMS
Malte Gabsdil, Oliver Lemon

COMPUTING LOCALLY COHERENT DISCOURSES
Ernst Althaus, Nikiforos Karamanis, Alexander Koller

CONSTRUCTIVIST DEVELOPMENT OF GROUNDED CONSTRUCTION GRAMMAR
Luc Steels

CONVOLUTION KERNELS WITH FEATURE SELECTION FOR NATURAL LANGUAGE PROCESSING TASKS
Jun Suzuki, Hideki Isozaki

CORPUS-BASED INDUCTION OF SYNTACTIC STRUCTURE: MODELS OF DEPENDENCY AND CONSTITUENCY
Dan Klein and Christopher D. Manning

CREATING MULTILINGUAL TRANSLATION LEXICONS WITH REGIONAL VARIATIONS USING WEB CORPORA
Pu-Jen Cheng, Wen-Hsiang Lu, Jei-Wen Teng, Lee-Feng Chien

DATA-DRIVEN STRATEGIES FOR AN AUTOMATED DIALOGUE SYSTEM
Hilda Hardy, Alan Biermann, R. Bryce Inouye, Ashley Mckenzie, Tomek Strzalkowski, Cristian Ursu, Nick Webb, Min Wu

DEPENDENCY TREE KERNELS FOR RELATION EXTRACTION
Aron Culotta Jeffrey Sorensen

DEVELOPING A FLEXIBLE SPOKEN DIALOG SYSTEM USING SIMULATION
Grace Chung

DIMENSIONS OF PARSING
I. Dan Melamed

DISCOVERING RELATIONS AMONG NAMED ENTITIES FROM LARGE CORPORA
Takaaki Hasegawa, Satoshi Sekine and Ralph Grishman

DISCRIMINATIVE LANGUAGE MODELING WITH CONDITIONAL RANDOM FIELDS AND THE PERCEPTRON ALGORITHM
Brian Roark, Murat Saraclar, Michael Collins, Mark Johnson

DISCRIMINATIVE TRAINING OF A NEURAL NETWORK STATISTICAL PARSER
James Henderson

ENRICHING THE OUTPUT OF A PARSER USING MEMORY-BASED LEARNING
Valentin Jijkoun and Maarten de Rijke

ERROR MINING FOR WIDE-COVERAGE GRAMMAR ENGINEERING
Gertjan van Noord

EVALUATING CENTERING-BASED METRICS OF COHERENCE FOR TEXT STRUCTURING USING A RELIABLY ANNOTATED CORPUS
Nikiforos Karamanis, Massimo Poesio, Chris Mellish, Jon Oberlander

EXPERIMENTS IN PARALLEL-TEXT BASED GRAMMAR INDUCTION
Jonas Kuhn

EXTENDING BLEU MT EVALUATION METHOD WITH FREQUENCY WEIGHING
Bogdan Babych, Anthony Hartley

EXTRACTING REGULATORY GENE EXPRESSION NETWORKS FROM PUBMED
Jasmin Saric, Lars J. Jensen, Peer Bork, Rossitza Ouzounova, Isabel Rojas

FINDING IDEOGRAPHIC REPRESENTATIONS OF JAPANESE NAMES WRITTEN IN LATIN SCRIPT VIA LANGUAGE IDENTIFICATION AND CORPUS VALIDATION
Yan Qu, Gregory Grefenstette

FINDING PREDOMINANT WORD SENSES IN UNTAGGED TEXT
Diana McCarthy, Rob Koeling, Julie Weeds, John Carroll

FLSA: EXTENDING LATENT SEMANTIC ANALYSIS WITH FEATURES\\ FOR DIALOGUE ACT CLASSIFICATION
Riccardo Serafin, Barbara Di Eugenio

FSA: AN EFFICIENT AND FLEXIBLE C++ TOOLKIT FOR FINITE STATE AUTOMATA USING ON-DEMAND COMPUTATION
Stephan Kanthak, Hermann Ney

GENERALIZED MULTITEXT GRAMMARS
I. Dan Melamed, Giorgio Satta, Benjamin Wellington

GENERATING REFERRING EXPRESSIONS IN OPEN DOMAINS
Advaith Siddharthan Ann Copestake

HEAD-DRIVING PARSING FOR WORD LATTICES
Christopher Collins Gerald Penn Bob Carpenter

IDENTIFYING AGREEMENT AND DISAGREEMENT IN CONVERSATIONAL SPEECH: USE OF BAYESIAN NETWORKS TO MODEL PRAGMATIC DEPENDENCIES
Michel Galley, Kathleen McKeown, Julia Hirschberg, Elizabeth Shriberg

IMPROVING IBM WORD-ALIGNMENT MODEL 1
Robert C. Moore

IMPROVING PRONOUN RESOLUTION BY INCORPORATING COREFERENTIAL INFORMATION OF CANDIDATES
Xiaofeng Yang, Jian Su, Guodong Zhou, and Chew Lim Tan

INCREMENTAL PARSING WITH THE PERCEPTRON ALGORITHM
Michael Collins and Brian Roark

INDUCING FRAME SEMANTIC VERB CLASSES FROM WORDNET AND LDOCE
Rebecca Green Bonnie J. Dorr Philip Resnik

LARGE-SCALE INDUCTION AND EVALUATION OF LEXICAL RESOURCES FROM THE PENN-II TREEBANK
Michael Burke, Aoife Cahill, Ruth O'Donovan, Josef van Genabith and Andy Way

LEARNING NOUN PHRASE ANAPHORICITY TO IMPROVE COREFERENCE RESOLUTION: ISSUES IN REPRESENTATION AND OPTIMIZATION
Vincent Ng and Claire Cardie

LEARNING TO RESOLVE BRIDGING REFERENCES
Massimo Poesio, Rahul Mehta, Axel Maroudas, Janet Hitzeman

LEARNING WITH UNLABELED DATA FOR TEXT CATEGORIZATION USING A BOOTSTRAPPING AND A FEATURE PROJECTION TECHNIQUE
Youngjoong Ko and Jungyun Seo

LEARNING WORD SENSE WITH FEATURE SELECTION AND ORDER IDENTIFICATION CAPABILITIES
Zhengyu Niu, Donghong Ji, Kim-Teng Lua

LINGUISTIC PROFILING FOR AUTHORSHIP RENOGNITION AND VERIFICATION
Hans van Halteren

LONG-DISTANCE DEPENDENCY RESOLUTION IN AUTOMATICALLY ACQUIRED WIDE-COVERAGE PCFG-BASED LFG APPROXIMATIONS
Aoife Cahill, Michael Burke, Ruth O'Donovan, Josef van Genabith, Andy Way

MINIMAL RECURSION SEMANTICS AS DOMINANCE CONSTRAINTS: TRANSLATION, EVALUATION, AND ANALYSIS
Ruth Fuchss, Alexander Koller, Joachim Niehren, Stefan Thater

MINING METALINGUISTIC ACTIVITY IN CORPORA TO CREATE LEXICAL RESOURCES USING INFORMATION EXTRACTION TECHNIQUES: THE MOP SYSTEM
Carlos Rodriguez Penagos

MULTI-CRITERIA-BASED ACTIVE LEARNING FOR NAMED ENTITY RECOGNITION
Dan SHEN, Jie ZHANG, Jian SU, Guodong ZHOU and Chew-Lim TAN

MULTI-ENGINE MACHINE TRANSLATION WITH VOTED LANGUAGE MODEL
Tadashi Nomoto

OPTIMIZATION IN MULTIMODAL INTERPRETATION
Joyce Y. Chai, Pengyu Hong

OPTIMIZING TYPED FEATURE STRUCTURE GRAMMAR PARSING THROUGH NON-STATISTICAL INDEXING
Cosmin MUNTEANU, Gerald PENN

PARAGRAPH-, WORD-, AND COHERENCE-BASED APPROACHES TO SENTENCE RANKING: A COMPARISON OF ALGORITHM AND HUMAN PERFORMANCE
Florian Wolf, Edward Gibson

PARSING THE WSJ USING CCG AND LOG-LINEAR MODELS
Stephen Clark and James R. Curran

PREDICATE-ARGUMENT STRUCTURE FROM BROAD-COVERAGE PARSE TREES: IMPROVING ON THE CONTEXT-FREE APPROXIMATION
Roger Levy and Christopher D. Manning

PREDICTING STUDENT EMOTIONS IN COMPUTER-HUMAN TUTORING DIALOGUES
Diane J. Litman and Katherine Forbes-Riley

PROBABILISTIC PARSING STRATEGIES
Mark-Jan Nederhof, Giorgio Satta

QUESTION ANSWERING USING CONSTRAINT SATISFACTION: QA-BY-DOSSIER-WITH-CONSTRAINTS
John Prager, Jennifer Chu-Carroll, Krzysztof Czuba

RELIEVING THE DATA ACQUISITION BOTTLENECK IN WORD SENSE DISAMBIGUATION
Mona Diab

SPLITTING COMPLEX TEMPORAL QUESTIONS FOR QUESTION ANSWERING SYSTEMS
E. Saquete, P. Martínez-Barco, R. Muñoz and J.L. Vicedo

STATISTICAL MACHINE TRANSLATION WITH WORD- AND SENTENCE-ALIGNED PARALLEL CORPORA
Chris Callison-Burch, David Talbot, and Miles Osborne

STATISTICAL MODELING FOR UNIT SELECTION IN SPEECH SYNTHESIS
Cyril Allauzen, Mehryar Mohri and Michael Riley

THE SENTIMENTAL FACTOR: IMPROVING REVIEW CLASSIFICATION VIA HUMAN-PROVIDED INFORMATION
Philip Beineke, Trevor Hastie, Shivakumar Vaithyanathan

TRAINABLE SENTENCE PLANNING FOR COMPLEX INFORMATION PRESENTATION IN SPOKEN DIALOG SYSTEMS
Amanda Stent, Rashmi Prasad, and Marilyn Walker

UNSUPERVISED LEARNING OF CHINESE VERB SENSES BY USING AN EM CLUSTERING MODEL WITH RICH LINGUISTIC FEATURES
Jinying Chen, Martha Palmer

UNSUPERVISED SENSE DISAMBIGUATION USING BILINGUAL PROBABILISTIC MODELS
Indrajit Bhattacharya Lise Getoor Yoshua Bengio

USER EXPERTISE MODELLING AND ADAPTIVITY IN A SPEECH-BASED E-MAIL SYSTEM
Kristiina Jokinen and Kari Kanto

USING CONDITIONAL RANDOM FIELDS TO PREDICT PITCH ACCENTS IN CONVERSATIONAL SPEECH
Michelle Gregory and Yasemin Altun

USING LINGUISTIC PRINCIPLES TO RECOVER EMPTY CATEGORIES
Richard Campbell

WEAKLY SUPERVISED LEARNING FOR CROSS-DOCUMENT PERSON NAME DISAMBIGUATION SUPPORTED BY INFORMATION EXTRACTION
Cheng Niu, Wei Li and Rohini K. Srihari

WRAPPING OF TREES
James Rogers

top