Vicent Alabau

Also published as: Vicente Alabau


2014

pdf bib
CASMACAT: A Computer-assisted Translation Workbench
Vicent Alabau | Christian Buck | Michael Carl | Francisco Casacuberta | Mercedes García-Martínez | Ulrich Germann | Jesús González-Rubio | Robin Hill | Philipp Koehn | Luis Leiva | Bartolomé Mesa-Lao | Daniel Ortiz-Martínez | Herve Saint-Amand | Germán Sanchis Trilles | Chara Tsoukala
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib
Integrating online and active learning in a computer-assisted translation workbench
Vicent Alabau | Jesús González-Rubio | Daniel Ortiz-Martínez | Germán Sanchis-Trilles | Francisco Casacuberta | Mercedes García-Martínez | Bartolomé Mesa-Lao | Dan Cheung Petersen | Barbara Dragsted | Michael Carl
Workshop on interactive and adaptive machine translation

This paper describes a pilot study with a computed-assisted translation workbench aiming at testing the integration of online and active learning features. We investigate the effect of these features on translation productivity, using interactive translation prediction (ITP) as a baseline. User activity data were collected from five beta testers using key-logging and eye-tracking. User feedback was also collected at the end of the experiments in the form of retrospective think-aloud protocols. We found that OL performs better than ITP, especially in terms of translation speed. In addition, AL provides better translation quality than ITP for the same levels of user effort. We plan to incorporate these features in the final version of the workbench.

pdf bib
Collaborative web UI localization, or how to build feature-rich multilingual datasets
Vicent Alabau | Luis A. Leiva
Proceedings of the 17th Annual Conference of the European Association for Machine Translation

pdf bib
Evaluating the effects of interactivity in a post-editing workbench
Nancy Underwood | Bartolomé Mesa-Lao | Mercedes García Martínez | Michael Carl | Vicent Alabau | Jesús González-Rubio | Luis A. Leiva | Germán Sanchis-Trilles | Daniel Ortíz-Martínez | Francisco Casacuberta
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

This paper describes the field trial and subsequent evaluation of a post-editing workbench which is currently under development in the EU-funded CasMaCat project. Based on user evaluations of the initial prototype of the workbench, this second prototype of the workbench includes a number of interactive features designed to improve productivity and user satisfaction. Using CasMaCat’s own facilities for logging keystrokes and eye tracking, data were collected from nine post-editors in a professional setting. These data were then used to investigate the effects of the interactive features on productivity, quality, user satisfaction and cognitive load as reflected in the post-editors’ gaze activity. These quantitative results are combined with the qualitative results derived from user questionnaires and interviews conducted with all the participants.

pdf bib
Proofreading Human Translations with an E-pen
Vicent Alabau | Luis A. Leiva
Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation

2013

pdf bib
User Evaluation of Advanced Interaction Features for a Computer-Assisted Translation Workbench
Vicente Alabau | Jesus Gonzalez-Rubio | Luis A. Leiva | Daniel Ortiz-Martínez | German Sanchis-Trilles | Francisco Casacuberta | Bartolomé Mesa-Lao | Ragnar Bonk | Michael Carl | Mercedes Garcia-Martinez
Proceedings of Machine Translation Summit XIV: User track

pdf bib
Advanced computer aided translation with a web-based workbench
Vicent Alabau | Ragnar Bonk | Christian Buck | Michael Carl | Francisco Casacuberta | Mercedes García-Martínez | Jesús González | Philipp Koehn | Luis Leiva | Bartolomé Mesa-Lao | Daniel Oriz | Hervé Saint-Amand | Germán Sanchis | Chara Tsiukala
Proceedings of the 2nd Workshop on Post-editing Technology and Practice

2011

pdf bib
Improving On-line Handwritten Recognition using Translation Models in Multimodal Interactive Machine Translation
Vicent Alabau | Alberto Sanchis | Francisco Casacuberta
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

pdf bib
An Interactive Machine Translation System with Online Learning
Daniel Ortiz-Martínez | Luis A. Leiva | Vicent Alabau | Ismael García-Varea | Francisco Casacuberta
Proceedings of the ACL-HLT 2011 System Demonstrations

2010

pdf bib
ITI-UPV machine translation system for IWSLT 2010
Guillem Gascó | Vicent Alabau | Jesús-Andrés Ferrer | Jesús González-Rubio | Martha-Alicia Rocha | Germán Sanchis-Trilles | Francisco Casacuberta | Jorge González | Joan-Andreu Sánchez
Proceedings of the 7th International Workshop on Spoken Language Translation: Evaluation Campaign

This paper presents the submissions of the PRHLT group for the evaluation campaign of the International Workshop on Spoken Language Translation. We focus on the development of reliable translation systems between syntactically different languages (DIALOG task) and on the efficient training of SMT models in resource-rich scenarios (TALK task).

2009

pdf bib
Statistical Post-Editing of a Rule-Based Machine Translation System
Antonio-L. Lagarda | Vicent Alabau | Francisco Casacuberta | Roberto Silva | Enrique Díaz-de-Liaño
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers

2008

pdf bib
Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation
Míriam Luján | Carlos D. Martínez | Vicent Alabau
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Multilingual Automatic Speech Recognition (ASR) systems are of great interest in multilingual environments. We studied the case of the Comunitat Valenciana where the two official languages are Spanish and Valencian. These two languages share most of their phonemes, and their syntax and vocabulary are also quite similar since they have influenced each other for many years. We constructed a system, and trained its acoustic models with a small corpus of Spanish and Valencian, which has produced poor results due to the lack of data. Adaptation techniques can be used to adapt acoustic models that are trained with a large corpus of a language inr order to obtain acoustic models for a phonetically similar language. This process is known as language adaptation. The Maximum Likelihood Linear Regression (MLLR) technique has commonly been used in speaker adaptation; however we have used MLLR in language adaptation. We compared several MLLR variants (mean square, diagonal matrix and full matrix) for language adaptation in order to choose the best alternative for our system.

2007

pdf bib
Improving speech-to-speech translation using word posterior probabilities
Vicente Alabau | Alberto Sanchis | Francisco Casacuberta
Proceedings of Machine Translation Summit XI: Papers

pdf bib
Using word posterior probabilities in lattice translation
Vicente Alabau | Alberto Sanchis | Francisco Casacuberta
Proceedings of the Fourth International Workshop on Spoken Language Translation

In this paper we describe the statistical machine translation system developed at ITI/UPV, which aims especially at speech recognition and statistical machine translation integration, for the evaluation campaign of the International Workshop on Spoken Language Translation (2007). The system we have developed takes advantage of an improved word lattice representation that uses word posterior probabilities. These word posterior probabilities are then added as a feature to a log-linear model. This model includes a stochastic finite-state transducer which allows an easy lattice integration. Furthermore, it provides a statistical phrase-based reordering model that is able to perform local reorderings of the output. We have tested this model on the Italian-English corpus, for clean text, 1-best ASR and lattice ASR inputs. The results and conclusions of such experiments are reported at the end of this paper.

2006

pdf bib
Bilingual speech corpus in two phonetically similar languages
Vicente Alabau | Carlos D. Martínez
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

As Speech Recognition Systems improve, they become suitable for facingnew problems. Multilingual speech recognition is one such problems. In the present work, the case of the Comunitat Valenciana multilingual environment is studied. The official languages in the Comunitat Valenciana (Spanish and Valencian) share most of their acoustic units, and their vocabularies and syntax are quite similar. They have influenced each other for many years.A small corpus on an Information System task was developed for experimentationpurposes.This choice will make it possible to develop a working prototype in the future,and it is simple enough to build semi-automatic language models. The design of the acoustic corpus is discussed, showing that all combinations of accents have been studied (native, non-native speakers, male, female, etc.).