Sreelekha S


2018

Morphology Injection for English-Malayalam Statistical Machine Translation
Sreelekha S | Pushpak Bhattacharyya
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2016

Lexical Resources to Enrich English Malayalam Machine Translation
Sreelekha S | Pushpak Bhattacharyya
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

In this paper we present our work on the use of lexical resources for machine translation between English and Malayalam. We compare the performance of several Statistical Machine Translation (SMT) systems built on top of a phrase-based SMT baseline. We explore different ways of utilizing lexical resources to improve the quality of English-Malayalam statistical machine translation. To enrich the training corpus, we augment it with lexical resources in two ways: (a) additional vocabulary and (b) inflected verbal forms. The lexical resources include the IndoWordnet semantic relation set, lexical words, and verb phrases. We describe case studies and evaluations, and give a detailed error analysis for both Malayalam-to-English and English-to-Malayalam machine translation systems. We observed significant improvement in evaluations of translation quality. Lexical resources do help improve performance when parallel corpora are scarce.

2015

Solving Data Sparsity by Morphology Injection in Factored SMT
Sreelekha S | Piyush Dungarwal | Pushpak Bhattacharyya | Malathi D
Proceedings of the 12th International Conference on Natural Language Processing