ACL Anthology
A Digital Archive of Research Papers in Computational Linguistics

Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-II
Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages
Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing
Proceedings of the IJCNLP-08 Workshop on Named Entity Recognition for South and South East Asian Languages
Proceedings of the 2nd workshop on Cross Lingual Information Access (CLIA) Addressing the Information Need of Multilingual Societies
Proceedings of the 6th Workshop on Asian Language Resources
Proceedings of the Workshop on Technologies and Corpora for Asia-Pacific Speech Translation (TCAST)

Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I

I08-1: Entire volume

I08-1000: Front Matter

I08-1001: Badam-Osor Khaltar; Atsushi Fujii
A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval

I08-1002: Hai Zhao; Chunyu Kit
An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework

I08-1003: Michael Tepper; Fei Xia
A Hybrid Approach to the Induction of Underlying Morphology

I08-1004: GuoDong Zhou; Fang Kong; QiaoMing Zhu
Context-Sensitive Convolution Tree Kernel for Pronoun Resolution

I08-1005: GuoDong Zhou; JunHui Li; LongHua Qian; QiaoMing Zhu
Semi-Supervised Learning for Relation Extraction

I08-1006: Xiaoyan Zhang; Ting Wang; Huowang Chen
Story Link Detection based on Dynamic Information Extending

I08-1007: Eiji Aramaki; Takeshi Imai; Kengo Miyo; Kazuhiko Ohe
Orthographic Disambiguation Incorporating Transliterated Probability

I08-1008: Min Zhang; Chengjie Sun; Haizhou Li; AiTi Aw; Chew Lim Tan; Xiaolong Wang
Name Origin Recognition Using Maximum Entropy Model and Diverse Features

I08-1009: Harshit Surana; Anil Kumar Singh
A More Discerning and Adaptable Multilingual Transliteration Mechanism for Indian Languages

I08-1010: G. Bharadwaja Kumar; Kavi Narayana Murthy
UCSG: A Wide Coverage Shallow Parsing System

I08-1011: Prachya Boonkwan; Thepchai Supnithi
Memory-Inductive Categorial Grammar: An Approach to Gap Resolution in Analytic-Language Translation

I08-1012: Wenliang Chen; Daisuke Kawahara; Kiyotaka Uchimoto; Yujie Zhang; Hitoshi Isahara
Dependency Parsing with Short Dependency Relations in Unlabeled Data

I08-1013: Béatrice Daille; Emmanuel Morin
An Effective Compositional Model for Lexical Alignment

I08-1014: Wilson Wong; Wei Liu; Mohammed Bennamoun
Determining the Unithood of Word Sequences Using a Probabilistic Approach

I08-1015: Dinakar Jayarajan; Dipti Deodhare; Balaraman Ravindran
Lexical Chains as Document Features

I08-1016: Ani Nenkova
Entity-driven Rewrite for Multi-document Summarization

I08-1017: Xiaofeng Wu; Chengqing Zong
A New Approach to Automatic Document Summarization

I08-1018: Harendra Bhandari; Masashi Shimbo; Takahiko Ito; Yuji Matsumoto
Generic Text Summarization Using Probabilistic Latent Semantic Indexing

I08-1019: Yasunari Miyabe; Hiroya Takamura; Manabu Okumura
Identifying Cross-Document Relations between Sentences

I08-1020: Horacio Saggion
Experiments on Semantic-based Clustering for Cross-document Coreference

I08-1021: Long Qiu; Min-Yen Kan; Tat-Seng Chua
Modeling Context in Scenario Template Creation

I08-1022: Ke Wu; Xiaolin Wang; Bao-Liang Lu
Cross Language Text Categorization Using a Bilingual Lexicon

I08-1023: Yong Wang; Yiqun Liu; Min Zhang; Shaoping Ma; Liyun Ru
Identify Temporal Websites Based on User Behavior Analysis

I08-1024: Youssef Kadri; Jian-Yun Nie
A Comparative Study for Query Translation using Linear Combination and Confidence Measure

I08-1025: Keiji Shinzato; Tomohide Shibata; Daisuke Kawahara; Chikara Hashimoto; Sadao Kurohashi
TSUBAKI: An Open Search Engine Infrastructure for Developing New Information Access Methodology

I08-1026: Fan Ding; Bin Wang
A Study on Effectiveness of Syntactic Relationship in Dependence Retrieval Model

I08-1027: Takashi Shichiri; Hiroaki Nanjo; Takehiko Yoshimi
Automatic Estimation of Word Significance oriented for Speech-based Information Retrieval

I08-1028: Yuichiro Fukubayashi; Kazunori Komatani; Mikio Nakano; Kotaro Funakoshi; Hiroshi Tsujino; Tetsuya Ogata; Hiroshi G. Okuno
Rapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems

I08-1029: Gina-Anne Levow
Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features

I08-1030: Ruiqiang Zhang; Eiichiro Sumita
Chinese Unknown Word Translation by Subword Re-segmentation

I08-1031: Jong-Hoon Oh; Hitoshi Isahara
Hypothesis Selection in Machine Transliteration: A Web Mining Approach

I08-1032: Takeshi Abekawa; Kyo Kageura
What Prompts Translators to Modify Draft Translations? An Analysis of Basic Modification Patterns for Use in the Automatic Notification of Awkwardly Translated Text

I08-1033: Ming-Hong Bai; Keh-Jiann Chen; Jason S. Chang
Improving Word Alignment by Adjusting Chinese Word Segmentation

I08-1034: Marina Sokolova; Vivi Nastase; Stan Szpakowicz
The Telling Tail: Signals of Success in Electronic Negotiation Texts

I08-1035: Dipanjan Das; Mohit Kumar; Alexander I. Rudnicky
Automatic Extraction of Briefing Templates

I08-1036: Oksana Yakhnenko; Barbara Rosario
Mining the Web for Relations between Digital Devices using a Probabilistic Maximum Margin Model

I08-1037: Yu-Chun Wang; Richard Tzong-Han Tsai; Wen-Lian Hsu
Learning Patterns from the Web to Translate Named Entities for Cross Language Information Retrieval

I08-1038: Bo Wang; Houfeng Wang
Bootstrapping Both Product Features and Opinion Words from Chinese Customer Reviews with Cross-Inducing

I08-1039: Daisuke Ikeda; Hiroya Takamura; Lev-Arie Ratinov; Manabu Okumura
Learning to Shift the Polarity of Words for Sentiment Classification

I08-1040: Taras Zagibalov; John Carroll
Unsupervised Classification of Sentiment and Objectivity in Chinese Text

I08-1041: Saima Aman; Stan Szpakowicz
Using Roget's Thesaurus for Fine-grained Emotion Recognition

I08-1042: Jesús Giménez; Lluís Márquez
Heterogeneous Automatic MT Evaluation Through Non-Parametric Metric Combinations

I08-1043: Young-Sook Hwang; Young-Kil Kim; Sangkyu Park
Paraphrasing Depending on Bilingual Context Toward Generalization of Translation Knowledge

I08-1044: Xiaofeng Yu; Wai Lam; Shing-Kit Chan
A Framework Based on Graphical Models with Logic for Chinese Named Entity Recognition

I08-1045: Sujan Kumar Saha; Sudeshna Sarkar; Pabitra Mitra
A Hybrid Feature Set based Maximum Entropy Hindi Named Entity Recognition

I08-1046: Stanley Wai Keong Yong; Jian Su
An Effective Method of Using Web Based Information for Relation Extraction

I08-1047: Mamoru Komachi; Hisami Suzuki
Minimally Supervised Learning of Semantic Knowledge from Query Logs

I08-1048: Jingbo Zhu; Huizhen Wang; Eduard Hovy
Learning a Stopping Criterion for Active Learning for Word Sense Disambiguation and Text Classification

I08-1049: Jin-Shea Kuo; Haizhou Li
Multi-View Co-Training of Transliteration Model

I08-1050: Kenji Hirohata; Naoaki Okazaki; Sophia Ananiadou; Mitsuru Ishizuka
Identifying Sections in Scientific Abstracts using Conditional Random Fields

I08-1051: Aljoscha Burchardt; Sebastian Padó; Dennis Spohr; Anette Frank; Ulrich Heid
Formalising Multi-layer Corpora in OWL DL - Lexicon Modelling, Querying and Consistency Control

I08-1052: Kiyoaki Shirai; Takenobu Tokunaga; Chu-Ren Huang; Shu-Kai Hsieh; Tzu-Yi Kuo; Virach Sornlertlamvanich; Thatsanee Charoenporn
Constructing Taxonomy of Numerative Classifiers for Asian Languages

I08-1053: Nikesh Garera; David Yarowsky
Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages

I08-1054: Marius Pasca
Answering Definition Questions via Temporally-Anchored Text Snippets

I08-1055: Ryuichiro Higashinaka; Hideki Isozaki
Corpus-based Question Answering for why-Questions

I08-1056: Lucian Vlad Lita; Jaime Carbonell
Cluster-Based Query Expansion for Statistical Question Answering

I08-1057: Chen-Ming Hung
A Semantic Feature for Relation Recognition Using a Web-based Corpus

I08-1058: Yo Ehara; Kumiko Tanaka-Ishii
Multilingual Text Entry using Automatic Language Detection

I08-1059: Michael Gamon; Jianfeng Gao; Chris Brockett; Alexandre Klementiev; William B. Dolan; Dmitriy Belenko; Lucy Vanderwende
Using Contextual Speller Techniques and Language Modeling for ESL Error Correction

I08-1060: Takashi Tsunakawa; Jun'ichi Tsujii
Bilingual Synonym Identification with Spelling Variations

I08-1061: Nikesh Garera; David Yarowsky
Minimally Supervised Multilingual Taxonomy and Translation Lexicon Induction

I08-1062: Jessica Ramírez; Masayuki Asahara; Yuji Matsumoto
Japanese-Spanish Thesaurus Construction Using English as a Pivot

I08-1063: M. Saravanan; B. Ravindran; S. Raman
Automatic Identification of Rhetorical Roles using Conditional Random Fields for Legal Document Summarization

I08-1064: Kathrin Spreyer; Anette Frank
Projection-based Acquisition of a Temporal Labeller

I08-1065: Shuya Abe; Kentaro Inui; Yuji Matsumoto
Acquiring Event Relation Knowledge by Learning Cooccurrence Patterns and Fertilizing Cooccurrence Samples with Verbal Nouns

I08-1066: Deyi Xiong; Min Zhang; AiTi Aw; Haitao Mi; Qun Liu; Shouxun Lin
Refinements in BTG-based Statistical Machine Translation

I08-1067: Ananthakrishnan Ramanathan; Jayprasad Hegde; Ritesh M. Shah; Pushpak Bhattacharyya; Sasikumar M
Simple Syntactic and Morphological Processing Can Help English-Hindi Statistical Machine Translation

I08-1068: Rohini U; Vamshi Ambati; Vasudeva Varma
Statistical Machine Translation Models for Personalized Search

I08-1069: Fei Xia; William D. Lewis
Repurposing Theoretical Linguistic Data for Tool Development and Search

I08-1070: Atsushi Fujita; Satoshi Sato
Computing Paraphrasability of Syntactic Variants Using Web Snippets

I08-1071: Wisam Dakka; Silviu Cucerzan
Augmenting Wikipedia with Named Entity Tags

I08-1072: Masato Hagiwara; Yasuhiro Ogawa; Katsuhiko Toyama
Context Feature Selection for Distributional Similarity

I08-1073: Ryu Iida; Diana McCarthy; Rob Koeling
Gloss-Based Semantic Similarity Metrics for Predominant Sense Acquisition

I08-1074: Su Nam Kim; Timothy Baldwin
Benchmarking Noun Compound Interpretation

Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-II

I08-2: Entire volume

I08-2075: Aasish Pappu; Ratna Sanyal
Vaakkriti: Sanskrit Tokenizer

I08-2076: Amir Azim Sharifloo; Mehrnoush Shamsfard
A Bottom Up approach to Persian Stemming

I08-2077: Asif Ekbal; Rejwanul Haque; Sivaji Bandyopadhyay
Named Entity Recognition in Bengali: A Conditional Random Field Approach

I08-2078: Shing-Kit Chan; Wai Lam; Xiaofeng Yu
An Online Cascaded Approach to Biomedical Named Entity Recognition

I08-2079: Qiang Zhou
Automatic Rule Acquisition for Chinese Intra-chunk Relations

I08-2080: Ryohei Sasano; Sadao Kurohashi
Japanese Named Entity Recognition Using Structural Natural Language Processing

I08-2081: YingJu Xia; Hao Yu; Gang Zou
Dimensionality Reduction with Multilingual Resource

I08-2082: Xing Yi; Jianfeng Gao; William B. Dolan
A Web-based English Proofing System for English as a Second Language Users

I08-2083: Hung-Chi Huang; Hsin-Hsi Chen; Ming-Shun Lin
Analysis of Intention in Dialogues Using Category Trees and Its Application to Advertisement Recommendation

I08-2084: Thuy Vu; AiTi Aw; Min Zhang
Term Extraction Through Unithood and Termhood Unification

I08-2085: Yeha Lee; Seung-Hoon Na; Jong-Hyeok Lee
Search Result Clustering Using Label Language Model

I08-2086: HaiXiang Huang; Atsushi Fujii
Effects of Related Term Extraction in Transliteration into Chinese

I08-2087: Dakun Zhang; Le Sun; Wenbo Li
A Structured Prediction Approach for Statistical Machine Translation

I08-2088: Keiji Yasuda; Ruiqiang Zhang; Hirofumi Yamamoto; Eiichiro Sumita
Method of Selecting Training Data to Build a Compact and Efficient Translation Model

I08-2089: Holger Schwenk; Philipp Koehn
Large and Diverse Language Models for Statistical Machine Translation

I08-2090: Javier Couto; Jean-Luc Minel
A Linguistic and Navigational Knowledge Approach to Text Navigation

I08-2091: Virach Sornlertlamvanich; Thatsanee Charoenporn; Chumpol Mokarat; Hitoshi Isahara; Hammam Riza; Purev Jaimai
Synset Assignment for Bi-lingual Dictionary with Limited Resource

I08-2092: Tomoya Noro; Takehiro Tokuda
Ranking Words for Building a Japanese Defining Vocabulary

I08-2093: William D. Lewis; Fei Xia
Automatically Identifying Computationally Relevant Typological Features

I08-2094: Suguru Matsuyoshi; Satoshi Sato
Automatic Paraphrasing of Japanese Functional Expressions Using a Hierarchically Organized Dictionary

I08-2095: Sibabrata Paladhi; Sivaji Bandyopadhyay
Generation of Referring Expression Using Prefix Tree Structure

I08-2096: Tuomo Kakkonen; Erkki Sutinen
Coverage-based Evaluation of Parser Generalizability

I08-2097: Daisuke Kawahara; Kiyotaka Uchimoto
Learning Reliability of Parses for Domain Adaptation of Dependency Parsing

I08-2098: Duen-Chi Yang; Yu-Ming Hsieh; Keh-Jiann Chen
Resolving Ambiguities of Chinese Conjunctive Structures by Divide-and-conquer Approaches

I08-2099: Rafiya Begum; Samar Husain; Arun Dhwaj; Dipti Misra Sharma; Lakshmi Bai; Rajeev Sangal
Dependency Annotation Scheme for Indian Languages

I08-2100: Masaki Murata; Sachiyo Tsukawaki; Toshiyuki Kanamaru; Qing Ma; Hitoshi Isahara
Non-Factoid Japanese Question Answering through Passage Retrieval that Is Weighted Based on Types of Answers

I08-2101: Mohamad Ali Honarpisheh; Gholamreza Ghassem-Sani; Ghassem Mirroshandel
A Multi-Document Multi-Lingual Automatic Summarization System

I08-2102: Megumi Makino; Kazuhide Yamamoto
Summarization by Analogy: An Example-based Approach for News Articles

I08-2103: Donghong Ji; Yu Nie
Sentence Ordering based on Cluster Adjacency in Multi-Document Summarization

I08-2104: Tomoyosi Akiba; Kei Shimizu; Atsushi Fujii
Statistical Machine Translation based Passage Retrieval for Cross-Lingual Question Answering

I08-2105: Vivi Nastase
Unsupervised All-words Word Sense Disambiguation with Grammatical Dependencies

I08-2106: Patrick Saint-Dizier
Syntactic and Semantic Frames in PrepNet

I08-2107: Lin Sun; Anna Korhonen; Yuval Krymolowski
Automatic Classification of English Verbs Using Rich Syntactic Features

I08-2108: Timothy Baldwin; Su Nam Kim; Francis Bond; Sanae Fujita; David Martinez; Takaaki Tanaka
MRD-based Word Sense Disambiguation: Further Extending Lesk

I08-2109: Wanxiang Che; Min Zhang; AiTi Aw; Chew Lim Tan; Ting Liu; Sheng Li
Fast Computing Grammar-driven Convolution Tree Kernel for Semantic Role Labeling

I08-2110: Tomohide Shibata; Michitaka Odani; Jun Harashima; Takashi Oonishi; Sadao Kurohashi
SYNGRAPH: A Flexible Matching Method based on Synonymous Expression Extraction from an Ordinary Dictionary and a Web Corpus

I08-2111: Eduard Bejček; Pavel Straňák; Pavel Schlesinger
Annotation of Multiword Expressions in the Prague Dependency Treebank

I08-2112: Paul McNamee; Rion Snow; Patrick Schone; James Mayfield
Learning Named Entity Hyponyms for Question Answering

I08-2113: Kamadev Bhanuprasad; Mats Svenson
Errgrams -- A Way to Improving ASR for Highly Inflected Dravidian Languages

I08-2114: Sunita Maithani; J.S. Rawat
Noise as a Tool for Spoken Language Identification

I08-2115: Sameer Badaskar; Sachin Agarwal; Shilpa Arora
Identifying Real or Fake Articles: Towards better Language Modeling

I08-2116: Akinori Fujino; Hideki Isozaki; Jun Suzuki
Multi-label Text Categorization with Model Combination based on F1-score Maximization

I08-2117: Manabu Sassano
An Experimental Comparison of the Voted Perceptron and Support Vector Machines in Japanese Analysis Tasks

I08-2118: Venkatesan Chakravarthy; Sachindra Joshi; Ganesh Ramakrishnan; Shantanu Godbole; Sreeram Balakrishnan
Learning Decision Lists with Known Rules for Text Mining

I08-2119: Mengqiu Wang
A Re-examination of Dependency Path Kernels for Relation Extraction

I08-2120: Bo Li; Juan Liu
Mining Chinese-English Parallel Corpora from the Web

I08-2121: Kenji Tateishi; Dai Kusui
Fast Duplicated Documents Detection using Multi-level Prefix-filter

I08-2122: Yoshinobu Kano; Ngan Nguyen; Rune Sætre; Kazuhiro Yoshida; Keiichiro Fukamachi; Yusuke Miyao; Yoshimasa Tsuruoka; Sophia Ananiadou; Jun'ichi Tsujii
Towards Data and Goal Oriented Analysis: Tool Inter-operability and Combinatorial Comparison

I08-2123: Danushka Bollegala; Yutaka Matsuo; Mitsuru Ishizuka
A Co-occurrence Graph-based Approach for Personal Name Alias Extraction from Anchor Texts

I08-2124: Donghui Feng; Gully Burns; Jingbo Zhu; Eduard Hovy
Towards Automated Semantic Analysis on Biomedical Research Articles

I08-2125: Lucian Vlad Lita; Shipeng Yu; Stefan Niculescu; Jinbo Bi
Large Scale Diagnostic Code Classification for Medical Patient Records

I08-2126: Asuka Sumida; Kentaro Torisawa
Hacking Wikipedia for Hyponymy Relation Acquisition

I08-2127: Naoaki Okazaki; Mitsuru Ishizuka; Jun'ichi Tsujii
A Discriminative Approach to Japanese Abbreviation Extraction

I08-2128: Mostafa Al Masum Shaikh; Helmut Prendinger; Mitsuru Ishizuka
Linguistic Interpretation of Emotions for Affect Sensing from Text

I08-2129: Hiroyuki Akama; Maki Miyake; Jaeyoung Jung
How to Take Advantage of the Limitations with Markov Clustering?--The Foundations of Branching Markov Clustering (BMCL)

I08-2130: Hongzhi Xu; Chunping Li
Combining Context Features by Canonical Belief Network for Chinese Part-Of-Speech Tagging

I08-2131: Ahmed Hassan; Sara Noeman; Hany Hassan
Language Independent Text Correction using Finite State Automata

I08-2132: Yaodong Chen; Ting Wang; Huowang Chen; Xishan Xu
Semantic Role Labeling of Chinese Using Transductive SVM and Semantic Heuristics

I08-2133: Maria Georgescul; Alexander Clark; Susan Armstrong
A Comparative Study of Mixture Models for Automatic Topic Segmentation of Multiparty Dialogues

I08-2134: Richard Tzong-Han Tsai; Hsi-Chuan Hung
Exploiting Unlabeled Text to Extract New Words of Different Semantic Transparency for Chinese Word Segmentation

I08-2135: Monojit Choudhury; Animesh Mukherjee; Niloy Ganguly
Social Network Inspired Models of NLP and Language Evolution

I08-2136: Rada Mihalcea; Vivi Nastase
How to Add a New Language on the NLP Map: Building Resources and Tools for Languages with Scarce Resources

I08-2137: Horacio Saggion
Introduction to Text Summarization and Other Information Access Technologies

I08-2138: Mandeep Singh Gill; Gurpreet Singh Lehal; Shiv Sharma Joshi
A Punjabi Grammar Checker

I08-2139: Jiří Mírovský
Netgraph – Making Searching in Treebanks Easy

I08-2140: Son Doan; QuocHung-Ngo; Ai Kawazoe; Nigel Collier
Global Health Monitor - A Web-based System for Detecting and Mapping Infectious Diseases

I08-2141: Anil Kumar Singh
A Mechanism to Provide Language-Encoding Support and an NLP Friendly Editor

I08-2142: Ruvan Weerasinghe; Asanka Wasala; Dulip Herath; Viraj Welgama
NLP Applications of Sinhala: TTS & OCR

I08-2143: Swati Gupta; Marilyn A. Walker; Daniela M. Romano
POLLy: A Conversational System that uses a Shared Representation to Generate Action and Social Language

I08-2144: CLIA Consortium
Cross Lingual Information Access System for Indian Languages

I08-2145:
Author Index

Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages

I08-3: Entire volume

I08-3000: Front Matter

I08-3001: Anne David; Michael Maxwell
Invited Talk: Building Language Resources: Ways to move forward

I08-3002: Virach Sornlertlamvanich
Invited Talk: Cross Language Resource Sharing

I08-3003: Monojit Choudhury
Invited Talk: Breaking the Zipfian Barrier of NLP

I08-3004: Anil Kumar Singh
Natural Language Processing for Less Privileged Languages: Where do we come from? Where are we going?

I08-3005: Thatsanee Charoenporn; Virach Sornlertlamvanich; Hitoshi Isahara; Kergrit Robkop
KUI: an ubiquitous tool for collective intelligence development

I08-3006: Tirthankar Dasgupta; Sandipan Dandapat; Anupam Basu
Prototype Machine Translation System From Text-To-Indian Sign Language

I08-3007: Michael Maxwell; Anne David
Joint Grammar Development by Linguists and Computer Scientists

I08-3008: Daniel Zeman; Philip Resnik
Cross-Language Parser Adaptation between Related Languages

I08-3009: Sandeva Goonetilleke; Yoshihiko Hayashi; Yuichi Itoh; Fumio Kishino
SriShell Primo: A Predictive Sinhala Text Input System

I08-3010: Zin Maung Maung; Yoshiki Mikami
A Rule-based Syllable Segmentation of Myanmar Text

I08-3011: I. Alegria; X. Arregi; X. Artola; A. Diaz de Ilarraza; G. Labaka; M. Lersundi; A. Mayor; K. Sarasola
Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source

I08-3012: Sandipan Sarkar; Sivaji Bandyopadhyay
Design of a Rule-based Stemmer for Natural Language Text in Bengali

I08-3013: Jackson Muhirwe; Trond Trosterud
Finite State Solutions For Reduplication In Kinyarwanda Language

I08-3014: Mohammad Naveed Ali; M. A. Khan; Muhammad Aamir Khan
An Optimal Order of Factors for the Computational Treatment of Personal Anaphoric Devices in Urdu Discourse

I08-3015: Thoudam Doren Singh; Sivaji Bandyopadhyay
Morphology Driven Manipuri POS Tagger

I08-3016: Krishnakumar V; Indrani Roy
Acharya - A Text Editor and Framework for working with Indic Scripts

I08-3017: R.K. Aggarwal; M. Dave
Implementing a Speech Recognition System Interface for Indian Languages

I08-3018: Hammam Riza
Indigenous Languages of Indonesia: Creating Language Resources for Language Preservation

I08-3019: Chirag Patel; Karthik Gali
Part-Of-Speech Tagging for Gujarati Using Conditional Random Fields

I08-3020: David Ellis; Mathias Creutz; Timo Honkela; Mikko Kurimo
Speech to speech machine translation: Biblical chatter from Finnish to English

Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing

I08-4: Entire volume

I08-4000: Front Matter

I08-4001: Zhou-Jun Li; Wen-Han Chao; Yue-Xin Chen
An Example-based Decoder for Spoken Language Machine Translation

I08-4002: Chengguo Jin; Seung-Hoon Na; Dong-Il Kim; Jong-Hyeok Lee
Automatic Extraction of English-Chinese Transliteration Pairs using Dynamic Window and Tokenizer

I08-4003: Jin-Shea Kuo; Haizhou Li; Chih-Lung Lin
Mining Transliterations from Web Query Results: An Incremental Approach

I08-4004: Feiliang Ren; Jingbo Zhu
An Effective Hybrid Machine Learning Approach for Coreference Resolution

I08-4005: Yuchang Cheng; Masayuki Asahara; Yuji Matsumoto
Use of Event Types for Temporal Relation Identification in Chinese Text

I08-4006: Jinghua Wang; Jianyi Liu; Ping Zhang
Chinese Word Sense Disambiguation with PageRank and HowNet

I08-4007: Bor-shen Lin
Stochastic Dependency Parsing Based on A* Admissible Search

I08-4008: Jia Lu; Masayuki Asahara; Yuji Matsumoto
Analyzing Chinese Synthetic Words with Tree-based Information and a Survey on Chinese Morphologically Derived Words

I08-4009: Zhenxing Wang; Changning Huang; Jingbo Zhu
Which Performs Better on In-Vocabulary Word Segmentation: Based on Word or Character?

I08-4010: Guangjin Jin; Xiao Chen
The Fourth International Chinese Language Processing Bakeoff: Chinese Word Segmentation, Named Entity Recognition and Chinese POS Tagging

I08-4011: Aitao Chen; Ya Zhang; Gordon Sun
A Two-Stage Approach to Chinese Part-of-Speech Tagging

I08-4012: Jiang Li; Rile Hu; Guohua Zhang; Yuezhong Tang; Zhanjiang Song; Xia Wang
NOKIA Research Center Beijing Chinese Word Segmentation System for the SIGHAN Bakeoff 2007

I08-4013: Xinnian Mao; Yuan Dong; Saike He; Sencheng Bao; Haila Wang
Chinese Word Segmentation and Named Entity Recognition Based on Conditional Random Fields

I08-4014: Ying Qin; Caixia Yuan; Jiashen Sun; Xiaojie Wang
BUPT Systems in the SIGHAN Bakeoff 2007

I08-4015: Zhenxing Wang; Changning Huang; Jingbo Zhu
The Character-based CRF Segmenter of MSRA&NEU for the 4th Bakeoff

I08-4016: Xiaofeng Yu; Wai Lam; Shing-Kit Chan; Yiu Kei Wu; Bo Chen
Chinese NER Using CRFs and Logic for the Fourth SIGHAN Bakeoff

I08-4017: Hai Zhao; Chunyu Kit
Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition

I08-4018: Samuel W.K. Chan; Mickey W.C. Chong
An Agent-Based Approach to Chinese Word Segmentation

I08-4019: Xiaohe Chen; Bin Li; Junzhi Lu; Hongdong Nian; Xuri Tang
Nanjing Normal University Segmenter for the Fourth SIGHAN Bakeoff

I08-4020: Yuanyong Feng; Ruihong Huang; Le Sun
Two Step Chinese Named Entity Recognition Based on Conditional Random Fields Models

I08-4021: Guohong Fu; Jonathan J. Webster
A Morpheme-based Part-of-Speech Tagger for Chinese

I08-4022: Jingzhou He; Houfeng Wang
Chinese Named Entity Recognition and Word Segmentation Based on Character

I08-4023: Degen Huang; Xiao Sun; Shidou Jiao; Lishuang Li; Zhuoye Ding; Ru Wan
HMM and CRF Based Hybrid Model for Chinese Lexical Analysis

I08-4024: Ka Seng Leong; Fai Wong; Yiping Li; Ming Chui Dong
Chinese Tagging Based on Maximum Entropy Model

I08-4025: Dong Song; Anoop Sarkar
Training a Perceptron with Global and Local Features for Chinese Word Segmentation

I08-4026: Guang-Lu Sun; Cheng-Jie Sun; Ke Sun; Xiao-Long Wang
A Study of Chinese Lexical Analysis Based on Discriminative Models

I08-4027: Jia-Lin Tsai
Word Boundary Token Model for the SIGHAN Bakeoff 2007

I08-4028: Xihong Wu; Xiaojun Lin; Xinhao Wang; Chunyao Wu; Yaozhong Zhang; Dianhai Yu
An Improved CRF based Chinese Language Processing System for SIGHAN Bakeoff 2007

I08-4029: Yu-Chieh Wu; Jie-Chi Yang; Yue-Shi Lee
Description of the NCU Chinese Word Segmentation and Part-of-Speech Tagging for SIGHAN Bakeoff 2007

I08-4030: Zhiting Xu; Xian Qian; Yuejie Zhang; Yaqian Zhou
CRF-based Hybrid Model for Word Segmentation, NER and even POS Tagging

I08-4031: Fan Yang; Jun Zhao; Bo Zou
CRFs-Based Named Entity Recognition Incorporated with Heuristic Entity List Searching

I08-4032: Jianfeng Zhang; Jiaheng Zheng; Hu Zhang; Hongye Tan
A Chinese Word Segmentation System Based on Cascade Model

I08-4033: Ruiqiang Zhang; Eiichiro Sumita
Achilles: NiCT/ATR Chinese Morphological Analyzer for the Fourth Sighan Bakeoff

I08-4034:
Author Index

Proceedings of the IJCNLP-08 Workshop on Named Entity Recognition for South and South East Asian Languages

I08-5: Entire volume

I08-5000: Front Matter

I08-5001: Sobha L
Invited Talk: Named Entity Recognition: Different Approaches

I08-5002: Sivaji Bandyopadhyay
Invited Talk: Multilingual Named Entity Recognition

I08-5003: Anil Kumar Singh
Named Entity Recognition for South and South East Asian Languages: Taking Stock

I08-5004: Sujan Kumar Saha; Sanjay Chatterji; Sandipan Dandapat; Sudeshna Sarkar; Pabitra Mitra
A Hybrid Named Entity Recognition System for South and South East Asian Languages

I08-5005: Karthik Gali; Harshit Surana; Ashwini Vaidya; Praneeth Shishtla; Dipti Misra Sharma
Aggregating Machine Learning and Rule Based Heuristics for Named Entity Recognition

I08-5006: Asif Ekbal; Rejwanul Haque; Amitava Das; Venkateswarlu Poka; Sivaji Bandyopadhyay
Language Independent Named Entity Recognition in Indian Languages

I08-5007: P Srikanth; Kavi Narayana Murthy
Named Entity Recognition for Telugu

I08-5008: Asif Ekbal; Sivaji Bandyopadhyay
Bengali Named Entity Recognition Using Support Vector Machine

I08-5009: Vijayakrishna R; Sobha L
Domain Focused Named Entity Recognizer for Tamil Using Conditional Random Fields

I08-5010: Praneeth M Shishtla; Prasad Pingali; Vasudeva Varma
A Character n-gram Based Approach for Improved Recall in Indian Language NER

I08-5011: Bidyut Baran Chaudhuri; Suvankar Bhattacharya
An Experiment on Automatic Detection of Named Entities in Bangla

I08-5012: Praveen P; Ravi Kiran V
Hybrid Named Entity Recognition System for South and South East Asian Languages

I08-5013: Amit Goyal
Named Entity Recognition for South Asian Languages

I08-5014: Animesh Nayan; B. Ravi Kiran Rao; Pawandeep Singh; Sudip Sanyal; Ratna Sanyal
Named Entity Recognition for Indian Languages

I08-5015: Praneeth M Shishtla; Karthik Gali; Prasad Pingali; Vasudeva Varma
Experiments in Telugu NER: A Conditional Random Field Approach

Proceedings of the 2nd workshop on Cross Lingual Information Access (CLIA) Addressing the Information Need of Multilingual Societies

I08-6: Entire volume

I08-6000: Front Matter

I08-6001: Peter Chew; Ahmed Abdelali
The Effects of Language Relatedness on Multilingual Information Retrieval: A Case Study With Indo-European and Semitic Languages

I08-6002: Pattabhi R K Rao T; Sobha L
Identifying Similar and Co-referring Documents Across Languages

I08-6003: Achim Ruopp; Fei Xia
Finding parallel texts on the web using cross-language information retrieval

I08-6004: K Saravanan; A Kumaran
Some Experiments in Mining Named Entity Transliteration Pairs from Comparable Corpora

I08-6005: Gareth Jones; Fabio Fantino; Eamonn Newman; Ying Zhang
Domain-Specific Query Translation for Multilingual Information Access using Machine Translation Augmented With Dictionaries Mined from Wikipedia

I08-6006: Prasad Pingali; Suryaganesh; Sreeharsha Yella; Vasudeva Varma
Statistical Transliteration for Cross Language Information Retrieval using HMM alignment model and CRF

I08-6007: Anurag Bhardwaj; Damien Jose; Venu Govindaraju
Script Independent Word Spotting in Multilingual Documents

I08-6008: Sibabrata Paladhi; Sivaji Bandyopadhyay
A Document Graph Based Query Focused Multi-Document Summarizer

I08-6009: Manoj Kumar Chinnakotla; Sagar Ranadive; Om P. Damani; Pushpak Bhattacharyya
Hindi and Marathi to English Cross Language Information Retrieval

I08-6010: Debasis Mandal; Sandipan Dandapat; Mayank Gupta; Pratyush Banerjee; Sudeshna Sarkar
Bengali and Hindi to English CLIR Evaluation

I08-6011: Sivaji Bandyopadhyay; Tapabrata Mondal; Sudip Kumar Naskar; Asif Ekbal; Rejwanul Haque; Srinivasa Rao Godavarthy
Bengali, Hindi and Telugu to English Ad-hoc Bilingual Task

I08-6012: Jagadeesh Jagarlamudi; A Kumaran
Cross-Lingual Information Retrieval System for Indian Languages

I08-6013: Prasad Pingali; Vasudeva Varma
Hindi and Telugu to English CLIR using Query Expansion

I08-6014: Mandar Mitra; Prosenjit Majumdar
FIRE: Forum for Information Retrieval Evaluation

I08-6015:
Author Index

Proceedings of the 6th Workshop on Asian Language Resources

I08-7: Entire volume

I08-7000: Front Matter

I08-7001: Asif Ekbal; Sivaji Bandyopadhyay
Development of Bengali Named Entity Tagged Corpus and its Use in NER Systems

I08-7002: Sujan Kumar Saha; Sudeshna Sarkar; Pabitra Mitra
Gazetteer Preparation for Named Entity Recognition in Indian Languages

I08-7003: Gaoying Cui; Qin Lu; Wenjie Li
Preliminary Chinese Term Classification for Ontology Construction

I08-7004: Makiko Matsuda; Tomoe Takahashi; Hiroki Goto; Yoshikazu Hayase; Robin Lee Nagano; Yoshiki Mikami
Technical Terminology in Asian Languages: Different Approaches to Adopting Engineering Terms

I08-7005: Wunna Ko Ko; Thin Zar Phyo
Selection of XML tag set for Myanmar National Corpus

I08-7006: Hla Hla Htay; Kavi Narayana Murthy
Myanmar Word Segmentation using Syllable level Longest Matching

I08-7007: Rizza Caminero; Yoshiki Mikami
The Link Structure of Language Communities and its Implication for Language-specific Crawling

I08-7008: Tirthankar Dasgupta; Sambit Shukla; Sandeep Kumar; Synny Diwakar; Anupam Basu
A Multilingual Multimedia Indian Sign Language Dictionary Tool

I08-7009: Deniz Zeyrek; Bonnie Webber
A Discourse Resource for Turkish: Annotating Discourse Connectives in the METU Corpus

I08-7010: Rashmi Prasad; Samar Husain; Dipti Sharma; Aravind Joshi
Towards an Annotated Corpus of Discourse Relations in Hindi

I08-7011: Yin-Sheng Tai; D. Victoria Rau; Meng-Chien Yang
A Semantic Study on Yami Ontology in Traditional Songs

I08-7012: Rama Sree R.J.; Uma Maheswara Rao G; Madhu Murthy K.V.
Assessment and Development of POS Tag Set for Telugu

I08-7013: Sankaran Baskaran; Kalika Bali; Tanmoy Bhattacharya; Pushpak Bhattacharyya; Girish Nath Jha; Rajendran S; Saravanan K; Sobha L; Subbarao K V
Designing a Common POS-Tagset Framework for Indian Languages

I08-7014: Hammam Riza
Resources Report on Languages of Indonesia

I08-7015: Ryo Nishimura; Yasuhiko Watanabe; Yoshihiro Okada
Confirmed Language Resource for Answering How Type Questions Developed by Using Mails Posted to a Mailing List

I08-7016: Purev Jaimai; Odbayar Chimeddorj
Corpus building for Mongolian language

I08-7017: Sarmad Hussain
Resources for Urdu Language Processing

I08-7018: Kikuo Maekawa
Balanced Corpus of Contemporary Written Japanese

I08-7019: Viet Hoang-Anh; Thu Dinh-Thi-Phuong; Thang Huynh-Quyet
A Basic Framework to Build a Test Collection for the Vietnamese Text Categorization

I08-7020: Virach Sornlertlamvanich; Thatsanee Charoenporn; Suphanut Thayaboon; Chumpol Mokarat; Hitoshi Isahara
Enhanced Tools for Online Collaborative Language Resource Development

I08-7021: Shuichi Itahashi; Koiti Hasida
Japanese Effort Toward Sharing Text and Speech Corpora

I08-7022:
Author Index

Proceedings of the Workshop on Technologies and Corpora for Asia-Pacific Speech Translation (TCAST)

I08-8: Entire volume

I08-8000: Front Matter

I08-8001: Jonghoon Lee; Donghyeon Lee; Gary Geunbae Lee
Transformation-based Sentence Splitting method for Statistical Machine Translation

I08-8002: Chai Wutiwiwatchai; Thepchai Supnithi; Krit Kosawat
Speech-to-Speech Translation Activities in Thailand

I08-8003: Andrew Finch; Eiichiro Sumita
Phrase-based Machine Transliteration

I08-8004: Sakriani Sakti; Eka Kelana; Hammam Riza; Shinsuke Sakai; Konstantin Markov; Satoshi Nakamura
Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project

I08-8005: Hyungbae Jeon; Kyuwoong Hwang; Hoon Chung; Seunghi Kim; Jun Park; Yunkeun Lee
Using Confidence Vector in Multi-Stage Speech Recognition

I08-8006: Hammam Riza; Oskar Riandi
Toward Asian Speech Translation System: Developing Speech Recognition and Machine Translation for Indonesian Language