Spell Checking for Chinese

Shaohua Yang, Hai Zhao, Xiaolin Wang, Bao-liang Lu


Abstract
This paper presents some novel results on Chinese spell checking. In this paper, a concise algorithm based on minimized-path segmentation is proposed to reduce the cost and suit the needs of current Chinese input systems. The proposed algorithm is actually derived from a simple assumption that spelling errors often make the number of segments larger. The experimental results are quite positive and implicitly verify the effectiveness of the proposed assumption. Finally, all approaches work together to output a result much better than the baseline with 12% performance improvement.
Anthology ID:
L12-1423
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
730–736
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/727_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Shaohua Yang, Hai Zhao, Xiaolin Wang, and Bao-liang Lu. 2012. Spell Checking for Chinese. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 730–736, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Spell Checking for Chinese (Yang et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/727_Paper.pdf