Generation of Verbal Stems in Derivationally Rich Language

Krešimir Šojat, Nives Mikelić Preradović, Marko Tadić


Abstract
The paper presents a procedure for generating prefixed verbs in Croatian comprising combinations of one, two or three prefixes. The result of this generation process is a pool of derivationally valid prefixed verbs, although not necessarily occuring in corpora. The statistics of occurences of generated verbs in Croatian National Corpus has been calculated. Further usage of such language resource with generated potential verbs is also suggested, namely, enrichment of Croatian Morphological Lexicon, Croatian Wordnet and CROVALLEX.
Anthology ID:
L12-1632
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
928–933
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1061_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Krešimir Šojat, Nives Mikelić Preradović, and Marko Tadić. 2012. Generation of Verbal Stems in Derivationally Rich Language. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 928–933, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Generation of Verbal Stems in Derivationally Rich Language (Šojat et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1061_Paper.pdf