MultiVal - towards a multilingual valence lexicon

Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu, Montserrat Marimon


Abstract
MultiVal is a valence lexicon derived from lexicons of computational HPSG grammars for Norwegian, Spanish and Ga (ISO 639-3, gaa), with altogether about 22,000 verb entries and on average more than 200 valence types defined for each language. These lexical resources are mapped onto a common set of discriminants with a common array of values, and stored in a relational database linked to a web demo and a wiki presentation. Search discriminants are ‘syntactic argument structure’ (SAS), functional specification, situation type and aspect, for any subset of languages, as well as the verb type systems of the grammars. Search results are lexical entries satisfying the discriminants entered, exposing the specifications from the respective provenance grammars. The Ga grammar lexicon has in turn been converted from a Ga Toolbox lexicon. Aside from the creation of such a multilingual valence resource through converging or converting existing resources, the paper also addresses a tool for the creation of such a resource as part of corpus annotation for less resourced languages.
Anthology ID:
L14-1124
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2478–2485
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1179_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu, and Montserrat Marimon. 2014. MultiVal - towards a multilingual valence lexicon. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2478–2485, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
MultiVal - towards a multilingual valence lexicon (Hellan et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1179_Paper.pdf