WeDH - a Friendly Tool for Building Literary Corpora Enriched with Encyclopedic Metadata

Mattia Egloff, Davide Picca


Abstract
In recent years the interest in the use of repositories of literary works has been successful. While many efforts related to Linked Open Data go in the right direction, the use of these repositories for the creation of text corpora enriched with metadata remains difficult and cumbersome. In fact, many of these repositories can be useful to the community not only for the automatic creation of textual corpora but also for retrieving crucial meta-information about texts. In particular, the use of metadata provides the reader with a wealth of information that is often not identifiable in the texts themselves. Our project aims to fill both the access to the textual resources available on the web and the possibility of combining these resources with sources of metadata that can enrich the texts with useful information lengthening the life and maintenance of the data itself. We introduce here a user-friendly web interface of the Digital Humanities toolkit named WeDH with which the user can leverage the encyclopedic knowledge provided by DBpedia, wikidata and VIAF in order to enrich the corpora with bibliographical and exegetical knowledge. WeDH is a collaborative project and we invite anyone who has ideas or suggestions regarding this procedure to reach out to us.
Anthology ID:
2020.lrec-1.101
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
813–816
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.101
DOI:
Bibkey:
Cite (ACL):
Mattia Egloff and Davide Picca. 2020. WeDH - a Friendly Tool for Building Literary Corpora Enriched with Encyclopedic Metadata. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 813–816, Marseille, France. European Language Resources Association.
Cite (Informal):
WeDH - a Friendly Tool for Building Literary Corpora Enriched with Encyclopedic Metadata (Egloff & Picca, LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.101.pdf