LDC Language Resource Database: Building a Bibliographic Database

Eleftheria Ahtaridis, Christopher Cieri, Denise DiPersio


Abstract
The Linguistic Data Consortium (LDC) creates and provides language resources (LRs) including data, tools and specifications. In order to assess the impact of these LRs and to support both LR users and authors, LDC is collecting metadata about and URLs for research papers that introduce, describe, critique, extend or rely upon LDC LRs. Current collection efforts focus on papers published in journals and conference proceedings that are available online. To date, nearly 300, or over half of the LRs LDC distributes have been searched for extensively and almost 8000 research papers about these LRs have been documented. This paper discusses the issues with collecting references and includes preliminary analysis of those results. The remaining goals of the project are also outlined.
Anthology ID:
L12-1549
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1723–1728
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/916_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Eleftheria Ahtaridis, Christopher Cieri, and Denise DiPersio. 2012. LDC Language Resource Database: Building a Bibliographic Database. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1723–1728, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
LDC Language Resource Database: Building a Bibliographic Database (Ahtaridis et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/916_Paper.pdf