Federated Search: Towards a Common Search Infrastructure

Herman Stehouwer, Matej Durco, Eric Auer, Daan Broeder


Abstract
Within scientific institutes there exist many language resources. These resources are often quite specialized and relatively unknown. The current infrastructural initiatives try to tackle this issue by collecting metadata about the resources and establishing centers with stable repositories to ensure the availability of the resources. It would be beneficial if the researcher could, by means of a simple query, determine which resources and which centers contain information useful to his or her research, or even work on a set of distributed resources as a virtual corpus. In this article we propose an architecture for a distributed search environment allowing researchers to perform searches in a set of distributed language resources.
Anthology ID:
L12-1291
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3255–3259
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/524_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Herman Stehouwer, Matej Durco, Eric Auer, and Daan Broeder. 2012. Federated Search: Towards a Common Search Infrastructure. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3255–3259, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Federated Search: Towards a Common Search Infrastructure (Stehouwer et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/524_Paper.pdf