Improving Open Relation Extraction via Sentence Re-Structuring

Jordan Schmidek, Denilson Barbosa


Abstract
Information Extraction is an important task in Natural Language Processing, consisting of finding a structured representation for the information expressed in natural language text. Two key steps in information extraction are identifying the entities mentioned in the text, and the relations among those entities. In the context of Information Extraction for the World Wide Web, unsupervised relation extraction methods, also called Open Relation Extraction (ORE) systems, have become prevalent, due to their effectiveness without domain-specific training data. In general, these systems exploit part-of-speech tags or semantic information from the sentences to determine whether or not a relation exists, and if so, its predicate. This paper discusses some of the issues that arise when even moderately complex sentences are fed into ORE systems. A process for re-structuring such sentences is discussed and evaluated. The proposed approach replaces complex sentences by several others that, together, convey the same meaning and are more amenable to extraction by current ORE systems. The results of an experimental evaluation show that this approach succeeds in reducing the processing time and increasing the accuracy of the state-of-the-art ORE systems.
Anthology ID:
L14-1029
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3720–3723
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1038_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Jordan Schmidek and Denilson Barbosa. 2014. Improving Open Relation Extraction via Sentence Re-Structuring. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3720–3723, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Improving Open Relation Extraction via Sentence Re-Structuring (Schmidek & Barbosa, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1038_Paper.pdf