WebAnnotator, an Annotation Tool for Web Pages

Xavier Tannier


Abstract
This article presents WebAnnotator, a new tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension, allowing annotation of both offline and online pages. The HTML rendering is fully preserved, and all annotations consist of new HTML spans with specific styles. WebAnnotator provides an easy, general-purpose framework and is made available under the CeCILL free license (close to the GNU GPL), so that use and further contributions are simple. All parts of an HTML document can be annotated: text, images, videos, tables, menus, etc. Annotations are created by simply selecting a part of the document and clicking on the relevant type and subtypes. The annotated elements are then highlighted in a specific color. Users can define annotation schemas by creating a simple DTD representing the types and subtypes that must be highlighted. Finally, annotations can be saved (as HTML with the highlighted parts of the document) or exported (in a machine-readable format).
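The idea of an annotation as a new HTML span with a specific style can be illustrated with a minimal Python sketch. This is not WebAnnotator's code or output format: the function name `annotate`, the `class` and `data-type` attributes, and the color value are assumptions made purely for illustration, and the sketch works on a plain text string rather than on a rendered page.

```python
from html import escape


def annotate(text: str, start: int, end: int, ann_type: str, color: str) -> str:
    """Wrap text[start:end] in a styled span.

    The markup (class name, data-type attribute) is hypothetical and only
    illustrates the "annotation = styled span" idea from the abstract.
    """
    if not 0 <= start < end <= len(text):
        raise ValueError("selection out of range")
    span = (
        f'<span class="annotation" data-type="{escape(ann_type, quote=True)}" '
        f'style="background-color: {escape(color, quote=True)}">'
        f"{escape(text[start:end])}</span>"
    )
    # Text outside the selection is escaped but otherwise left unchanged.
    return escape(text[:start]) + span + escape(text[end:])


if __name__ == "__main__":
    print(annotate("Paris is nice", 0, 5, "location", "yellow"))
```

Because the surrounding text is left as-is and only a span is added around the selection, the rest of the content is unaffected, which mirrors the abstract's point that the original rendering is preserved.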
Anthology ID:
L12-1021
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
316–319
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/148_Paper.pdf
Cite (ACL):
Xavier Tannier. 2012. WebAnnotator, an Annotation Tool for Web Pages. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 316–319, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
WebAnnotator, an Annotation Tool for Web Pages (Tannier, LREC 2012)