On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)

Menke P, Ell B, Cimiano P (2017)
Applied Ontology 12(2): 131-155.

Download
Es wurde kein Volltext hochgeladen. Nur Publikationsnachweis!
Zeitschriftenaufsatz | Veröffentlicht | Englisch
Autor
Abstract / Bemerkung
Representing provenance information for data is of crucial importance for data reuse. This is in particular the case for language resources such as annotated corpora. NIF has been proposed as an RDF vocabulary to support the representation of text data together with annotations. However, NIF suffers from severe shortcomings with respect to its ability to represent provenance information. As a remedy to this, we present MOND, a new glue ontology that implements an interface between NIF and the PROV-O ontology to support the inclusion of provenance information into NIF annotated datasets. We first present an approach that reifies annotations and allows the attachment of any provenance metadata to annotations at arbitrary granularity. We show that this approach has an important drawback as it roughly doubles the size of the data. Building on this observation, we design the MOND glue ontology that implements a modular approach in which annotation metadata is not attached to single annotations but to modules that represent collections of annotations of the same type and origin. This yields a moderate increase in data size, while maintaining all the benefits of the first approach. We validate our approach on three use cases that represent prototypical needs in corpus work.
Erscheinungsjahr
Zeitschriftentitel
Applied Ontology
Band
12
Zeitschriftennummer
2
Seite
131-155
ISSN
eISSN
PUB-ID

Zitieren

Menke P, Ell B, Cimiano P. On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology. 2017;12(2):131-155.
Menke, P., Ell, B., & Cimiano, P. (2017). On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology, 12(2), 131-155. doi:10.3233/AO-170180
Menke, P., Ell, B., and Cimiano, P. (2017). On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology 12, 131-155.
Menke, P., Ell, B., & Cimiano, P., 2017. On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology, 12(2), p 131-155.
P. Menke, B. Ell, and P. Cimiano, “On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)”, Applied Ontology, vol. 12, 2017, pp. 131-155.
Menke, P., Ell, B., Cimiano, P.: On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology. 12, 131-155 (2017).
Menke, Peter, Ell, Basil, and Cimiano, Philipp. “On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)”. Applied Ontology 12.2 (2017): 131-155.