On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)

Menke P, Ell B, Cimiano P (2017)
Applied Ontology 12(2): 131-155.

Zeitschriftenaufsatz | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Abstract / Bemerkung
Representing provenance information for data is of crucial importance for data reuse. This is in particular the case for language resources such as annotated corpora. NIF has been proposed as an RDF vocabulary to support the representation of text data together with annotations. However, NIF suffers from severe shortcomings with respect to its ability to represent provenance information. As a remedy to this, we present MOND, a new glue ontology that implements an interface between NIF and the PROV-O ontology to support the inclusion of provenance information into NIF annotated datasets. We first present an approach that reifies annotations and allows the attachment of any provenance metadata to annotations at arbitrary granularity. We show that this approach has an important drawback as it roughly doubles the size of the data. Building on this observation, we design the MOND glue ontology that implements a modular approach in which annotation metadata is not attached to single annotations but to modules that represent collections of annotations of the same type and origin. This yields a moderate increase in data size, while maintaining all the benefits of the first approach. We validate our approach on three use cases that represent prototypical needs in corpus work.
Erscheinungsjahr
2017
Zeitschriftentitel
Applied Ontology
Band
12
Ausgabe
2
Seite(n)
131-155
ISSN
1570-5838
eISSN
1875-8533
Page URI
https://pub.uni-bielefeld.de/record/2916592

Zitieren

Menke P, Ell B, Cimiano P. On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology. 2017;12(2):131-155.
Menke, P., Ell, B., & Cimiano, P. (2017). On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology, 12(2), 131-155. doi:10.3233/AO-170180
Menke, Peter, Ell, Basil, and Cimiano, Philipp. 2017. “On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)”. Applied Ontology 12 (2): 131-155.
Menke, P., Ell, B., and Cimiano, P. (2017). On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology 12, 131-155.
Menke, P., Ell, B., & Cimiano, P., 2017. On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology, 12(2), p 131-155.
P. Menke, B. Ell, and P. Cimiano, “On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)”, Applied Ontology, vol. 12, 2017, pp. 131-155.
Menke, P., Ell, B., Cimiano, P.: On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF). Applied Ontology. 12, 131-155 (2017).
Menke, Peter, Ell, Basil, and Cimiano, Philipp. “On the origin of annotations: A module-based approach to representing annotations in the Natural Language Processing Interchange Format (NIF)”. Applied Ontology 12.2 (2017): 131-155.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Web of Science

Dieser Datensatz im Web of Science®
Suchen in

Google Scholar