A corpus-based approach for the induction of ontology lexica

Walter S, Unger C, Cimiano P (2013)
Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013).

Konferenzbeitrag | Veröffentlicht | Englisch
 
Download
OA
Herausgeber*in
Métais, Elisabeth; Meziane, Farid; Saraee, Mohamed; Sugumaran, Vijay; Vadera, Sunil
Abstract / Bemerkung
While there are many large knowledge bases (e.g. Freebase, Yago, DBpedia) as well as linked data sets available on the web, they typically lack lexical information stating how the properties and classes are realized lexically. If at all, typically only one label is attached to these properties, thus lacking any deeper syntactic information, e.g. about syntactic arguments and how these map to the semantic arguments of the property as well as about possible lexical variants or paraphrases. While there are lexicon models such as \emph{lemon} allowing to define a lexicon for a given ontology, the cost involved in creating and maintaining such lexica is substantial, requiring a high manual effort. Towards lowering this effort, in this paper we present a semi-automatic approach that exploits a corpus to find occurrences in which a given property is expressed, and generalizing over these occurrences by extracting dependency paths that can be used as a basis to create lemon lexicon entries. We evaluate the resulting automatically generated lexica with respect to DBpedia as dataset and Wikipedia as corresponding corpus, both in an automatic mode, by comparing to a manually created lexicon, and in a semi-automatic mode in which a lexicon engineer inspected the results of the corpus-based approach, adding them to the existing lexicon if appropriate.
Stichworte
corpus-based approach; lemon; ontology lexicalization
Erscheinungsjahr
2013
Konferenz
18th International Conference on Application of Natural Language to Information Systems (NLDB 2013)
Page URI
https://pub.uni-bielefeld.de/record/2584967

Zitieren

Walter S, Unger C, Cimiano P. A corpus-based approach for the induction of ontology lexica. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013).
Walter, S., Unger, C., & Cimiano, P. (2013). A corpus-based approach for the induction of ontology lexica. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013).
Walter, Sebastian, Unger, Christina, and Cimiano, Philipp. 2013. “A corpus-based approach for the induction of ontology lexica”. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013) , ed. Elisabeth Métais, Farid Meziane, Mohamed Saraee, Vijay Sugumaran, and Sunil Vadera. Springer, LNCS.
Walter, S., Unger, C., and Cimiano, P. (2013).“A corpus-based approach for the induction of ontology lexica”. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013).
Walter, S., Unger, C., & Cimiano, P., 2013. A corpus-based approach for the induction of ontology lexica. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013)
S. Walter, C. Unger, and P. Cimiano, “A corpus-based approach for the induction of ontology lexica”, Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013), Springer, LNCS, 2013.
Walter, S., Unger, C., Cimiano, P.: A corpus-based approach for the induction of ontology lexica. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013) (2013).
Walter, Sebastian, Unger, Christina, and Cimiano, Philipp. “A corpus-based approach for the induction of ontology lexica”. Presented at the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013), Springer, LNCS, 2013.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Volltext(e)
Access Level
OA Open Access
Zuletzt Hochgeladen
2019-09-06T09:18:13Z
MD5 Prüfsumme
3ab4ade91ea7665e155ef917201e4807


Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar