Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population

Ell B, Hakimov S, Braukmann P, Cazzoli L, Kaupmann F, Mancino A, Altaf Memon J, Rother K, Saini A, Cimiano P (Accepted)
Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna.

Download
OA 334.89 KB
Conference Paper | Accepted | English
Author
; ; ; ; ; ; ; ; ;
Abstract
Web Table Understanding in the context of Knowledge Base Population and the Semantic Web is the task of i) linking the content of tables retrieved from the Web to an RDF knowledge base, ii) of building hypotheses about the tables' structures and contents, iii) of extracting novel information from these tables, and iv) of adding this new information to a knowledge base. Knowledge Base Population has gained more and more interest in the last years due to the increased demand in large knowledge graphs which became relevant for Artificial Intelligence applications such as Question Answering and Semantic Search. In this paper we describe a set of basic tasks which are relevant for Web Table Understanding in the mentioned context. These tasks incrementally enrich a table with hypotheses about the table's content. In doing so, in the case of multiple interpretations, selecting one interpretation and thus deciding against other interpretations is avoided as much as possible. By postponing these decision, we enable table understanding approaches to decide by themselves, thus increasing the usability of the annotated table data. We present statistics from analyzing and annotating 1.000.000 tables from the Web Table Corpus 2015 and make this dataset as well as our code available online.
Publishing Year
Conference
Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017
Location
Vienna
PUB-ID

Cite this

Ell B, Hakimov S, Braukmann P, et al. Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population. Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna.
Ell, B., Hakimov, S., Braukmann, P., Cazzoli, L., Kaupmann, F., Mancino, A., Altaf Memon, J., et al. (Accepted). Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population. Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna.
Ell, B., Hakimov, S., Braukmann, P., Cazzoli, L., Kaupmann, F., Mancino, A., Altaf Memon, J., Rother, K., Saini, A., and Cimiano, P. (Accepted).“Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population”. Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna.
Ell, B., et al., Accepted. Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population. Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna.
B. Ell, et al., “Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population”, Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna, Accepted.
Ell, B., Hakimov, S., Braukmann, P., Cazzoli, L., Kaupmann, F., Mancino, A., Altaf Memon, J., Rother, K., Saini, A., Cimiano, P.: Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population. Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna (Accepted).
Ell, Basil, Hakimov, Sherzod, Braukmann, Philipp, Cazzoli, Lorenzo, Kaupmann, Fabian, Mancino, Amerigo, Altaf Memon, Junaid, Rother, Kai, Saini, Abhishek, and Cimiano, Philipp. “Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population”. Presented at the Fifth international workshop on Linked Data for Information Extraction (LD4IE) at ISWC2017, Vienna, Accepted.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
Access Level
OA Open Access
Last Uploaded
2017-09-05T09:07:36Z

This data publication is cited in the following publications:
This publication cites the following data publications:

Export

0 Marked Publications

Open Data PUB

Search this title in

Google Scholar