Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction

ter Horst H, Hartung M, Klinger R, Brazda N, Müller HW, Cimiano P (Accepted)
In: Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB).

Download
OA 442.44 KB
Conference Paper | Accepted | English
Author
; ; ; ; ;
Abstract
Template-based information extraction generalizes over standard token-level binary relation extraction in the sense that it attempts to fill a complex template comprising multiple slots on the basis of information given in a text. In the approach presented in this paper, templates and possible fillers are defined by a given ontology. The information extraction task consists in filling these slots within a template with previously recognized entities or literal values. We cast the task as a structure prediction problem and propose a joint probabilistic model based on factor graphs to account for the interdependence in slot assignments. Inference is implemented as a heuristic building on Markov chain Monte Carlo sampling. As our main contribution, we investigate the impact of soft constraints modeled as single slot factors which measure preferences of individual slots for ranges of fillers, as well as pairwise slot factors modeling the compatibility between fillers of two slots. Instead of relying on expert knowledge to acquire such soft constraints, in our approach they are directly captured in the model and learned from training data. We show that both types of factors are effective in improving information extraction on a real-world data set of full-text papers from the biomedical domain. Pairwise factors are shown to particularly improve the performance of our extraction model by up to +0.43 points in precision, leading to an F 1 score of 0.90 for individual templates.
Publishing Year
Conference
23rd International Conference on Natural Language & Information Systems (NLDB)
Location
Paris
Conference Date
2018-06-13 – 2018-06-15
PUB-ID

Cite this

ter Horst H, Hartung M, Klinger R, Brazda N, Müller HW, Cimiano P. Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction. In: Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB). Accepted.
ter Horst, H., Hartung, M., Klinger, R., Brazda, N., Müller, H. W., & Cimiano, P. (Accepted). Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction. Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB)
ter Horst, H., Hartung, M., Klinger, R., Brazda, N., Müller, H. W., and Cimiano, P. (Accepted). “Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction” in Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB).
ter Horst, H., et al., Accepted. Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction. In Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB).
H. ter Horst, et al., “Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction”, Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB), Accepted.
ter Horst, H., Hartung, M., Klinger, R., Brazda, N., Müller, H.W., Cimiano, P.: Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction. Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB). (Accepted).
ter Horst, Hendrik, Hartung, Matthias, Klinger, Roman, Brazda, Nicole, Müller, Hans Werner, and Cimiano, Philipp. “Assessing the Impact of Single and Pairwise Slot Constraints in a Factor Graph Model for Template-based Information Extraction”. Proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB). Accepted.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
File Name
obie.pdf 442.44 KB
Access Level
OA Open Access
Last Uploaded
2018-04-10T15:23:23Z

This data publication is cited in the following publications:
This publication cites the following data publications:

Export

0 Marked Publications

Open Data PUB

Search this title in

Google Scholar