Ranking of disease gene associations from large corpora of scientific publications
ter Horst H (2015)
Bielefeld: Bielefeld University.
Bielefelder Masterarbeit | Englisch
Autor*in
Gutachter*in / Betreuer*in
Einrichtung
Abstract / Bemerkung
The extraction of disease-gene associations from biomedical publications is a widely inves-
tigated field of research. In previous work, a frequent method was to implement natural
language processing tools that use semantic information to find such associations. How-
ever, most of these approaches are restricted to single documents. Retrieval systems that
predict novel associations across various documents often lack the ability to deal with the
huge amount of resulting candidates. In this work, we present a system that aggregates
information from a large corpora of scientific abstracts. This information is used to build a
comprehensive gene-interaction network, which is then used to predict novel disease-gene
associations. We tackle the problem of candidate reduction by integrating two separate
machine learning methods. We train a support vector machine to classify genes as disease
related or not and a support vector regression model to rank gene-candidates according to
their importance to a specific disease. Thereto, we make use of approved methods and ex-
tend them by a novel investigation of the gene-interaction network. In a model-evaluation
on two gold standards as well as in a case-study in cooperation with biomedical experts,
it is shown that the proposed methods are able to extract disease-gene-associations from
single documents and discover disease-related candidates across multiple documents.
Stichworte
machine learning;
text mining;
biomedical literature;
graph-based features;
disease-gene associations
Jahr
2015
Seite(n)
107
Page URI
https://pub.uni-bielefeld.de/record/2776749
Zitieren
ter Horst H. Ranking of disease gene associations from large corpora of scientific publications. Bielefeld: Bielefeld University; 2015.
ter Horst, H. (2015). Ranking of disease gene associations from large corpora of scientific publications. Bielefeld: Bielefeld University.
ter Horst, Hendrik. 2015. Ranking of disease gene associations from large corpora of scientific publications. Bielefeld: Bielefeld University.
ter Horst, H. (2015). Ranking of disease gene associations from large corpora of scientific publications. Bielefeld: Bielefeld University.
ter Horst, H., 2015. Ranking of disease gene associations from large corpora of scientific publications, Bielefeld: Bielefeld University.
H. ter Horst, Ranking of disease gene associations from large corpora of scientific publications, Bielefeld: Bielefeld University, 2015.
ter Horst, H.: Ranking of disease gene associations from large corpora of scientific publications. Bielefeld University, Bielefeld (2015).
ter Horst, Hendrik. Ranking of disease gene associations from large corpora of scientific publications. Bielefeld: Bielefeld University, 2015.
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Copyright Statement:
Dieses Objekt ist durch das Urheberrecht und/oder verwandte Schutzrechte geschützt. [...]
Volltext(e)
Access Level
Open Access
Zuletzt Hochgeladen
2019-09-25T06:44:31Z
MD5 Prüfsumme
3735edc903317832044f92cbcda20193