An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature

Gurulingappa H, Klinger R, Hofmann-Apitius M, Fluck J (2010)
In: 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference).

Conference paper | Published | English
Abstract / Notes
Mentions of human health perturbations, such as diseases and adverse effects, constitute a special entity class in the biomedical literature. They help in understanding the underlying risk factors and in developing a preventive rationale. Recognizing these named entities in text through dictionary-based approaches relies on the availability of appropriate terminological resources. Although a few resources are publicly available, not all are suitable for text mining needs. Therefore, this work provides an overview of well-known resources for human diseases and adverse effects, such as MeSH, MedDRA, ICD-10, SNOMED CT, and UMLS. Individual dictionaries are generated from these resources, and their performance in recognizing the named entities is evaluated on a manually annotated corpus. In addition, the steps for curating the dictionaries and for rule-based acronym disambiguation, and their impact on dictionary performance, are discussed. The results show that MedDRA and UMLS achieve the best recall. In addition, MedDRA offers the benefit of higher precision. Combining the search results of all the dictionaries achieves a considerably high recall. The corpus is available at http://www.scai.fraunhofer.de/disease-ae-corpus.html
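The evaluation idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the sentence, the two toy dictionaries, and the helper names `find_mentions` and `precision_recall` are all hypothetical, and exact case-insensitive whole-word matching with exact span comparison is assumed. It shows how per-dictionary precision and recall can be computed against gold annotations, and how taking the union of the matches of several dictionaries can raise recall.

```python
import re


def find_mentions(text, terms):
    """Return the set of (start, end) character spans where any term occurs.

    Assumption: case-insensitive whole-word matching stands in for
    whatever dictionary lookup a real system would use.
    """
    spans = set()
    for term in terms:
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            spans.add((match.start(), match.end()))
    return spans


def precision_recall(predicted, gold):
    """Compute exact-span precision and recall of predicted against gold."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical example sentence and gold annotations (not from the corpus).
text = "Patients with asthma developed nausea and severe headache."
gold = find_mentions(text, {"asthma", "nausea", "headache"})

# Two hypothetical toy dictionaries; dict_a contains a noisy term.
dict_a = {"asthma", "patients"}
dict_b = {"nausea", "headache"}

hits_a = find_mentions(text, dict_a)
hits_b = find_mentions(text, dict_b)
hits_combined = hits_a | hits_b  # union of the search results

for name, hits in [("A", hits_a), ("B", hits_b), ("A+B", hits_combined)]:
    p, r = precision_recall(hits, gold)
    print(f"{name}: precision={p:.2f} recall={r:.2f}")
```

In this toy setting the combined result reaches full recall, while the noisy term in the first dictionary keeps the combined precision below that of the second dictionary alone.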
Publication year
2010
Title of the conference proceedings
2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference)
Conference location
Valletta, Malta
Cite

Gurulingappa H, Klinger R, Hofmann-Apitius M, Fluck J. An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature. In: 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference). 2010.
Gurulingappa, H., Klinger, R., Hofmann-Apitius, M., & Fluck, J. (2010). An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature. 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference)
Gurulingappa, H., Klinger, R., Hofmann-Apitius, M., and Fluck, J. (2010). “An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature” in 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference).
Gurulingappa, H., et al., 2010. An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature. In 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference).
H. Gurulingappa, et al., “An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature”, 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference), 2010.
Gurulingappa, H., Klinger, R., Hofmann-Apitius, M., Fluck, J.: An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature. 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference). (2010).
Gurulingappa, Harsha, Klinger, Roman, Hofmann-Apitius, Martin, and Fluck, Juliane. “An Empirical Evaluation of Resources for the Identification of Diseases and Adverse Effects in Biomedical Literature”. 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference). 2010.
All files are available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Full text(s)
Access Level
Open Access
Last uploaded
2013-09-26T22:01:33Z