RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations

Hartung M, Zwick M (2014) : Bielefeld University. doi:10.4119/unibi/2673424.

Download
OA
Datenpublikation | Englisch
Creator
;
Abstract / Bemerkung
We release the RAMBO 800+ corpus providing manual annotations for Rare and AMBiguOus abbreviations of gene names in about 800 MEDLINE abstracts. It can be used to train gene recognition systems for this class of abbreviations, as discussed in Hartung et al. (BioNLP 2014). The corpus covers eight gene name abbreviation types: AHR, CLI, CLU, COPD, HF, MOX, PLS, SAH. For each of these types, 100 (in case of MOX: 81) abstracts have been randomly sampled from MEDLINE. In each of these abstracts, every mention of an abbreviation of interest has been manually annotated as denoting a gene/protein or not. Plus, all other tokens in the 800 abstracts have been annotated in the same way.
Erscheinungsjahr
Data Re-Use License
This RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations is made available under the Open Data Commons Attribution License: http://opendatacommons.org/licenses/by/1.0
PUB-ID

Zitieren

Hartung M, Zwick M. (2014): RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University. doi:10.4119/unibi/2673424.
Hartung, M., & Zwick, M. (2014). RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University. doi:10.4119/unibi/2673424
Hartung, M., and Zwick, M. (2014). RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University. doi:10.4119/unibi/2673424.
Hartung, M., & Zwick, M., 2014. RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University. doi:10.4119/unibi/2673424
M. Hartung and M. Zwick, RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University, 2014. doi:10.4119/unibi/2673424.
Hartung, M., Zwick, M.: RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University (2014). doi:10.4119/unibi/2673424.
Hartung, Matthias, and Zwick, Matthias. RAMBO 800+: A Corpus for the Development of Gene/Protein Recognition from Rare and Ambiguous Abbreviations. Bielefeld University, 2014. doi:10.4119/unibi/2673424
Alle Dateien verfügbar unter der/den folgenden Lizenz(en):
Volltext(e)
Access Level
OA Open Access
Zuletzt Hochgeladen
2014-07-31T16:50:28Z

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar