The USAGE review corpus for fine-grained, multi-lingual opinion analysis

Klinger R, Cimiano P (In Press)
In: Proceedings of the Language Resources and Evaluation Conference. Reykjavik, Iceland.

Conference Paper | In Press | English
Abstract
Opinion mining has received wide attention in recent years. Models for this task are typically trained or evaluated with a manually annotated dataset. However, fine-grained annotation of sentiments, including information about aspects and their evaluation, is very labour-intensive, and the data available so far is limited. To help remedy this situation, this paper describes the Bielefeld University Sentiment Analysis Corpus for German and English (USAGE), which we offer freely to the community and which contains annotations of product reviews from Amazon with both aspects and subjective phrases. It provides information on text segments that denote an aspect or a subjective evaluative phrase referring to the aspect. Relations and coreferences are explicitly annotated. The dataset contains 622 English and 611 German reviews, making it possible to investigate how to port sentiment analysis systems across languages and domains. We describe the methodology used to create the corpus and provide statistics, including inter-annotator agreement. We further provide figures for a baseline system and results for German and English as well as in a cross-domain setting. The results are encouraging in that they show that aspects and phrases can be extracted robustly without tuning to a particular type of product.

Cite this

Klinger R, Cimiano P. The USAGE review corpus for fine-grained, multi-lingual opinion analysis. In: Proceedings of the Language Resources and Evaluation Conference. Reykjavik. Iceland.; In Press.
Klinger, R., & Cimiano, P. (In Press). The USAGE review corpus for fine-grained, multi-lingual opinion analysis. Proceedings of the Language Resources and Evaluation Conference.
Klinger, R., and Cimiano, P. (In Press). “The USAGE review corpus for fine-grained, multi-lingual opinion analysis” in Proceedings of the Language Resources and Evaluation Conference (Reykjavik. Iceland.).
Klinger, R., & Cimiano, P., In Press. The USAGE review corpus for fine-grained, multi-lingual opinion analysis. In Proceedings of the Language Resources and Evaluation Conference. Reykjavik. Iceland.
R. Klinger and P. Cimiano, “The USAGE review corpus for fine-grained, multi-lingual opinion analysis”, Proceedings of the Language Resources and Evaluation Conference, Reykjavik. Iceland.: In Press.
Klinger, R., Cimiano, P.: The USAGE review corpus for fine-grained, multi-lingual opinion analysis. Proceedings of the Language Resources and Evaluation Conference. Reykjavik. Iceland. (In Press).
Klinger, Roman, and Cimiano, Philipp. “The USAGE review corpus for fine-grained, multi-lingual opinion analysis”. Proceedings of the Language Resources and Evaluation Conference. Reykjavik. Iceland., In Press.
Main File(s)
Access Level: Open Access
Last Uploaded: 2014-04-02 18:48:19

This publication cites the following data publications:
2664575
