Scalable Event-based Clustering of Social Media via Record Linkage Techniques
Reuter T, Cimiano P, Drumond L, Buza K, Schmidt-Thieme L (2011)
Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona.
Conference paper
| Published | English
Author
Institution
Abstract / Note
We tackle the problem of grouping content available in social media applications such as Flickr, YouTube, Panoramio, etc. into clusters of documents describing the same event. This task has previously been referred to as event identification. We present a new formalization of the event identification task as a record linkage problem and show that this formulation leads to a principled and highly efficient solution to the problem. We present results on two datasets derived from Flickr – last.fm and upcoming – comparing the results in terms of Normalized Mutual Information and F-Measure with respect to several baselines, showing that a record linkage approach outperforms all baselines as well as a state-of-the-art system. We demonstrate that our approach can scale to large amounts of data, reducing the processing time considerably compared to a state-of-the-art approach. The scalability is achieved by applying an appropriate blocking strategy and relying on a Single Linkage clustering algorithm which avoids the exhaustive computation of pairwise similarities.
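The combination of blocking and single-linkage clustering mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the blocking key (capture day), the Jaccard similarity over tags, the threshold of 0.5, and all names (`jaccard`, `UnionFind`, `cluster_events`) are assumptions made for illustration only. The sketch compares pairs only within a block and merges any pair whose similarity reaches the threshold, which amounts to single linkage cut at that threshold.

```python
from collections import defaultdict
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two tag collections (assumed similarity measure)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


class UnionFind:
    """Disjoint-set structure used to merge linked documents."""

    def __init__(self, items):
        self.parent = {x: x for x in items}

    def find(self, x):
        # Path halving keeps the trees shallow.
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[rb] = ra


def cluster_events(docs, threshold=0.5):
    """Group documents into candidate events.

    docs maps a document id to {"day": str, "tags": list}; both field
    names are hypothetical. Returns a sorted list of sorted id lists.
    """
    # Blocking: only documents sharing a key (here the day) are compared.
    blocks = defaultdict(list)
    for doc_id, doc in docs.items():
        blocks[doc["day"]].append(doc_id)

    # Single linkage at a fixed threshold: one sufficiently similar
    # pair is enough to merge two clusters.
    uf = UnionFind(docs)
    for ids in blocks.values():
        for a, b in combinations(ids, 2):
            if jaccard(docs[a]["tags"], docs[b]["tags"]) >= threshold:
                uf.union(a, b)

    clusters = defaultdict(set)
    for doc_id in docs:
        clusters[uf.find(doc_id)].add(doc_id)
    return sorted(sorted(c) for c in clusters.values())
```

In this sketch the number of similarity computations grows with the sum of squared block sizes rather than with the square of the collection size, which is the general reason a blocking step reduces processing time.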
Year of publication
2011
Conference
Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011)
Conference location
Barcelona
Page URI
https://pub.uni-bielefeld.de/record/2278513
Cite
Reuter T, Cimiano P, Drumond L, Buza K, Schmidt-Thieme L. Scalable Event-based Clustering of Social Media via Record Linkage Techniques. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona.
Reuter, T., Cimiano, P., Drumond, L., Buza, K., & Schmidt-Thieme, L. (2011). Scalable Event-based Clustering of Social Media via Record Linkage Techniques. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona.
Reuter, Timo, Cimiano, Philipp, Drumond, Lucas, Buza, Krisztian, and Schmidt-Thieme, Lars. 2011. “Scalable Event-based Clustering of Social Media via Record Linkage Techniques”. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona.
Reuter, T., Cimiano, P., Drumond, L., Buza, K., and Schmidt-Thieme, L. (2011). “Scalable Event-based Clustering of Social Media via Record Linkage Techniques”. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona.
Reuter, T., et al., 2011. Scalable Event-based Clustering of Social Media via Record Linkage Techniques. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona.
T. Reuter, et al., “Scalable Event-based Clustering of Social Media via Record Linkage Techniques”, Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona, 2011.
Reuter, T., Cimiano, P., Drumond, L., Buza, K., Schmidt-Thieme, L.: Scalable Event-based Clustering of Social Media via Record Linkage Techniques. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona (2011).
Reuter, Timo, Cimiano, Philipp, Drumond, Lucas, Buza, Krisztian, and Schmidt-Thieme, Lars. “Scalable Event-based Clustering of Social Media via Record Linkage Techniques”. Presented at the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), Barcelona, 2011.
All files available under the following license(s):
Copyright Statement:
This object is protected by copyright and/or related rights. [...]
Full text(s)
Access Level
Open Access
Last uploaded
2019-09-06T08:57:34Z
MD5 checksum
eb30fd06649b7b2ecbd41151b8f48295