Less Destructive Cleaning of Web Documents by Using Standoff Annotation

Stührenberg M (2014)
In: Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics. Bildhauer F, Schäfer R (Eds); Gothenburg, Sweden: Association for Computational Linguistics: 16-21.

Conference paper | Published | English
 
Download
No files have been uploaded. Publication record only.
Editors
Bildhauer, Felix; Schäfer, Roland
Abstract / Note
Standoff annotation, that is, the separation of primary data and markup, can be an interesting option to annotate web pages since it does not demand the removal of annotations already present in web pages. We will present a standoff serialization that allows for annotating well-formed web pages with multiple annotation layers in a single instance, easing processing and analyzing of the data.
Year of publication
2014
Title of the proceedings volume
Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics
Page(s)
16-21
Page URI
https://pub.uni-bielefeld.de/record/2941369

Cite

Stührenberg M. Less Destructive Cleaning of Web Documents by Using Standoff Annotation. In: Bildhauer F, Schäfer R, eds. Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics. Gothenburg, Sweden: Association for Computational Linguistics; 2014: 16-21.
Stührenberg, M. (2014). Less Destructive Cleaning of Web Documents by Using Standoff Annotation. In F. Bildhauer & R. Schäfer (Eds.), Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics (pp. 16-21). Gothenburg, Sweden: Association for Computational Linguistics. https://doi.org/10.3115/v1/W14-0403
Stührenberg, M. (2014). “Less Destructive Cleaning of Web Documents by Using Standoff Annotation” in Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics, Bildhauer, F., and Schäfer, R. eds. (Gothenburg, Sweden: Association for Computational Linguistics), 16-21.
Stührenberg, M., 2014. Less Destructive Cleaning of Web Documents by Using Standoff Annotation. In F. Bildhauer & R. Schäfer, eds. Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics. Gothenburg, Sweden: Association for Computational Linguistics, pp. 16-21.
M. Stührenberg, “Less Destructive Cleaning of Web Documents by Using Standoff Annotation”, Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics, F. Bildhauer and R. Schäfer, eds., Gothenburg, Sweden: Association for Computational Linguistics, 2014, pp. 16-21.
Stührenberg, M.: Less Destructive Cleaning of Web Documents by Using Standoff Annotation. In: Bildhauer, F. and Schäfer, R. (eds.) Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics. p. 16-21. Association for Computational Linguistics, Gothenburg, Sweden (2014).
Stührenberg, Maik. “Less Destructive Cleaning of Web Documents by Using Standoff Annotation”. Proceedings of the 9th Web as Corpus Workshop (WaC-9). Held in conjunction with the 14th Conference of the European Chapter of the Association for Computational Linguistics. Ed. Felix Bildhauer and Roland Schäfer. Gothenburg, Sweden: Association for Computational Linguistics, 2014. 16-21.
Link(s) to full text(s)
Access Level
Restricted Closed Access
