Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery
Gibbon D, Hughes B, Trippel T (2006)
In: From Data and Information Analysis to Knowledge Engineering. Spiliopoulou M, Kruse R, Borgelt C, Nürnberger A, Gaul W (Eds); Studies in Classification, Data Analysis, and Knowledge Organization. Berlin/Heidelberg: Springer-Verlag: 366-373.
Contribution to a collected volume
| Published | English
Author
Gibbon, Dafydd;
Hughes, Baden;
Trippel, Thorsten
Editor
Spiliopoulou, Myra;
Kruse, Rudolf;
Borgelt, Christian;
Nürnberger, Andreas;
Gaul, Wolfgang
Abstract / Note
Analysis and knowledge representation of linguistic objects tend to focus on larger units (e.g. words) than print medium characters. We analyse characters as linguistic objects in their own right, with meaning, structure and form. Characters have meaning (the symbols of the International Phonetic Alphabet denote phonetic categories, the character represented by the glyph ‘∪’ denotes set union), structure (they are composed of stems and parts such as descenders or diacritics or are ligatures), and form (they have a mapping to visual glyphs). Character encoding initiatives such as Unicode tend to concentrate on the structure and form of characters and ignore their meaning in the sense discussed here. We suggest that our approach of including semantic decomposition and defining font-based namespaces for semantic character domains provides a long-term perspective of interoperability and tractability with regard to data-mining over characters by integrating information about characters into a coherent semiotically-based ontology. We demonstrate these principles in a case study of the International Phonetic Alphabet.
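The distinction the abstract draws between structure and meaning can be illustrated with a minimal Python sketch. It is not the authors' method or ontology. The structural part uses the standard `unicodedata` module. The feature tables `BASE_FEATURES` and `DIACRITIC_FEATURES` and the function `decompose` are hypothetical names introduced here only for illustration, and their phonetic feature labels are simplified assumptions.

```python
import unicodedata

# Hypothetical semantic tables (assumptions for illustration, not from the paper):
# phonetic categories denoted by a few IPA base symbols and diacritics.
BASE_FEATURES = {
    "a": ["open", "front", "unrounded", "vowel"],
    "p": ["voiceless", "bilabial", "plosive"],
    "\u014b": ["voiced", "velar", "nasal"],  # LATIN SMALL LETTER ENG
}
DIACRITIC_FEATURES = {
    "\u0303": ["nasalized"],  # COMBINING TILDE
    "\u0325": ["voiceless"],  # COMBINING RING BELOW
}


def decompose(char):
    """Return the structural and (hypothetical) semantic parts of a character."""
    # Structure: canonical decomposition into base character and combining marks.
    parts = unicodedata.normalize("NFD", char)
    structure = [unicodedata.name(c) for c in parts]
    # Meaning: look up each part in the table matching its role.
    meaning = []
    for c in parts:
        is_mark = unicodedata.combining(c) != 0
        table = DIACRITIC_FEATURES if is_mark else BASE_FEATURES
        meaning.extend(table.get(c, []))
    return {"structure": structure, "meaning": meaning}


if __name__ == "__main__":
    print(decompose("\u00e3"))  # LATIN SMALL LETTER A WITH TILDE
```

In this sketch the Unicode character database supplies only names and the decomposition into base and diacritic, while the phonetic categories have to come from a separate table, which is the kind of gap the abstract describes.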
Year of publication
2006
Book title
From Data and Information Analysis to Knowledge Engineering
Series title
Studies in Classification, Data Analysis, and Knowledge Organization
Page(s)
366-373
ISBN
3-540-31313-3
Page URI
https://pub.uni-bielefeld.de/record/2955679
Cite
Gibbon D, Hughes B, Trippel T. Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery. In: Spiliopoulou M, Kruse R, Borgelt C, Nürnberger A, Gaul W, eds. From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Berlin/Heidelberg: Springer-Verlag; 2006: 366-373.
Gibbon, D., Hughes, B., & Trippel, T. (2006). Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery. In M. Spiliopoulou, R. Kruse, C. Borgelt, A. Nürnberger, & W. Gaul (Eds.), Studies in Classification, Data Analysis, and Knowledge Organization. From Data and Information Analysis to Knowledge Engineering (pp. 366-373). Berlin/Heidelberg: Springer-Verlag. https://doi.org/10.1007/3-540-31314-1_44
Gibbon, Dafydd, Hughes, Baden, and Trippel, Thorsten. 2006. “Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery”. In From Data and Information Analysis to Knowledge Engineering, ed. Myra Spiliopoulou, Rudolf Kruse, Christian Borgelt, Andreas Nürnberger, and Wolfgang Gaul, 366-373. Studies in Classification, Data Analysis, and Knowledge Organization. Berlin/Heidelberg: Springer-Verlag.
Gibbon, D., Hughes, B., and Trippel, T. (2006). “Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery” in From Data and Information Analysis to Knowledge Engineering, Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., and Gaul, W. eds. Studies in Classification, Data Analysis, and Knowledge Organization (Berlin/Heidelberg: Springer-Verlag), 366-373.
Gibbon, D., Hughes, B., & Trippel, T., 2006. Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery. In M. Spiliopoulou, et al., eds. From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Berlin/Heidelberg: Springer-Verlag, pp. 366-373.
D. Gibbon, B. Hughes, and T. Trippel, “Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery”, From Data and Information Analysis to Knowledge Engineering, M. Spiliopoulou, et al., eds., Studies in Classification, Data Analysis, and Knowledge Organization, Berlin/Heidelberg: Springer-Verlag, 2006, pp. 366-373.
Gibbon, D., Hughes, B., Trippel, T.: Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery. In: Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., and Gaul, W. (eds.) From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. p. 366-373. Springer-Verlag, Berlin/Heidelberg (2006).
Gibbon, Dafydd, Hughes, Baden, and Trippel, Thorsten. “Semantic Decomposition of Character Encodings for Linguistic Knowledge Discovery”. From Data and Information Analysis to Knowledge Engineering. Ed. Myra Spiliopoulou, Rudolf Kruse, Christian Borgelt, Andreas Nürnberger, and Wolfgang Gaul. Berlin/Heidelberg: Springer-Verlag, 2006. Studies in Classification, Data Analysis, and Knowledge Organization. 366-373.