Integrated Analysis of Speech and Images as a Probabilistic Decoding Process
Wachsmuth S, Sagerer G (2002)
In: Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002). Québec City, Québec, Canada: IEEE: 588-592.
Konferenzbeitrag
| Veröffentlicht | Englisch
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Einrichtung
Abstract / Bemerkung
Speech understanding and vision are the two most important modalities in human-human communication. However, the emulation of these by a computer faces fundamental difficulties due to noisy data, vague meanings, previously unseen objects or unheard words, occlusions, spontaneous speech effects, and context dependence. Thus, the interpretation processes on both channels are highly error-prone. This paper presents a new perspective on the problem of relating speech and image interpretations as a probabilistic decoding process. It is shown that such an integration scheme is robust regarding partial or erroneous interpretations. Furthermore, it is shown that implicit error correction strategies can be formulated in this probabilistic framework that lead to improved scene interpretation.
Erscheinungsjahr
2002
Titel des Konferenzbandes
Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002)
Seite(n)
588-592
Page URI
https://pub.uni-bielefeld.de/record/2618720
Zitieren
Wachsmuth S, Sagerer G. Integrated Analysis of Speech and Images as a Probabilistic Decoding Process. In: Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002). Québec City, Québec, Canada: IEEE; 2002: 588-592.
Wachsmuth, S., & Sagerer, G. (2002). Integrated Analysis of Speech and Images as a Probabilistic Decoding Process. Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002), 588-592
Wachsmuth, Sven, and Sagerer, Gerhard. 2002. “Integrated Analysis of Speech and Images as a Probabilistic Decoding Process”. In Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002), 588-592. Québec City, Québec, Canada: IEEE.
Wachsmuth, S., and Sagerer, G. (2002). “Integrated Analysis of Speech and Images as a Probabilistic Decoding Process” in Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002) (Québec City, Québec, Canada: IEEE), 588-592.
Wachsmuth, S., & Sagerer, G., 2002. Integrated Analysis of Speech and Images as a Probabilistic Decoding Process. In Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002). Québec City, Québec, Canada: IEEE, pp. 588-592.
S. Wachsmuth and G. Sagerer, “Integrated Analysis of Speech and Images as a Probabilistic Decoding Process”, Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002), Québec City, Québec, Canada: IEEE, 2002, pp.588-592.
Wachsmuth, S., Sagerer, G.: Integrated Analysis of Speech and Images as a Probabilistic Decoding Process. Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002). p. 588-592. IEEE, Québec City, Québec, Canada (2002).
Wachsmuth, Sven, and Sagerer, Gerhard. “Integrated Analysis of Speech and Images as a Probabilistic Decoding Process”. Proc. of 16th Int. Conf. on Pattern Recognition (ICPR’2002). Québec City, Québec, Canada: IEEE, 2002. 588-592.