Integration of Vision and Speech Understanding Using Bayesian Networks

Wachsmuth S, Socher G, Brandt-Pook H, Kummert F, Sagerer G (2000)
Videre: A Journal of Computer Vision Research 1(4): 61-83.

Zeitschriftenaufsatz | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
forms.journal_article.field.editor_solo.label
Brown, Chris M.; Sandini, Giulio
Abstract / Bemerkung
The interaction of image and speech processing is a crucial property of multimedia systems. Classical systems using inferences on pure qualitative high-level descriptions miss much information when concerned with erroneous, vague, or incomplete data. We propose a new architecture that integrates various levels of processing by using multiple representations of the visually observed scene. The representations are vertically connected by Bayesian networks in order to find the most plausible interpretation of the scene. The interpretation of a spoken utterance naming an object in the visually observed scene is modeled as another partial representation of the scene. Using this concept, the key problem is the identification of the verbally specified object instances in the visually observed scene. Therefore, a Bayesian network is generated dynamically from the spoken utterance and the visual scene representation.
Erscheinungsjahr
2000
Zeitschriftentitel
Videre: A Journal of Computer Vision Research
Band
1
Ausgabe
4
Seite(n)
61-83
ISSN
1089-2788
Page URI
https://pub.uni-bielefeld.de/record/1889160

Zitieren

Wachsmuth S, Socher G, Brandt-Pook H, Kummert F, Sagerer G. Integration of Vision and Speech Understanding Using Bayesian Networks. Videre: A Journal of Computer Vision Research. 2000;1(4):61-83.
Wachsmuth, S., Socher, G., Brandt-Pook, H., Kummert, F., & Sagerer, G. (2000). Integration of Vision and Speech Understanding Using Bayesian Networks. Videre: A Journal of Computer Vision Research, 1(4), 61-83.
Wachsmuth, Sven, Socher, Gudrun, Brandt-Pook, Hans, Kummert, Franz, and Sagerer, Gerhard. 2000. “Integration of Vision and Speech Understanding Using Bayesian Networks”. Videre: A Journal of Computer Vision Research 1 (4): 61-83.
Wachsmuth, S., Socher, G., Brandt-Pook, H., Kummert, F., and Sagerer, G. (2000). Integration of Vision and Speech Understanding Using Bayesian Networks. Videre: A Journal of Computer Vision Research 1, 61-83.
Wachsmuth, S., et al., 2000. Integration of Vision and Speech Understanding Using Bayesian Networks. Videre: A Journal of Computer Vision Research, 1(4), p 61-83.
S. Wachsmuth, et al., “Integration of Vision and Speech Understanding Using Bayesian Networks”, Videre: A Journal of Computer Vision Research, vol. 1, 2000, pp. 61-83.
Wachsmuth, S., Socher, G., Brandt-Pook, H., Kummert, F., Sagerer, G.: Integration of Vision and Speech Understanding Using Bayesian Networks. Videre: A Journal of Computer Vision Research. 1, 61-83 (2000).
Wachsmuth, Sven, Socher, Gudrun, Brandt-Pook, Hans, Kummert, Franz, and Sagerer, Gerhard. “Integration of Vision and Speech Understanding Using Bayesian Networks”. Videre: A Journal of Computer Vision Research 1.4 (2000): 61-83.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar