Using Speech in Visual Object Recognition

Wachsmuth S, Fink GA, Kummert F, Sagerer G (2000)
Informatik Aktuell: 428-435.

Konferenzbeitrag | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Herausgeber*in
Sommer, G.; Krüger, N.; Perwass, C.
Abstract / Bemerkung
Automatic understanding of multi-modal input is the central topic in modern human computer interfaces. But the basic questions about how the interpretations provided by different modalities can be connected in a universal and robust manner is still an open problem. The most intuitive input modalities, speech perception and vision, can only be correlated on a qualitative content based interpretation level. But, due to vague meanings and erroneous processing results this is extremely difficult to accomplish. A simple frame based integration scheme filling appropriate slots with new analysis results will fail when ambiguous or contradictory information appears. In this paper we propose a new probabilistic framework to overcome these drawbacks. The integration model is built up from data collected in labeled test sets and psycholinguistic experiments. Thereby, the correspondence problem is solved in a very robust and universal manner. In particular, we will show that erroneous visual interpretations can be corrected by a joint analysis of visual and speech input data.
Erscheinungsjahr
2000
Serien- oder Zeitschriftentitel
Informatik Aktuell
Seite(n)
428-435
Page URI
https://pub.uni-bielefeld.de/record/2618892

Zitieren

Wachsmuth S, Fink GA, Kummert F, Sagerer G. Using Speech in Visual Object Recognition. Informatik Aktuell. 2000:428-435.
Wachsmuth, S., Fink, G. A., Kummert, F., & Sagerer, G. (2000). Using Speech in Visual Object Recognition. Informatik Aktuell, 428-435.
Wachsmuth, Sven, Fink, Gernot A., Kummert, Franz, and Sagerer, Gerhard. 2000. “Using Speech in Visual Object Recognition”, Informatik Aktuell, , 428-435.
Wachsmuth, S., Fink, G. A., Kummert, F., and Sagerer, G. (2000). Using Speech in Visual Object Recognition. Informatik Aktuell, 428-435.
Wachsmuth, S., et al., 2000. Using Speech in Visual Object Recognition. Informatik Aktuell, , p 428-435.
S. Wachsmuth, et al., “Using Speech in Visual Object Recognition”, Informatik Aktuell, 2000, pp. 428-435.
Wachsmuth, S., Fink, G.A., Kummert, F., Sagerer, G.: Using Speech in Visual Object Recognition. Informatik Aktuell. 428-435 (2000).
Wachsmuth, Sven, Fink, Gernot A., Kummert, Franz, and Sagerer, Gerhard. “Using Speech in Visual Object Recognition”. Informatik Aktuell (2000): 428-435.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar