Using Speech in Visual Object Recognition

Wachsmuth S, Fink GA, Kummert F, Sagerer G (2000)
Informatik Aktuell: 428-435.

Download
No fulltext has been uploaded. References only!
Conference Paper | Published | English

No fulltext has been uploaded

Editor
Sommer, G. ; Krüger, N. ; Perwass, C.
Abstract
Automatic understanding of multi-modal input is the central topic in modern human computer interfaces. But the basic questions about how the interpretations provided by different modalities can be connected in a universal and robust manner is still an open problem. The most intuitive input modalities, speech perception and vision, can only be correlated on a qualitative content based interpretation level. But, due to vague meanings and erroneous processing results this is extremely difficult to accomplish. A simple frame based integration scheme filling appropriate slots with new analysis results will fail when ambiguous or contradictory information appears. In this paper we propose a new probabilistic framework to overcome these drawbacks. The integration model is built up from data collected in labeled test sets and psycholinguistic experiments. Thereby, the correspondence problem is solved in a very robust and universal manner. In particular, we will show that erroneous visual interpretations can be corrected by a joint analysis of visual and speech input data.
Publishing Year
PUB-ID

Cite this

Wachsmuth S, Fink GA, Kummert F, Sagerer G. Using Speech in Visual Object Recognition. Informatik Aktuell. 2000:428-435.
Wachsmuth, S., Fink, G. A., Kummert, F., & Sagerer, G. (2000). Using Speech in Visual Object Recognition. Informatik Aktuell, 428-435.
Wachsmuth, S., Fink, G. A., Kummert, F., and Sagerer, G. (2000). Using Speech in Visual Object Recognition. Informatik Aktuell, 428-435.
Wachsmuth, S., et al., 2000. Using Speech in Visual Object Recognition. Informatik Aktuell, , p 428-435.
S. Wachsmuth, et al., “Using Speech in Visual Object Recognition”, Informatik Aktuell, 2000, pp. 428-435.
Wachsmuth, S., Fink, G.A., Kummert, F., Sagerer, G.: Using Speech in Visual Object Recognition. Informatik Aktuell. 428-435 (2000).
Wachsmuth, Sven, Fink, Gernot A., Kummert, Franz, and Sagerer, Gerhard. “Using Speech in Visual Object Recognition”. Informatik Aktuell (2000): 428-435.
This data publication is cited in the following publications:
This publication cites the following data publications:

Export

0 Marked Publications

Open Data PUB

Search this title in

Google Scholar