Using Speech in Visual Object Recognition

Wachsmuth S, Fink GA, Kummert F, Sagerer G (2000)
In: Informatik Aktuell. Sommer G, Krüger N, Perwass C (Eds);Springer: 428-435.

Conference Paper | Published | English

No fulltext has been uploaded

Editor
Sommer, G. ; Krüger, N. ; Perwass, C.
Abstract
Automatic understanding of multi-modal input is the central topic in modern human computer interfaces. But the basic questions about how the interpretations provided by different modalities can be connected in a universal and robust manner is still an open problem. The most intuitive input modalities, speech perception and vision, can only be correlated on a qualitative content based interpretation level. But, due to vague meanings and erroneous processing results this is extremely difficult to accomplish. A simple frame based integration scheme filling appropriate slots with new analysis results will fail when ambiguous or contradictory information appears. In this paper we propose a new probabilistic framework to overcome these drawbacks. The integration model is built up from data collected in labeled test sets and psycholinguistic experiments. Thereby, the correspondence problem is solved in a very robust and universal manner. In particular, we will show that erroneous visual interpretations can be corrected by a joint analysis of visual and speech input data.
Publishing Year
PUB-ID

Cite this

Wachsmuth S, Fink GA, Kummert F, Sagerer G. Using Speech in Visual Object Recognition. In: Sommer G, Krüger N, Perwass C, eds. Informatik Aktuell. Springer; 2000: 428-435.
Wachsmuth, S., Fink, G. A., Kummert, F., & Sagerer, G. (2000). Using Speech in Visual Object Recognition. In G. Sommer, N. Krüger, & C. Perwass (Eds.), Informatik Aktuell (pp. 428-435). Springer.
Wachsmuth, S., Fink, G. A., Kummert, F., and Sagerer, G. (2000). “Using Speech in Visual Object Recognition” in Informatik Aktuell, ed. G. Sommer, N. Krüger, and C. Perwass (Springer), 428-435.
Wachsmuth, S., et al., 2000. Using Speech in Visual Object Recognition. In G. Sommer, N. Krüger, & C. Perwass, eds. Informatik Aktuell. Springer, pp. 428-435.
S. Wachsmuth, et al., “Using Speech in Visual Object Recognition”, Informatik Aktuell, G. Sommer, N. Krüger, and C. Perwass, eds., Springer, 2000, pp.428-435.
Wachsmuth, S., Fink, G.A., Kummert, F., Sagerer, G.: Using Speech in Visual Object Recognition. In: Sommer, G., Krüger, N., and Perwass, C. (eds.) Informatik Aktuell. p. 428-435. Springer (2000).
Wachsmuth, Sven, Fink, Gernot A., Kummert, Franz, and Sagerer, Gerhard. “Using Speech in Visual Object Recognition”. Informatik Aktuell. Ed. G. Sommer, N. Krüger, and C. Perwass. Springer, 2000. 428-435.
This data publication is cited in the following publications:
This publication cites the following data publications:

Export

0 Marked Publications

Open Data PUB

Search this title in

Google Scholar