Using Speech in Visual Object Recognition

Wachsmuth, Sven; Fink, Gernot A.; Kummert, Franz; Sagerer, Gerhard

Using Speech in Visual Object Recognition

Wachsmuth S, Fink GA, Kummert F, Sagerer G (2000)
Informatik Aktuell: 428-435.

Konferenzbeitrag | Veröffentlicht | Englisch

Download

Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!

Autor*in

Wachsmuth, Sven^UniBi ; Fink, Gernot A.; Kummert, Franz^UniBi; Sagerer, Gerhard^UniBi

Herausgeber*in

Sommer, G.; Krüger, N.; Perwass, C.

Einrichtung

SFB 360 Situierte Künstliche Kommunikatoren
Technische Fakultät > AG Angewandte Informatik

Abstract / Bemerkung

Automatic understanding of multi-modal input is the central topic in modern human computer interfaces. But the basic questions about how the interpretations provided by different modalities can be connected in a universal and robust manner is still an open problem. The most intuitive input modalities, speech perception and vision, can only be correlated on a qualitative content based interpretation level. But, due to vague meanings and erroneous processing results this is extremely difficult to accomplish. A simple frame based integration scheme filling appropriate slots with new analysis results will fail when ambiguous or contradictory information appears. In this paper we propose a new probabilistic framework to overcome these drawbacks. The integration model is built up from data collected in labeled test sets and psycholinguistic experiments. Thereby, the correspondence problem is solved in a very robust and universal manner. In particular, we will show that erroneous visual interpretations can be corrected by a joint analysis of visual and speech input data.

Erscheinungsjahr

2000

Serien- oder Zeitschriftentitel

Informatik Aktuell

Seite(n)

428-435

Page URI

https://pub.uni-bielefeld.de/record/2618892

Zitieren

Wachsmuth S, Fink GA, Kummert F, Sagerer G. Using Speech in Visual Object Recognition. Informatik Aktuell. 2000:428-435.

Wachsmuth, S., Fink, G. A., Kummert, F., & Sagerer, G. (2000). Using Speech in Visual Object Recognition. Informatik Aktuell, 428-435.

Wachsmuth, Sven, Fink, Gernot A., Kummert, Franz, and Sagerer, Gerhard. 2000. “Using Speech in Visual Object Recognition”, Informatik Aktuell, , 428-435.

Wachsmuth, S., Fink, G. A., Kummert, F., and Sagerer, G. (2000). Using Speech in Visual Object Recognition. Informatik Aktuell, 428-435.

Wachsmuth, S., et al., 2000. Using Speech in Visual Object Recognition. Informatik Aktuell, , p 428-435.

S. Wachsmuth, et al., “Using Speech in Visual Object Recognition”, Informatik Aktuell, 2000, pp. 428-435.

Wachsmuth, S., Fink, G.A., Kummert, F., Sagerer, G.: Using Speech in Visual Object Recognition. Informatik Aktuell. 428-435 (2000).

Wachsmuth, Sven, Fink, Gernot A., Kummert, Franz, and Sagerer, Gerhard. “Using Speech in Visual Object Recognition”. Informatik Aktuell (2000): 428-435.

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar

PUB - Publikationen an der Universität Bielefeld

Using Speech in Visual Object Recognition

Zitieren