Bayesian Networks for Speech and Image Integration

Wachsmuth, Sven; Sagerer, Gerhard

Wachsmuth S, Sagerer G (2002)
In: Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002). Edmonton, Alberta, Canada: 300-306.

Conference paper | Published | English

Author(s)

Wachsmuth, Sven (UniBi); Sagerer, Gerhard (UniBi)

Department

Technische Fakultät (Faculty of Technology) > AG Angewandte Informatik (Applied Informatics Group)
SFB 360 Situierte Künstliche Kommunikatoren (Situated Artificial Communicators)

Abstract / Notes

The realization of natural human-computer interfaces is hampered by noisy data, vague meanings, and context dependence. An essential aspect of everyday communication is the ability of humans to ground verbal interpretations in visual perception. A system therefore has to solve the correspondence problem of relating verbal and visual descriptions of the same object. This contribution proposes a new solution to this problem using Bayesian networks. To capture the vague meanings of adjectives used by the speaker, psycholinguistic experiments are evaluated. Object recognition errors are taken into account through conditional probabilities estimated on test sets. The Bayesian network is built dynamically from the verbal object description and is evaluated by an inference technique that combines bucket elimination and conditioning. Results show that speech and image data are interpreted more robustly in combination than in isolation.
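The correspondence problem described in the abstract can be illustrated with a toy model. The following is a minimal sketch, not the authors' network: the object classes, the probability tables, and the function name `posterior_referent` are all hypothetical, and inference is done by brute-force enumeration rather than the bucket elimination and conditioning named in the abstract. It only shows how a confusion matrix for object recognition and a word model can be combined to decide which visible object a spoken word refers to.

```python
from itertools import product

# Hypothetical object classes (assumption, not taken from the paper).
TYPES = ["cube", "bar", "bolt"]

# P(recognized label | true type): assumed confusion matrix of the vision
# module, of the kind that could be estimated on a test set.
P_VISION = {
    "cube": {"cube": 0.8, "bar": 0.1, "bolt": 0.1},
    "bar":  {"cube": 0.2, "bar": 0.7, "bolt": 0.1},
    "bolt": {"cube": 0.1, "bar": 0.1, "bolt": 0.8},
}

# P(spoken word | true type of the intended object): assumed word model.
P_WORD = {
    "cube": {"cube": 0.90, "bar": 0.05, "bolt": 0.05},
    "bar":  {"cube": 0.05, "bar": 0.90, "bolt": 0.05},
    "bolt": {"cube": 0.05, "bar": 0.05, "bolt": 0.90},
}


def posterior_referent(seen, word):
    """Return P(intended object = i | vision labels, spoken word).

    seen: list of labels produced by the vision module, one per object.
    word: the noun heard in the verbal description.
    """
    n = len(seen)
    scores = [0.0] * n
    for i in range(n):  # hypothesis: the speaker means object i
        # Sum over every joint assignment of true types to the objects.
        for true_types in product(TYPES, repeat=n):
            p = 1.0 / n  # uniform prior over the intended object
            for t, v in zip(true_types, seen):
                p *= (1.0 / len(TYPES)) * P_VISION[t][v]  # prior * vision
            p *= P_WORD[true_types[i]][word]  # speech evidence
            scores[i] += p
    total = sum(scores)
    return [s / total for s in scores]


if __name__ == "__main__":
    # Vision reports a bar and a cube; the speaker says "cube".
    print(posterior_referent(["bar", "cube"], "cube"))
```

With these assumed tables, the object labeled "cube" by the vision module receives most of the posterior mass, while the object labeled "bar" keeps a small share because either the recognizer or the word model may be wrong. Enumeration is exponential in the number of objects, which is why an elimination-based scheme is preferable for larger scenes.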

Year of publication

2002

Proceedings title

Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002)

Page(s)

300-306

Page URI

https://pub.uni-bielefeld.de/record/2618698

Cite

Wachsmuth S, Sagerer G. Bayesian Networks for Speech and Image Integration. In: Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002). Edmonton, Alberta, Canada; 2002: 300-306.

Wachsmuth, S., & Sagerer, G. (2002). Bayesian Networks for Speech and Image Integration. Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002), 300-306

Wachsmuth, Sven, and Sagerer, Gerhard. 2002. “Bayesian Networks for Speech and Image Integration”. In Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002), 300-306. Edmonton, Alberta, Canada.

Wachsmuth, S., and Sagerer, G. (2002). “Bayesian Networks for Speech and Image Integration” in Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002) (Edmonton, Alberta, Canada), 300-306.

Wachsmuth, S., & Sagerer, G., 2002. Bayesian Networks for Speech and Image Integration. In Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002). Edmonton, Alberta, Canada, pp. 300-306.

S. Wachsmuth and G. Sagerer, “Bayesian Networks for Speech and Image Integration”, Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002), Edmonton, Alberta, Canada: 2002, pp. 300-306.

Wachsmuth, S., Sagerer, G.: Bayesian Networks for Speech and Image Integration. Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002). p. 300-306. Edmonton, Alberta, Canada (2002).

Wachsmuth, Sven, and Sagerer, Gerhard. “Bayesian Networks for Speech and Image Integration”. Proc. of 18th National Conf. on Artificial Intelligence (AAAI-2002). Edmonton, Alberta, Canada, 2002. 300-306.
