Computational Audiovisual Scene Analysis

Yan, Rujiao

Yan R (2014)
Bielefeld: Universitätsbibliothek Bielefeld.

Bielefeld E-Dissertation | English

URN

urn:nbn:de:hbz:361-26953860

Author

Yan, Rujiao

Reviewer / Supervisor

Wrede, Britta (UniBi); Rodemann, Tobias

Institution

Technische Fakultät

Abstract / Note

In most real-world situations, a robot interacts with multiple people, so understanding the dialog is essential. However, most existing human-robot interaction systems lack dialog scene analysis: either only one speaker can talk with the robot, or each speaker must wear an attached microphone or headset. The goal of Computational AudioVisual Scene Analysis (CAVSA) is therefore to make dialogs between humans and robots more natural and flexible. The CAVSA system is able to learn how many speakers are in the scenario, where they are, and who is currently speaking. CAVSA is a challenging task because of the complexity of dialog scenarios. First, the speakers are not known in advance, so no database is available for training high-level features to recognize faces or voices beforehand. Second, people can enter and leave the scene dynamically, may move all the time, and may even change their locations while outside the camera's field of view. Third, the robot cannot see all the people at the same time because of its limited camera field of view and its head movements. Moreover, a sound may come from a person who stands outside the camera's field of view and has never been seen. I will show that the CAVSA system is able to assign words to the corresponding speakers. A speaker is recognized again after leaving and re-entering the scene, or after changing position, even when a new person appears.

Keywords

Multimodal Interface; Audiovisual Integration; Scene Analysis; Human-Robot Interaction

Year

2014

Page URI

https://pub.uni-bielefeld.de/record/2695386

Cite

Yan R. Computational Audiovisual Scene Analysis. Bielefeld: Universitätsbibliothek Bielefeld; 2014.

Yan, R. (2014). Computational Audiovisual Scene Analysis. Bielefeld: Universitätsbibliothek Bielefeld.

Yan, Rujiao. 2014. Computational Audiovisual Scene Analysis. Bielefeld: Universitätsbibliothek Bielefeld.

Yan, R., 2014. Computational Audiovisual Scene Analysis, Bielefeld: Universitätsbibliothek Bielefeld.

R. Yan, Computational Audiovisual Scene Analysis, Bielefeld: Universitätsbibliothek Bielefeld, 2014.

Yan, R.: Computational Audiovisual Scene Analysis. Universitätsbibliothek Bielefeld, Bielefeld (2014).

Yan, Rujiao. Computational Audiovisual Scene Analysis. Bielefeld: Universitätsbibliothek Bielefeld, 2014.

All files are available under the following license(s):

Copyright Statement:

This object is protected by copyright and/or related rights. [...]

Full text(s)

Name

thesis.pdf

Access Level

Open Access

Last uploaded

2019-09-06T09:18:26Z

MD5 checksum

3994ef7fb984af60f767e8c16542923b
