Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR

Krome N, Kopp S (2024)
In: Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1. Piscataway, NJ: IEEE: 236-240.

Kurzbeitrag Konferenz / Poster | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Abstract / Bemerkung
Social XR applications usually require advanced tracking equipment to control one’s own avatar. We explore if AI-based co-speech gesture generation techniques can be employed to compensate for the lack of tracking hardware that many users face. One main challenge is to achieve convincing behavior quality without introducing too much latency. Previous work has shown that both depend – in opposite ways – on the length of the audio chunk the gestures are generated from, and that gesture quality of existing models declines with lower chunk sizes while still not reaching sufficiently low latency to enable fluent interaction. In this paper we present an approach that is able to generate continuous gesture trajectories frame by frame, minimizing latency and yielding delays well below buffer sizes of voice communication systems or video calls. A project page with videos of the generated gestures is available at https://nkrome.github.io/FrameCAGE.html.
Erscheinungsjahr
2024
Titel des Konferenzbandes
Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1
Seite(n)
236-240
Konferenz
AIxVR '6: IEEE International Conference on Artificial Intelligence & extended and Virtual Reality
Konferenzort
Los Angeles, USA
Konferenzdatum
2024-01-17 – 2024-01-19
eISBN
979-8-3503-7202-1
Page URI
https://pub.uni-bielefeld.de/record/2987076

Zitieren

Krome N, Kopp S. Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR. In: Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1. Piscataway, NJ: IEEE; 2024: 236-240.
Krome, N., & Kopp, S. (2024). Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR. Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1, 236-240. Piscataway, NJ: IEEE. https://doi.org/10.1109/AIxVR59861.2024.00038
Krome, Niklas, and Kopp, Stefan. 2024. “Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR”. In Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1, 236-240. Piscataway, NJ: IEEE.
Krome, N., and Kopp, S. (2024). “Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR” in Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1 (Piscataway, NJ: IEEE), 236-240.
Krome, N., & Kopp, S., 2024. Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR. In Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1. Piscataway, NJ: IEEE, pp. 236-240.
N. Krome and S. Kopp, “Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR”, Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1, Piscataway, NJ: IEEE, 2024, pp.236-240.
Krome, N., Kopp, S.: Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR. Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1. p. 236-240. IEEE, Piscataway, NJ (2024).
Krome, Niklas, and Kopp, Stefan. “Minimal Latency Speech-Driven Gesture Generation for Continuous Interaction in Social XR”. Proceedings of the 2024 IEEE International Conference on Artificial Intelligence and eXtended and Virtual Reality (AIxVR). Vol. 1. Piscataway, NJ: IEEE, 2024. 236-240.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar