Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR
Krome N, Kopp S (2023)
In: Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23). New York, NY, USA: ACM: 1-8.
Konferenzbeitrag
| Veröffentlicht | Englisch
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Einrichtung
Abstract / Bemerkung
Extended Reality (XR) has a potential to allow social interaction for people that are distant from one another, in educational, clinical or co-working applications, as well as for scientific studies. However, a full-blown embodied social presence and interaction via avatars in XR requires motion tracking hardware that many users do not have. At the same time, modern machine learning approaches enable the synthesis of natural and life-like nonverbal behavior, but only in offline settings and with considerable lag. We evaluate the applicability of current gesture generation systems for online interaction in social XR. We define a set of requirements for real-time-capable gesture generation and propose an approach to employ a state-of-the-art model in a real-time XR interaction pipeline. To test the model under conditions of online interaction, we divide an input audio stream into chunks of different lengths and stitch the resulting gesture animations together to form continuous motion. We evaluate the quality of the resulting multimodal avatar behavior in a user study. Our results show a significant trade-off between real-time generation capabilities and gesture quality. Suggestions for future improvement to retain model performance during online interaction in Social XR are made. A project page with videos of the generated gestures is available at https://nkrome.github.io/CAGE.html.
Erscheinungsjahr
2023
Titel des Konferenzbandes
Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23)
Seite(n)
1-8
Konferenz
IVA '23: ACM International Conference on Intelligent Virtual Agents
Konferenzort
Würzburg Germany
Konferenzdatum
2023-09-19 – 2023-09-22
Page URI
https://pub.uni-bielefeld.de/record/2985532
Zitieren
Krome N, Kopp S. Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR. In: Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23). New York, NY, USA: ACM; 2023: 1-8.
Krome, N., & Kopp, S. (2023). Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR. Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23), 1-8. New York, NY, USA: ACM. https://doi.org/10.1145/3570945.3607315
Krome, Niklas, and Kopp, Stefan. 2023. “Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR”. In Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23), 1-8. New York, NY, USA: ACM.
Krome, N., and Kopp, S. (2023). “Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR” in Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23) (New York, NY, USA: ACM), 1-8.
Krome, N., & Kopp, S., 2023. Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR. In Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23). New York, NY, USA: ACM, pp. 1-8.
N. Krome and S. Kopp, “Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR”, Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23), New York, NY, USA: ACM, 2023, pp.1-8.
Krome, N., Kopp, S.: Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR. Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23). p. 1-8. ACM, New York, NY, USA (2023).
Krome, Niklas, and Kopp, Stefan. “Towards Real-time Co-speech Gesture Generation in Online Interaction in Social XR”. Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents (IVA 23). New York, NY, USA: ACM, 2023. 1-8.