Rethinking symbolic and visual context in Referring Expression Generation

Schüz, Simeon; Gatt, Albert; Zarrieß, Sina

Schüz S, Gatt A, Zarrieß S (2023)
Frontiers in Artificial Intelligence 6: 18.

Journal article | Published | English

DOI

https://doi.org/10.3389/frai.2023.1067125

URN

urn:nbn:de:0070-pub-29697529

Authors

Schüz, Simeon (UniBi); Gatt, Albert; Zarrieß, Sina (UniBi)

Institution

Faculty of Linguistics and Literary Studies (Fakultät für Linguistik und Literaturwissenschaft)

Abstract / Note

Situational context is crucial for linguistic reference to visible objects, since the same description can refer unambiguously to an object in one context but be ambiguous or misleading in others. This also applies to Referring Expression Generation (REG), where the production of identifying descriptions always depends on a given context. Research in REG has long represented visual domains through symbolic information about objects and their properties, to determine identifying sets of target features during content determination. In recent years, research in visual REG has turned to neural modeling and recast the REG task as an inherently multimodal problem, looking at more natural settings such as generating descriptions for objects in photographs. Characterizing the precise ways in which context influences generation is challenging in both paradigms, as context notoriously lacks precise definitions and categorization. In multimodal settings, however, these problems are further exacerbated by the increased complexity and low-level representation of perceptual inputs. The main goal of this article is to provide a systematic review of the types and functions of visual context across various approaches to REG so far and to argue for integrating and extending different perspectives on visual context that currently co-exist in research on REG. By analyzing the ways in which symbolic REG integrates context in rule-based approaches, we derive a set of categories of contextual integration, including the distinction between positive and negative semantic forces exerted by context during reference generation. Using this as a framework, we show that existing work in visual REG has so far considered only some of the ways in which visual context can facilitate end-to-end reference generation.
Connecting with preceding research in related areas, we highlight, as possible directions for future research, some additional ways in which contextual integration can be incorporated into REG and other multimodal generation tasks.

Keywords

Referring Expression Generation (REG); visual context; Natural Language Generation; scene context; Vision and Language; language grounding

Year of publication

2023

Journal title

Frontiers in Artificial Intelligence

Volume

6

Page(s)

18

Copyright / Licenses

Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)

eISSN

2624-8212

Funding information

Open access publication costs were funded by Bielefeld University.

Page URI

https://pub.uni-bielefeld.de/record/2969752

Cite

Schüz S, Gatt A, Zarrieß S. Rethinking symbolic and visual context in Referring Expression Generation. Frontiers in Artificial Intelligence. 2023;6:18.

Schüz, S., Gatt, A., & Zarrieß, S. (2023). Rethinking symbolic and visual context in Referring Expression Generation. Frontiers in Artificial Intelligence, 6, 18. https://doi.org/10.3389/frai.2023.1067125
