Exploiting spatial descriptions in visual scene analysis
Ziegler L, Johannsen K, Swadzba A, de Ruiter J, Wachsmuth S (2012)
Cognitive Processing 13(S1): 369-374.
Journal article
| Published | English
Abstract / Note
The reliable automatic visual recognition of indoor scenes with complex object constellations using only sensor data is a nontrivial problem. In order to improve the construction of an accurate semantic 3D model of an indoor scene, we exploit human-produced verbal descriptions of the relative location of pairs of objects. This requires the ability to deal with different spatial reference frames (RF) that humans use interchangeably. In German, both the intrinsic and relative RF are used frequently, which often leads to ambiguities in referential communication. We assume that there are certain regularities that help in specific contexts.
In a first experiment, we investigated how speakers of German describe spatial relationships between different pieces of furniture. This gave us important information about the distribution of the RFs used for furniture-predicate combinations, and by implication also about the preferred spatial predicate. The results of this experiment are compiled into a computational model that extracts partial orderings of spatial arrangements between furniture items from verbal descriptions.
In the implemented system, the visual scene is initially scanned by a 3D camera system. From the 3D point cloud, we extract point clusters that suggest the presence of certain furniture objects. We then integrate the partial orderings extracted from the verbal utterances incrementally and cumulatively with the estimated probabilities about the identity and location of objects in the scene, and also estimate the probable orientation of the objects.
This allows the system to significantly improve both the accuracy and richness of its visual scene representation.
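The integration step described in the abstract can be pictured with a small sketch. It is not the authors' implementation: the reference-frame prior, the four candidate orientations, the 2D positions, the soft-constraint penalty `eps`, and all function names are assumptions made up for illustration. The sketch combines prior label probabilities of two point clusters with one utterance of the form "the chair is left of the table", marginalizes over the relative and intrinsic RF, and returns a joint posterior over object identity and the orientation of the reference object.

```python
from itertools import permutations

# Hypothetical prior over reference frames for one furniture-predicate
# combination; the numbers are invented, not taken from the paper.
RF_PRIOR = {"relative": 0.6, "intrinsic": 0.4}

# Assumed candidate facing directions of the reference object (degrees -> unit vector).
ORIENTATIONS = {0: (0.0, 1.0), 90: (-1.0, 0.0), 180: (0.0, -1.0), 270: (1.0, 0.0)}


def is_left_of(target_pos, ref_pos, front):
    """True if the target lies on the left side of the given front direction."""
    left = (-front[1], front[0])  # front rotated 90 degrees counterclockwise
    dx, dy = target_pos[0] - ref_pos[0], target_pos[1] - ref_pos[1]
    return dx * left[0] + dy * left[1] > 0


def likelihood(target_pos, ref_pos, ref_front, viewer_front=(0.0, 1.0), eps=0.05):
    """P(utterance | layout), marginalized over the reference frame.

    Relative RF uses the viewer's looking direction, intrinsic RF uses the
    facing direction of the reference object. eps is an assumed penalty for
    layouts that contradict the utterance.
    """
    total = 0.0
    for rf, p_rf in RF_PRIOR.items():
        front = viewer_front if rf == "relative" else ref_front
        total += p_rf * (1.0 if is_left_of(target_pos, ref_pos, front) else eps)
    return total


def update(clusters, target_label, ref_label):
    """Posterior over (target cluster, reference cluster, reference orientation)."""
    scores = {}
    for i, j in permutations(range(len(clusters)), 2):
        prior = (clusters[i]["labels"].get(target_label, 0.0)
                 * clusters[j]["labels"].get(ref_label, 0.0))
        for angle, front in ORIENTATIONS.items():
            lik = likelihood(clusters[i]["pos"], clusters[j]["pos"], front)
            scores[(i, j, angle)] = prior * lik / len(ORIENTATIONS)
    z = sum(scores.values())
    if z == 0.0:
        raise ValueError("utterance is incompatible with all label hypotheses")
    return {key: value / z for key, value in scores.items()}


def target_marginal(posterior, cluster_index):
    """Probability that the given cluster is the target object."""
    return sum(p for (i, _, _), p in posterior.items() if i == cluster_index)


# Two ambiguous clusters seen by a viewer looking along +y.
clusters = [
    {"pos": (-1.0, 2.0), "labels": {"chair": 0.5, "table": 0.5}},
    {"pos": (1.0, 2.0), "labels": {"chair": 0.5, "table": 0.5}},
]
posterior = update(clusters, "chair", "table")
print(target_marginal(posterior, 0))
```

Further utterances could be folded in by using the posterior of one update as the prior of the next, which mirrors the incremental and cumulative integration mentioned above. A full system would work on 3D clusters and learned RF distributions rather than these toy values.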
Keywords
Reference frames;
Spatial language;
3D perception;
Speech perception;
Scene interpretation;
Spatial cognition
Year of publication
2012
Journal title
Cognitive Processing
Volume
13
Issue
S1
Page(s)
369-374
ISSN
1612-4782
eISSN
1612-4790
Page URI
https://pub.uni-bielefeld.de/record/2528701
Cite
Ziegler L, Johannsen K, Swadzba A, de Ruiter J, Wachsmuth S. Exploiting spatial descriptions in visual scene analysis. Cognitive Processing. 2012;13(S1):369-374.
Ziegler, L., Johannsen, K., Swadzba, A., de Ruiter, J., & Wachsmuth, S. (2012). Exploiting spatial descriptions in visual scene analysis. Cognitive Processing, 13(S1), 369-374. doi:10.1007/s10339-012-0460-1
Ziegler, Leon, Johannsen, Katrin, Swadzba, Agnes, de Ruiter, Jan, and Wachsmuth, Sven. 2012. “Exploiting spatial descriptions in visual scene analysis”. Cognitive Processing 13 (S1): 369-374.
Ziegler, L., Johannsen, K., Swadzba, A., de Ruiter, J., and Wachsmuth, S. (2012). Exploiting spatial descriptions in visual scene analysis. Cognitive Processing 13, 369-374.
Ziegler, L., et al., 2012. Exploiting spatial descriptions in visual scene analysis. Cognitive Processing, 13(S1), p 369-374.
L. Ziegler, et al., “Exploiting spatial descriptions in visual scene analysis”, Cognitive Processing, vol. 13, 2012, pp. 369-374.
Ziegler, L., Johannsen, K., Swadzba, A., de Ruiter, J., Wachsmuth, S.: Exploiting spatial descriptions in visual scene analysis. Cognitive Processing. 13, 369-374 (2012).
Ziegler, Leon, Johannsen, Katrin, Swadzba, Agnes, de Ruiter, Jan, and Wachsmuth, Sven. “Exploiting spatial descriptions in visual scene analysis”. Cognitive Processing 13.S1 (2012): 369-374.
Full text(s)
Name
Ziegler_Johannsen_Swabzda_DeRuiter_Wachsmuth_2012
Access Level
UniBi Only
Last uploaded
2019-09-06T09:18:06Z
MD5 checksum
bcf0f031d3771a6be3e707d17f8c7bd7
Link(s) to full text(s)
Access Level
Closed Access
Data provided by the European Bioinformatics Institute (EBI)
1 citation in Europe PMC
Data provided by Europe PubMed Central.
Reference frame selection in dialog: priming or preference?
Johannsen K, Ruiter JP, Front Hum Neurosci 7, 2013
PMID: 24137122
Sources
PMID: 22806654