AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes

Schillingmann, Lars; Ernst, Jessica; Keite, Verena; Wrede, Britta; Meyer, Antje S.; Belke, Eva

AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes

Schillingmann L, Ernst J, Keite V, Wrede B, Meyer AS, Belke E (2018)
BEHAVIOR RESEARCH METHODS 50(2): 466-489.

Zeitschriftenaufsatz | Veröffentlicht | Englisch

Download

Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!

DOI

https://doi.org/10.3758/s13428-017-1002-7

Autor*in

Schillingmann, Lars^UniBi; Ernst, Jessica; Keite, Verena; Wrede, Britta^UniBi ; Meyer, Antje S.; Belke, Eva

Einrichtung

Technische Fakultät > AG Angewandte Informatik

Abstract / Bemerkung

In language production research, the latency with which speakers produce a spoken response to a stimulus and the onset and offset times of words in longer utterances are key dependent variables. Measuring these variables automatically often yields partially incorrect results. However, exact measurements through the visual inspection of the recordings are extremely time-consuming. We present AlignTool, an open-source alignment tool that establishes preliminarily the onset and offset times of words and phonemes in spoken utterances using Praat, and subsequently performs a forced alignment of the spoken utterances and their orthographic transcriptions in the automatic speech recognition system MAUS. AlignTool creates a Praat TextGrid file for inspection and manual correction by the user, if necessary. We evaluated AlignTool's performance with recordings of single-word and four-word utterances as well as semi-spontaneous speech. AlignTool performs well with audio signals with an excellent signal-to-noise ratio, requiring virtually no corrections. For audio signals of lesser quality, AlignTool still is highly functional but its results may require more frequent manual corrections. We also found that audio recordings including long silent intervals tended to pose greater difficulties for AlignTool than recordings filled with speech, which AlignTool analyzed well overall. We expect that by semi-automatizing the temporal analysis of complex utterances, AlignTool will open new avenues in language production research.

Stichworte

Language production; Time course; Voice-key; Automatic alignment

Erscheinungsjahr

2018

Zeitschriftentitel

BEHAVIOR RESEARCH METHODS

Band

Ausgabe

Seite(n)

466-489

ISSN

1554-351X

eISSN

1554-3528

Page URI

https://pub.uni-bielefeld.de/record/2919297

Zitieren

Schillingmann L, Ernst J, Keite V, Wrede B, Meyer AS, Belke E. AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes. BEHAVIOR RESEARCH METHODS. 2018;50(2):466-489.

Schillingmann, L., Ernst, J., Keite, V., Wrede, B., Meyer, A. S., & Belke, E. (2018). AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes. BEHAVIOR RESEARCH METHODS, 50(2), 466-489. doi:10.3758/s13428-017-1002-7

Schillingmann, Lars, Ernst, Jessica, Keite, Verena, Wrede, Britta, Meyer, Antje S., and Belke, Eva. 2018. “AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes”. BEHAVIOR RESEARCH METHODS 50 (2): 466-489.

Schillingmann, L., Ernst, J., Keite, V., Wrede, B., Meyer, A. S., and Belke, E. (2018). AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes. BEHAVIOR RESEARCH METHODS 50, 466-489.

Schillingmann, L., et al., 2018. AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes. BEHAVIOR RESEARCH METHODS, 50(2), p 466-489.

L. Schillingmann, et al., “AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes”, BEHAVIOR RESEARCH METHODS, vol. 50, 2018, pp. 466-489.

Schillingmann, L., Ernst, J., Keite, V., Wrede, B., Meyer, A.S., Belke, E.: AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes. BEHAVIOR RESEARCH METHODS. 50, 466-489 (2018).

Schillingmann, Lars, Ernst, Jessica, Keite, Verena, Wrede, Britta, Meyer, Antje S., and Belke, Eva. “AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes”. BEHAVIOR RESEARCH METHODS 50.2 (2018): 466-489.

Daten bereitgestellt von European Bioinformatics Institute (EBI)

Zitationen in Europe PMC

Daten bereitgestellt von Europe PubMed Central.

51 References

Daten bereitgestellt von Europe PubMed Central.

VoiceRelay: voice key operation using visual basic.
Abrams L, Jennings DT., Behav Res Methods Instrum Comput 36(4), 2004
PMID: 15641422

AUTHOR UNKNOWN, 0

RH, 1995

AUTHOR UNKNOWN, 0

Timed picture naming in seven languages.
Bates E, D'Amico S, Jacobsen T, Szekely A, Andonova E, Devescovi A, Herron D, Lu CC, Pechmann T, Pleh C, Wicha N, Federmeier K, Gerdjikova I, Gutierrez G, Hung D, Hsu J, Iyer G, Kohnert K, Mehotcheva T, Orozco-Figueroa A, Tzeng A, Tzeng O., Psychon Bull Rev 10(2), 2003
PMID: 12921412

Language play facilitates language learning: Optimizing the input for gender-like category induction.
Bebout J, Belke E., Cogn Res Princ Implic 2(1), 2017
PMID: 28275704

AUTHOR UNKNOWN, 0

Language production: Methods and methodologies.
Bock K., Psychon Bull Rev 3(4), 1996
PMID: 24213975

AUTHOR UNKNOWN, 0

H, 1996

Integrating mechanisms of visual guidance in naturalistic language production.
Coco MI, Keller F., Cogn Process 16(2), 2014
PMID: 25417005

The interplay of bottom-up and top-down mechanisms in visual guidance during object naming.
Coco MI, Malcolm GL, Keller F., Q J Exp Psychol (Hove) 67(6), 2013
PMID: 24224949

W, Journal of Experimental Psychology: Human Perception & Performance 34(), 2008

AUTHOR UNKNOWN, 0

DMDX: a windows display program with millisecond accuracy.
Forster KI, Forster JC., Behav Res Methods Instrum Comput 35(1), 2003
PMID: 12723786

Pronouncing "the" as "thee" to signal problems in speaking.
Fox Tree JE, Clark HH., Cognition 62(2), 1997
PMID: 9141905

What the eyes say about speaking.
Griffin ZM, Bock K., Psychol Sci 11(4), 2000
PMID: 11273384

ZM, 2006

JE, Journal of Memory and Language 57(), 2007

Using the visual world paradigm to study language processing: a review and critical evaluation.
Huettig F, Rommers J, Meyer AS., Acta Psychol (Amst) 137(2), 2011
PMID: 21288498

SayWhen: an automated method for high-accuracy speech onset detection.
Jansen PA, Watter S., Behav Res Methods 40(3), 2008
PMID: 18697670

AUTHOR UNKNOWN, 0

B, Journal of Memory and Language 47(), 2002

AUTHOR UNKNOWN, 0

The eye-voice span during reading aloud.
Laubrock J, Kliegl R., Front Psychol 6(), 2015
PMID: 26441800

WJM, 1989

Models of word production.
Levelt WJ., Trends Cogn. Sci. (Regul. Ed.) 3(6), 1999
PMID: 10354575

WJM, Behavioral and Brain Sciences 22(), 1999

Pause and utterance duration in child-directed speech in relation to child vocabulary size.
Marklund U, Marklund E, Lacerda F, Schwarz IC., J Child Lang 42(5), 2014
PMID: 25330786

KO, Psychological Methods 1(), 1996

C, Journal of Memory and Language 49(), 2003

Lexical frequency effects on articulation: a comparison of picture naming and reading aloud.
Mousikou P, Rastle K., Front Psychol 6(), 2015
PMID: 26528223

T, Sprache & Kognition 8(), 1989

CheckVocal: a program to facilitate checking the accuracy and response time of vocal responses from DMDX.
Protopapas A., Behav Res Methods 39(4), 2007
PMID: 18183901

AUTHOR UNKNOWN, 0

K, Journal of Experimental Psychology: Human Perception and Performance 28(), 2002

Eye movements in reading and information processing: 20 years of research.
Rayner K., Psychol Bull 124(3), 1998
PMID: 9849112

AUTHOR UNKNOWN, 0

Characterizing the bilingual disadvantage in noun phrase production.
Sadat J, Martin CD, Alario FX, Costa A., J Psycholinguist Res 41(3), 2012
PMID: 21997516

AUTHOR UNKNOWN, 0

Timed picture naming norms for 590 pictures in Dutch.
Severens E, Van Lommel S, Ratinckx E, Hartsuiker RJ., Acta Psychol (Amst) 119(2), 2005
PMID: 15877979

AUTHOR UNKNOWN, 0

Variation in dual-task performance reveals late initiation of speech planning in turn-taking.
Sjerps MJ, Meyer AS., Cognition 136(), 2014
PMID: 25522192

AUTHOR UNKNOWN, 0

The delayed trigger voice key: an improved analogue voice key for psycholinguistic research.
Tyler MD, Tyler L, Burnham DK., Behav Res Methods 37(1), 2005
PMID: 16097354

S, IEEE Signal Processing Magazine 13(), 1996

Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB