DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain
Baum D, Samlowski B, Winkler T, Bardeli R, Schneider D (2009)
In: GSCL Symposium Sprachtechnologie und EHumanities. 1-9.
Conference paper | English
Author(s)
Baum, Doris;
Samlowski, Barbara (UniBi);
Winkler, Thomas;
Bardeli, Rolf;
Schneider, Daniel
Abstract / Note
Systems for speech and speaker recognition already achieve low error rates when applied to high-quality audiovisual broadcast data, such as news shows recorded in a studio environment. Several evaluation corpora exist for this domain in various languages. However, in actual applications for broadcast data analysis, the data requirements are more complex. There are many data types beyond the planned speech of the news anchorperson. For example, interesting live recordings of prominent politicians are often made in environments with challenging acoustic properties. Discussions typically exhibit highly spontaneous speech, with several speakers talking at the same time. The performance of standard approaches to speech and speaker recognition typically deteriorates under such data characteristics, and dedicated techniques have to be developed to handle these problems. Corresponding evaluation corpora are needed that reflect the challenging conditions of the actual applications.
Currently, no German evaluation corpus is available which covers the required acoustic conditions and diverse language properties. This contribution describes the design of a new speaker and speech recognition evaluation corpus for the broadcast domain, reflecting the typical problems encountered in actual applications.
Keywords
ASR;
speaker recognition;
speech search;
evaluation corpus
Year of publication
2009
Proceedings title
GSCL Symposium Sprachtechnologie und EHumanities
Page(s)
1-9
Conference
Gesellschaft für Sprachtechnologie & Computerlinguistik
Conference location
Duisburg
Page URI
https://pub.uni-bielefeld.de/record/2527699
Cite
Baum D, Samlowski B, Winkler T, Bardeli R, Schneider D. DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain. In: GSCL Symposium Sprachtechnologie und EHumanities. 2009: 1-9.
Baum, D., Samlowski, B., Winkler, T., Bardeli, R., & Schneider, D. (2009). DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain. GSCL Symposium Sprachtechnologie und EHumanities, 1-9.
Baum, Doris, Samlowski, Barbara, Winkler, Thomas, Bardeli, Rolf, and Schneider, Daniel. 2009. “DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain”. In GSCL Symposium Sprachtechnologie und EHumanities, 1-9.
Baum, D., Samlowski, B., Winkler, T., Bardeli, R., and Schneider, D. (2009). “DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain” in GSCL Symposium Sprachtechnologie und EHumanities 1-9.
Baum, D., et al., 2009. DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain. In GSCL Symposium Sprachtechnologie und EHumanities. pp. 1-9.
D. Baum, et al., “DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain”, GSCL Symposium Sprachtechnologie und EHumanities, 2009, pp.1-9.
Baum, D., Samlowski, B., Winkler, T., Bardeli, R., Schneider, D.: DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain. GSCL Symposium Sprachtechnologie und EHumanities. p. 1-9. (2009).
Baum, Doris, Samlowski, Barbara, Winkler, Thomas, Bardeli, Rolf, and Schneider, Daniel. “DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain”. GSCL Symposium Sprachtechnologie und EHumanities. 2009. 1-9.
All files are available under the following license(s):
Copyright Statement:
This object is protected by copyright and/or related rights. [...]
Full text(s)
Name
Access Level
Open Access
Last uploaded
2019-09-06T09:18:06Z
MD5 checksum
a9a203721cb04a01afeaf01c0cee7c2d