Conversational Speech Recognition Using Acoustic and Articulatory Input
Kirchhoff K, Fink GA, Sagerer G (2000)
In: IEEE International Conference on Acoustics, Speech and Signal Processing. Istanbul.
Konferenzbeitrag
| Veröffentlicht | Englisch
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Kirchhoff, Katrin;
Fink, Gernot A.;
Sagerer, GerhardUniBi
Einrichtung
Abstract / Bemerkung
The combination of multiple speech recognizers based on different signal representations is increasingly attracting interest in the speech community. In previous work we presented a hybrid speech recognition system based on the combination of acoustic and articulatory information which achieved significant word error rate reductions under highly noisy conditions on a small-vocabulary numbers recognition task. In this study we extend this approach to large-vocabulary conversational speech recognition using the Gaussian mixture acoustic modeling paradigm. We demonstrate that the articulatory input representation we propose contains information which is complementary to that provided by standard MFCC features, and their combination can significantly reduce the word error rate on a large-vocabulary, conversational speech recognition task. Various combination strategies (feature-level, state-level and word-level combination) are compared and evaluated.
Erscheinungsjahr
2000
Titel des Konferenzbandes
IEEE International Conference on Acoustics, Speech and Signal Processing
Page URI
https://pub.uni-bielefeld.de/record/2618856
Zitieren
Kirchhoff K, Fink GA, Sagerer G. Conversational Speech Recognition Using Acoustic and Articulatory Input. In: IEEE International Conference on Acoustics, Speech and Signal Processing. Istanbul; 2000.
Kirchhoff, K., Fink, G. A., & Sagerer, G. (2000). Conversational Speech Recognition Using Acoustic and Articulatory Input. IEEE International Conference on Acoustics, Speech and Signal Processing
Kirchhoff, Katrin, Fink, Gernot A., and Sagerer, Gerhard. 2000. “Conversational Speech Recognition Using Acoustic and Articulatory Input”. In IEEE International Conference on Acoustics, Speech and Signal Processing. Istanbul.
Kirchhoff, K., Fink, G. A., and Sagerer, G. (2000). “Conversational Speech Recognition Using Acoustic and Articulatory Input” in IEEE International Conference on Acoustics, Speech and Signal Processing (Istanbul).
Kirchhoff, K., Fink, G.A., & Sagerer, G., 2000. Conversational Speech Recognition Using Acoustic and Articulatory Input. In IEEE International Conference on Acoustics, Speech and Signal Processing. Istanbul.
K. Kirchhoff, G.A. Fink, and G. Sagerer, “Conversational Speech Recognition Using Acoustic and Articulatory Input”, IEEE International Conference on Acoustics, Speech and Signal Processing, Istanbul: 2000.
Kirchhoff, K., Fink, G.A., Sagerer, G.: Conversational Speech Recognition Using Acoustic and Articulatory Input. IEEE International Conference on Acoustics, Speech and Signal Processing. Istanbul (2000).
Kirchhoff, Katrin, Fink, Gernot A., and Sagerer, Gerhard. “Conversational Speech Recognition Using Acoustic and Articulatory Input”. IEEE International Conference on Acoustics, Speech and Signal Processing. Istanbul, 2000.