Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

39 Publications


2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat | Files available | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020 Virtual Barcelona Spain.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020 Virtual Shanghai China.
LibreCat | Files available
 

2018 | Conference Paper | LibreCat-ID: 17557
Abramov, O., Kopp, S., Nemeth, A., Kern, F., Mertens, U., & Rohlfing, K. (2018). Towards a Computational Model of Child Gesture-Speech Production. KOGWIS2018: Computational Approaches to Cognitive Science.
LibreCat
 

2018 | Conference Paper | LibreCat-ID: 17179
Abramov, O., Kopp, S., Nemeth, A., Kern, F., Mertens, U., & Rohlfing, K. (2018). Towards a Computational Model of Child Gesture-Speech Production. KOGWIS2018: Computational Approaches to Cognitive Science.
LibreCat
 

2015 | Conference Paper | LibreCat-ID: 11739 | OA
Chinaev, A., & Haeb-Umbach, R. (2015). On Optimal Smoothing in Minimum Statistics Based Noise Tracking. In Interspeech 2015 (pp. 1785–1789).
LibreCat | Files available | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11813 | OA
Heymann, J., Haeb-Umbach, R., Golik, P., & Schlueter, R. (2015). Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on (pp. 5053–5057). https://doi.org/10.1109/ICASSP.2015.7178933
LibreCat | DOI | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11753 | OA
Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014) (pp. 213–217).
LibreCat | Files available | Download (ext.)
 

2014 | Journal Article | LibreCat-ID: 11861
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2014). A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(1), 95–109. https://doi.org/10.1109/TASLP.2013.2285480
LibreCat | DOI
 

2014 | Journal Article | LibreCat-ID: 11867 | OA
Li, J., Deng, L., Gong, Y., & Haeb-Umbach, R. (2014). An Overview of Noise-Robust Automatic Speech Recognition. IEEE Transactions on Audio, Speech and Language Processing, 22(4), 745–777. https://doi.org/10.1109/TASLP.2014.2304637
LibreCat | DOI | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11716
Abdelaziz, A. H., Zeiler, S., Kolossa, D., Leutnant, V., & Haeb-Umbach, R. (2013). GMM-based significance decoding. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 6827–6831). https://doi.org/10.1109/ICASSP.2013.6638984
LibreCat | DOI
 

2013 | Conference Paper | LibreCat-ID: 11841 | OA
Kinoshita, K., Delcroix, M., Yoshioka, T., Nakatani, T., Habets, E., Haeb-Umbach, R., … Raj, B. (2013). The reverb challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 22–23).
LibreCat | Download (ext.)
 

2013 | Journal Article | LibreCat-ID: 11862
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2013). Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 21(8), 1640–1652. https://doi.org/10.1109/TASL.2013.2258013
LibreCat | DOI
 

2013 | Conference Paper | LibreCat-ID: 11917
Vu, D. H. T., & Haeb-Umbach, R. (2013). Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 863–867). https://doi.org/10.1109/ICASSP.2013.6637771
LibreCat | DOI
 

2012 | Conference Paper | LibreCat-ID: 11745 | OA
Chinaev, A., Krueger, A., Tran Vu, D. H., & Haeb-Umbach, R. (2012). Improved Noise Power Spectral Density Tracking by a MAP-based Postprocessor. In 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012).
LibreCat | Files available | Download (ext.)
 

2012 | Conference Paper | LibreCat-ID: 11864 | OA
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on.
LibreCat | Download (ext.)
 

2011 | Journal Article | LibreCat-ID: 11850 | OA
Krueger, A., Warsitz, E., & Haeb-Umbach, R. (2011). Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. IEEE Transactions on Audio, Speech, and Language Processing, 19(1), 206–219. https://doi.org/10.1109/TASL.2010.2047324
LibreCat | DOI | Download (ext.)
 

2011 | Journal Article | LibreCat-ID: 17233
Fischer, K., Foth, K., Rohlfing, K., & Wrede, B. (2011). Mindful tutors: Linguistic choice and action demonstration in speech to infants and a simulated robot. Interaction Studies, 12(1), 134–161. https://doi.org/10.1075/is.12.1.06fis
LibreCat | DOI
 

2010 | Journal Article | LibreCat-ID: 11846 | OA
Krueger, A., & Haeb-Umbach, R. (2010). Model-Based Feature Enhancement for Reverberant Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 18(7), 1692–1707. https://doi.org/10.1109/TASL.2010.2049684
LibreCat | DOI | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11913 | OA
Tran Vu, D. H., & Haeb-Umbach, R. (2010). Blind speech separation employing directional statistics in an Expectation Maximization framework. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010) (pp. 241–244). https://doi.org/10.1109/ICASSP.2010.5495994
LibreCat | DOI | Download (ext.)
 

2010 | Journal Article | LibreCat-ID: 11892 | OA
Schmalenstroeer, J., & Haeb-Umbach, R. (2010). Online Diarization of Streaming Audio-Visual Data for Smart Environments. IEEE Journal of Selected Topics in Signal Processing, 4(5), 845–856. https://doi.org/10.1109/JSTSP.2010.2050519
LibreCat | DOI | Download (ext.)
 

2009 | Journal Article | LibreCat-ID: 11937 | OA
Windmann, S., & Haeb-Umbach, R. (2009). Approaches to Iterative Speech Feature Enhancement and Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(5), 974–984. https://doi.org/10.1109/TASL.2009.2014894
LibreCat | DOI | Download (ext.)
 

2009 | Journal Article | LibreCat-ID: 11938 | OA
Windmann, S., & Haeb-Umbach, R. (2009). Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(8), 1577–1590. https://doi.org/10.1109/TASL.2009.2023172
LibreCat | DOI | Download (ext.)
 

2009 | Conference Paper | LibreCat-ID: 17272
Vollmer, A.-L., Lohan, K. S., Fischer, K., Nagai, Y., Pitsch, K., Fritsch, J., Rohlfing, K., & Wrede, B. (2009). People modify their tutoring behavior in robot-directed interaction for action learning. Development and Learning, 2009. ICDL 2009. IEEE 8th International Conference on Development and Learning, 1–6. https://doi.org/10.1109/DEVLRN.2009.5175516
LibreCat | DOI
 

2008 | Journal Article | LibreCat-ID: 11820 | OA
Ion, V., & Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1047–1060. https://doi.org/10.1109/TASL.2008.925879
LibreCat | DOI | Download (ext.)
 

2008 | Conference Paper | LibreCat-ID: 11935 | OA
Warsitz, E., Krueger, A., & Haeb-Umbach, R. (2008). Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 73–76). https://doi.org/10.1109/ICASSP.2008.4517549
LibreCat | DOI | Download (ext.)
 

2008 | Conference Paper | LibreCat-ID: 11939 | OA
Windmann, S., & Haeb-Umbach, R. (2008). Modeling the dynamics of speech and noise for speech feature enhancement in ASR. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 4409–4412). https://doi.org/10.1109/ICASSP.2008.4518633
LibreCat | DOI | Download (ext.)
 

2008 | Conference Paper | LibreCat-ID: 17278
Lohse, M., Rohlfing, K., Wrede, B., & Sagerer, G. (2008). “Try something else!” — When users change their discursive behavior in human-robot interaction. 3481–3486. https://doi.org/10.1109/ROBOT.2008.4543743
LibreCat | DOI
 

2006 | Conference Paper | LibreCat-ID: 11824 | OA
Ion, V., & Haeb-Umbach, R. (2006). An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1659984
LibreCat | DOI | Download (ext.)
 

2006 | Journal Article | LibreCat-ID: 11825 | OA
Ion, V., & Haeb-Umbach, R. (2006). Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Communication, 48(11), 1435–1446. https://doi.org/10.1016/j.specom.2006.03.007
LibreCat | DOI | Download (ext.)
 

2006 | Conference Paper | LibreCat-ID: 11943 | OA
Windmann, S., & Haeb-Umbach, R. (2006). Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1660058
LibreCat | DOI | Download (ext.)
 

2005 | Conference Paper | LibreCat-ID: 11828 | OA
Ion, V., & Haeb-Umbach, R. (2005). A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005) (Vol. 1, pp. 333–336). https://doi.org/10.1109/ICASSP.2005.1415118
LibreCat | DOI | Download (ext.)
 

2004 | Conference Paper | LibreCat-ID: 11931 | OA
Warsitz, E., & Haeb-Umbach, R. (2004). Robust speaker direction estimation with particle filtering. In IEEE Workshop on Multimedia Signal Processing (MMSP 2004) (pp. 367–370). https://doi.org/10.1109/MMSP.2004.1436569
LibreCat | DOI | Download (ext.)
 

2004 | Conference Paper | LibreCat-ID: 39053
Müller, W., Schäfer, R., & Bleul, S. (2004). Interactive Multimodal User Interfaces for Mobile Devices. Proceedings of HICCS-37. 37th Annual Hawaii International Conference on System Sciences, Waikoloa, HI, USA. https://doi.org/10.1109/HICSS.2004.1265674
LibreCat | DOI
 

2001 | Journal Article | LibreCat-ID: 11778 | OA
Haeb-Umbach, R. (2001). Automatic generation of phonetic regression class trees for MLLR adaptation. IEEE Transactions on Speech and Audio Processing, 9(3), 299–302. https://doi.org/10.1109/89.906003
LibreCat | DOI | Download (ext.)
 

2000 | Mastersthesis | LibreCat-ID: 2433
Plessl, C., & Maurer, S. (2000). Hardware/Software Codesign in Speech Compression Applications. Computer Engineering and Networks Lab, ETH Zurich, Switzerland.
LibreCat
 

2000 | Conference Paper | LibreCat-ID: 11869 | OA
Lieb, M., & Haeb-Umbach, R. (2000). LDA derived cepstral trajectory filters in adverse environmental conditions. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000) (Vol. 2, pp. II1105-II1108 vol.2). https://doi.org/10.1109/ICASSP.2000.859157
LibreCat | DOI | Download (ext.)
 

Filters and Search Terms

keyword="Speech"

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed