39 Publications


2023 | Journal Article | LibreCat-ID: 35602 | OA
Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023) 576–589.

2023 | Conference Paper | LibreCat-ID: 48275 | OA
MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems
T. von Neumann, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, in: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

2021 | Conference Paper | LibreCat-ID: 26770 | OA
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, in: Interspeech 2021, 2021.

2020 | Conference Paper | LibreCat-ID: 20504
Demystifying TasNet: A Dissecting Approach
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, R. Haeb-Umbach, in: ICASSP 2020 Virtual Barcelona Spain, 2020.

2020 | Conference Paper | LibreCat-ID: 20505
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments
J. Heitkaemper, J. Schmalenstroeer, R. Haeb-Umbach, in: INTERSPEECH 2020 Virtual Shanghai China, 2020.

2018 | Conference Paper | LibreCat-ID: 17557
Towards a Computational Model of Child Gesture-Speech Production
O. Abramov, S. Kopp, A. Nemeth, F. Kern, U. Mertens, K. Rohlfing, in: KOGWIS2018: Computational Approaches to Cognitive Science, 2018.

2018 | Conference Paper | LibreCat-ID: 17179
Towards a Computational Model of Child Gesture-Speech Production
O. Abramov, S. Kopp, A. Nemeth, F. Kern, U. Mertens, K. Rohlfing, in: KOGWIS2018: Computational Approaches to Cognitive Science, 2018.

2015 | Conference Paper | LibreCat-ID: 11739 | OA
On Optimal Smoothing in Minimum Statistics Based Noise Tracking
A. Chinaev, R. Haeb-Umbach, in: Interspeech 2015, 2015, pp. 1785–1789.

2015 | Conference Paper | LibreCat-ID: 11813 | OA
Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant ASR under mismatch conditions
J. Heymann, R. Haeb-Umbach, P. Golik, R. Schlueter, in: Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference On, 2015, pp. 5053–5057.

2014 | Conference Paper | LibreCat-ID: 11753 | OA
Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models
L. Drude, A. Chinaev, D.H. Tran Vu, R. Haeb-Umbach, in: 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014), 2014, pp. 213–217.

2014 | Journal Article | LibreCat-ID: 11861
A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech
V. Leutnant, A. Krueger, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (2014) 95–109.

2014 | Journal Article | LibreCat-ID: 11867 | OA
An Overview of Noise-Robust Automatic Speech Recognition
J. Li, L. Deng, Y. Gong, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (2014) 745–777.

2013 | Conference Paper | LibreCat-ID: 11716
GMM-based significance decoding
A.H. Abdelaziz, S. Zeiler, D. Kolossa, V. Leutnant, R. Haeb-Umbach, in: Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On, 2013, pp. 6827–6831.

2013 | Conference Paper | LibreCat-ID: 11841 | OA
The reverb challenge: a common evaluation framework for dereverberation and recognition of reverberant speech
K. Kinoshita, M. Delcroix, T. Yoshioka, T. Nakatani, E. Habets, R. Haeb-Umbach, V. Leutnant, A. Sehr, W. Kellermann, R. Maas, S. Gannot, B. Raj, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013, pp. 22–23.

2013 | Journal Article | LibreCat-ID: 11862
Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition
V. Leutnant, A. Krueger, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 21 (2013) 1640–1652.

2013 | Conference Paper | LibreCat-ID: 11917
Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation
D.H.T. Vu, R. Haeb-Umbach, in: 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), 2013, pp. 863–867.

2012 | Conference Paper | LibreCat-ID: 11745 | OA
Improved Noise Power Spectral Density Tracking by a MAP-based Postprocessor
A. Chinaev, A. Krueger, D.H. Tran Vu, R. Haeb-Umbach, in: 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), 2012.

2012 | Conference Paper | LibreCat-ID: 11864 | OA
A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR
V. Leutnant, A. Krueger, R. Haeb-Umbach, in: Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On, 2012.

2011 | Journal Article | LibreCat-ID: 11850 | OA
Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation
A. Krueger, E. Warsitz, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 19 (2011) 206–219.

2011 | Journal Article | LibreCat-ID: 17233
Mindful tutors: Linguistic choice and action demonstration in speech to infants and a simulated robot
K. Fischer, K. Foth, K. Rohlfing, B. Wrede, Interaction Studies 12 (2011) 134–161.

2010 | Journal Article | LibreCat-ID: 11846 | OA
Model-Based Feature Enhancement for Reverberant Speech Recognition
A. Krueger, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 18 (2010) 1692–1707.

2010 | Conference Paper | LibreCat-ID: 11913 | OA
Blind speech separation employing directional statistics in an Expectation Maximization framework
D.H. Tran Vu, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), 2010, pp. 241–244.

2010 | Journal Article | LibreCat-ID: 11892 | OA
Online Diarization of Streaming Audio-Visual Data for Smart Environments
J. Schmalenstroeer, R. Haeb-Umbach, IEEE Journal of Selected Topics in Signal Processing 4 (2010) 845–856.

2009 | Journal Article | LibreCat-ID: 11937 | OA
Approaches to Iterative Speech Feature Enhancement and Recognition
S. Windmann, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 17 (2009) 974–984.

2009 | Journal Article | LibreCat-ID: 11938 | OA
Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition
S. Windmann, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 17 (2009) 1577–1590.

2009 | Conference Paper | LibreCat-ID: 17272
People modify their tutoring behavior in robot-directed interaction for action learning
A.-L. Vollmer, K.S. Lohan, K. Fischer, Y. Nagai, K. Pitsch, J. Fritsch, K. Rohlfing, B. Wrede, in: Development and Learning, 2009. ICDL 2009. IEEE 8th International Conference on Development and Learning, IEEE, 2009, pp. 1–6.

2008 | Journal Article | LibreCat-ID: 11820 | OA
A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition
V. Ion, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 16 (2008) 1047–1060.

2008 | Conference Paper | LibreCat-ID: 11935 | OA
Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller
E. Warsitz, A. Krueger, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 2008, pp. 73–76.

2008 | Conference Paper | LibreCat-ID: 11939 | OA
Modeling the dynamics of speech and noise for speech feature enhancement in ASR
S. Windmann, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 2008, pp. 4409–4412.

2008 | Conference Paper | LibreCat-ID: 17278
“Try something else!” — When users change their discursive behavior in human-robot interaction
M. Lohse, K. Rohlfing, B. Wrede, G. Sagerer, in: 2008, pp. 3481–3486.

2006 | Conference Paper | LibreCat-ID: 11824 | OA
An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features
V. Ion, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006), 2006, p. I.

2006 | Journal Article | LibreCat-ID: 11825 | OA
Uncertainty decoding for distributed speech recognition over error-prone networks
V. Ion, R. Haeb-Umbach, Speech Communication 48 (2006) 1435–1446.

2006 | Conference Paper | LibreCat-ID: 11943 | OA
Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters
S. Windmann, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006), 2006, p. I.

2005 | Conference Paper | LibreCat-ID: 11828 | OA
A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services
V. Ion, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005), 2005, pp. 333–336.

2004 | Conference Paper | LibreCat-ID: 11931 | OA
Robust speaker direction estimation with particle filtering
E. Warsitz, R. Haeb-Umbach, in: IEEE Workshop on Multimedia Signal Processing (MMSP 2004), 2004, pp. 367–370.

2004 | Conference Paper | LibreCat-ID: 39053
Interactive Multimodal User Interfaces for Mobile Devices
W. Müller, R. Schäfer, S. Bleul, in: Proceedings of HICSS-37, Waikoloa, HI, USA, 2004.

2001 | Journal Article | LibreCat-ID: 11778 | OA
Automatic generation of phonetic regression class trees for MLLR adaptation
R. Haeb-Umbach, IEEE Transactions on Speech and Audio Processing 9 (2001) 299–302.

2000 | Master's Thesis | LibreCat-ID: 2433
Hardware/Software Codesign in Speech Compression Applications
C. Plessl, S. Maurer, Hardware/Software Codesign in Speech Compression Applications, Computer Engineering and Networks Lab, ETH Zurich, Switzerland, 2000.

2000 | Conference Paper | LibreCat-ID: 11869 | OA
LDA derived cepstral trajectory filters in adverse environmental conditions
M. Lieb, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), 2000, pp. II1105-II1108 vol.2.

Filters and Search Terms

keyword="Speech"
