39 Publications


2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, Thilo, et al. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, Institute of Electrical and Electronics Engineers (IEEE), 2023, pp. 576–89, doi:10.1109/taslp.2022.3228629.

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, Thilo, et al. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” Interspeech 2021, 2021, doi:10.21437/interspeech.2021-1177.

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, Jens, et al. “Demystifying TasNet: A Dissecting Approach.” ICASSP 2020, Virtual, Barcelona, Spain, 2020.

2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper, Jens, et al. “Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments.” INTERSPEECH 2020, Virtual, Shanghai, China, 2020.

2018 | Conference Paper | LibreCat-ID: 17557
Abramov, Olga, et al. “Towards a Computational Model of Child Gesture-Speech Production.” KOGWIS2018: Computational Approaches to Cognitive Science, 2018.

2018 | Conference Paper | LibreCat-ID: 17179
Abramov, Olga, et al. “Towards a Computational Model of Child Gesture-Speech Production.” KOGWIS2018: Computational Approaches to Cognitive Science, 2018.

2015 | Conference Paper | LibreCat-ID: 11739 | OA
Chinaev, Aleksej, and Reinhold Haeb-Umbach. “On Optimal Smoothing in Minimum Statistics Based Noise Tracking.” Interspeech 2015, 2015, pp. 1785–89.

2015 | Conference Paper | LibreCat-ID: 11813 | OA
Heymann, Jahn, et al. “Unsupervised Adaptation of a Denoising Autoencoder by Bayesian Feature Enhancement for Reverberant ASR under Mismatch Conditions.” Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference On, 2015, pp. 5053–57, doi:10.1109/ICASSP.2015.7178933.

2014 | Conference Paper | LibreCat-ID: 11753 | OA
Drude, Lukas, et al. “Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models.” 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014), 2014, pp. 213–17.

2014 | Journal Article | LibreCat-ID: 11861
Leutnant, Volker, et al. “A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 1, 2014, pp. 95–109, doi:10.1109/TASLP.2013.2285480.

2014 | Journal Article | LibreCat-ID: 11867 | OA
Li, Jinyu, et al. “An Overview of Noise-Robust Automatic Speech Recognition.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 4, 2014, pp. 745–77, doi:10.1109/TASLP.2014.2304637.

2013 | Conference Paper | LibreCat-ID: 11716
Abdelaziz, Ahmed H., et al. “GMM-Based Significance Decoding.” Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On, 2013, pp. 6827–31, doi:10.1109/ICASSP.2013.6638984.

2013 | Conference Paper | LibreCat-ID: 11841 | OA
Kinoshita, Keisuke, et al. “The Reverb Challenge: A Common Evaluation Framework for Dereverberation and Recognition of Reverberant Speech.” IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013, pp. 22–23.

2013 | Journal Article | LibreCat-ID: 11862
Leutnant, Volker, et al. “Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 8, 2013, pp. 1640–52, doi:10.1109/TASL.2013.2258013.

2013 | Conference Paper | LibreCat-ID: 11917
Vu, Dang Hai Tran, and Reinhold Haeb-Umbach. “Using the Turbo Principle for Exploiting Temporal and Spectral Correlations in Speech Presence Probability Estimation.” 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), 2013, pp. 863–67, doi:10.1109/ICASSP.2013.6637771.

2012 | Conference Paper | LibreCat-ID: 11745 | OA
Chinaev, Aleksej, et al. “Improved Noise Power Spectral Density Tracking by a MAP-Based Postprocessor.” 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), 2012.

2012 | Conference Paper | LibreCat-ID: 11864 | OA
Leutnant, Volker, et al. “A Statistical Observation Model For Noisy Reverberant Speech Features and Its Application to Robust ASR.” Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On, 2012.

2011 | Journal Article | LibreCat-ID: 11850 | OA
Krueger, Alexander, et al. “Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 1, 2011, pp. 206–19, doi:10.1109/TASL.2010.2047324.

2011 | Journal Article | LibreCat-ID: 17233
Fischer, Kerstin, et al. “Mindful Tutors: Linguistic Choice and Action Demonstration in Speech to Infants and a Simulated Robot.” Interaction Studies, vol. 12, no. 1, John Benjamins Publishing Company, 2011, pp. 134–61, doi:10.1075/is.12.1.06fis.

2010 | Journal Article | LibreCat-ID: 11846 | OA
Krueger, Alexander, and Reinhold Haeb-Umbach. “Model-Based Feature Enhancement for Reverberant Speech Recognition.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 7, 2010, pp. 1692–707, doi:10.1109/TASL.2010.2049684.

2010 | Conference Paper | LibreCat-ID: 11913 | OA
Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “Blind Speech Separation Employing Directional Statistics in an Expectation Maximization Framework.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), 2010, pp. 241–44, doi:10.1109/ICASSP.2010.5495994.

2010 | Journal Article | LibreCat-ID: 11892 | OA
Schmalenstroeer, Joerg, and Reinhold Haeb-Umbach. “Online Diarization of Streaming Audio-Visual Data for Smart Environments.” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, 2010, pp. 845–56, doi:10.1109/JSTSP.2010.2050519.

2009 | Journal Article | LibreCat-ID: 11937 | OA
Windmann, Stefan, and Reinhold Haeb-Umbach. “Approaches to Iterative Speech Feature Enhancement and Recognition.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 5, 2009, pp. 974–84, doi:10.1109/TASL.2009.2014894.

2009 | Journal Article | LibreCat-ID: 11938 | OA
Windmann, Stefan, and Reinhold Haeb-Umbach. “Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 8, 2009, pp. 1577–90, doi:10.1109/TASL.2009.2023172.

2009 | Conference Paper | LibreCat-ID: 17272
Vollmer, Anna-Lisa, et al. “People Modify Their Tutoring Behavior in Robot-Directed Interaction for Action Learning.” Development and Learning, 2009. ICDL 2009. IEEE 8th International Conference on Development and Learning, IEEE, 2009, pp. 1–6, doi:10.1109/DEVLRN.2009.5175516.

2008 | Journal Article | LibreCat-ID: 11820 | OA
Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 5, 2008, pp. 1047–60, doi:10.1109/TASL.2008.925879.

2008 | Conference Paper | LibreCat-ID: 11935 | OA
Warsitz, Ernst, et al. “Speech Enhancement with a New Generalized Eigenvector Blocking Matrix for Application in a Generalized Sidelobe Canceller.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 2008, pp. 73–76, doi:10.1109/ICASSP.2008.4517549.

2008 | Conference Paper | LibreCat-ID: 11939 | OA
Windmann, Stefan, and Reinhold Haeb-Umbach. “Modeling the Dynamics of Speech and Noise for Speech Feature Enhancement in ASR.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 2008, pp. 4409–12, doi:10.1109/ICASSP.2008.4518633.

2008 | Conference Paper | LibreCat-ID: 17278
Lohse, Manja, et al. “‘Try Something Else!’: When Users Change Their Discursive Behavior in Human-Robot Interaction.” 2008, pp. 3481–86, doi:10.1109/ROBOT.2008.4543743.

2006 | Conference Paper | LibreCat-ID: 11824 | OA
Ion, Valentin, and Reinhold Haeb-Umbach. “An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006), vol. 1, 2006, p. I, doi:10.1109/ICASSP.2006.1659984.

2006 | Journal Article | LibreCat-ID: 11825 | OA
Ion, Valentin, and Reinhold Haeb-Umbach. “Uncertainty Decoding for Distributed Speech Recognition over Error-Prone Networks.” Speech Communication, vol. 48, no. 11, 2006, pp. 1435–46, doi:10.1016/j.specom.2006.03.007.

2006 | Conference Paper | LibreCat-ID: 11943 | OA
Windmann, Stefan, and Reinhold Haeb-Umbach. “Iterative Speech Enhancement Using a Non-Linear Dynamic State Model of Speech and Its Parameters.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006), vol. 1, 2006, p. I, doi:10.1109/ICASSP.2006.1660058.

2005 | Conference Paper | LibreCat-ID: 11828 | OA
Ion, Valentin, and Reinhold Haeb-Umbach. “A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005), vol. 1, 2005, pp. 333–36, doi:10.1109/ICASSP.2005.1415118.

2004 | Conference Paper | LibreCat-ID: 11931 | OA
Warsitz, Ernst, and Reinhold Haeb-Umbach. “Robust Speaker Direction Estimation with Particle Filtering.” IEEE Workshop on Multimedia Signal Processing (MMSP 2004), 2004, pp. 367–70, doi:10.1109/MMSP.2004.1436569.

2004 | Conference Paper | LibreCat-ID: 39053
Müller, Wolfgang, et al. “Interactive Multimodal User Interfaces for Mobile Devices.” Proceedings of HICSS-37, 2004, doi:10.1109/HICSS.2004.1265674.

2001 | Journal Article | LibreCat-ID: 11778 | OA
Haeb-Umbach, Reinhold. “Automatic Generation of Phonetic Regression Class Trees for MLLR Adaptation.” IEEE Transactions on Speech and Audio Processing, vol. 9, no. 3, 2001, pp. 299–302, doi:10.1109/89.906003.

2000 | Master's Thesis | LibreCat-ID: 2433
Plessl, Christian, and Simon Maurer. Hardware/Software Codesign in Speech Compression Applications. Computer Engineering and Networks Lab, ETH Zurich, Switzerland, 2000.

2000 | Conference Paper | LibreCat-ID: 11869 | OA
Lieb, M., and Reinhold Haeb-Umbach. “LDA Derived Cepstral Trajectory Filters in Adverse Environmental Conditions.” IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), vol. 2, 2000, pp. II1105–II1108, doi:10.1109/ICASSP.2000.859157.

Filters and Search Terms

keyword="Speech"
