LibreCat – Publication List Manager

39 Publications

2023 | Journal Article | LibreCat-ID: 35602

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629

2023 | Conference Paper | LibreCat-ID: 48275

von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.

2021 | Conference Paper | LibreCat-ID: 26770

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177

2020 | Conference Paper | LibreCat-ID: 20504

Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020 Virtual Barcelona Spain.

2020 | Conference Paper | LibreCat-ID: 20505

Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020 Virtual Shanghai China.

2018 | Conference Paper | LibreCat-ID: 17557

Abramov, O., Kopp, S., Nemeth, A., Kern, F., Mertens, U., & Rohlfing, K. (2018). Towards a Computational Model of Child Gesture-Speech Production. KOGWIS2018: Computational Approaches to Cognitive Science.

2018 | Conference Paper | LibreCat-ID: 17179

2015 | Conference Paper | LibreCat-ID: 11739

Chinaev, A., & Haeb-Umbach, R. (2015). On Optimal Smoothing in Minimum Statistics Based Noise Tracking. In Interspeech 2015 (pp. 1785–1789).

2015 | Conference Paper | LibreCat-ID: 11813

Heymann, J., Haeb-Umbach, R., Golik, P., & Schlueter, R. (2015). Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant ASR under mismatch conditions. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on (pp. 5053–5057). https://doi.org/10.1109/ICASSP.2015.7178933

2014 | Conference Paper | LibreCat-ID: 11753

Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014) (pp. 213–217).

2014 | Journal Article | LibreCat-ID: 11861

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2014). A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(1), 95–109. https://doi.org/10.1109/TASLP.2013.2285480

2014 | Journal Article | LibreCat-ID: 11867

Li, J., Deng, L., Gong, Y., & Haeb-Umbach, R. (2014). An Overview of Noise-Robust Automatic Speech Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(4), 745–777. https://doi.org/10.1109/TASLP.2014.2304637

2013 | Conference Paper | LibreCat-ID: 11716

Abdelaziz, A. H., Zeiler, S., Kolossa, D., Leutnant, V., & Haeb-Umbach, R. (2013). GMM-based significance decoding. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 6827–6831). https://doi.org/10.1109/ICASSP.2013.6638984

2013 | Conference Paper | LibreCat-ID: 11841

Kinoshita, K., Delcroix, M., Yoshioka, T., Nakatani, T., Habets, E., Haeb-Umbach, R., … Raj, B. (2013). The reverb challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 22–23).

2013 | Journal Article | LibreCat-ID: 11862

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2013). Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 21(8), 1640–1652. https://doi.org/10.1109/TASL.2013.2258013

2013 | Conference Paper | LibreCat-ID: 11917

Tran Vu, D. H., & Haeb-Umbach, R. (2013). Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 863–867). https://doi.org/10.1109/ICASSP.2013.6637771

2012 | Conference Paper | LibreCat-ID: 11745

Chinaev, A., Krueger, A., Tran Vu, D. H., & Haeb-Umbach, R. (2012). Improved Noise Power Spectral Density Tracking by a MAP-based Postprocessor. In 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012).

2012 | Conference Paper | LibreCat-ID: 11864

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on.

2011 | Journal Article | LibreCat-ID: 11850

Krueger, A., Warsitz, E., & Haeb-Umbach, R. (2011). Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. IEEE Transactions on Audio, Speech, and Language Processing, 19(1), 206–219. https://doi.org/10.1109/TASL.2010.2047324

2011 | Journal Article | LibreCat-ID: 17233

Fischer, K., Foth, K., Rohlfing, K., & Wrede, B. (2011). Mindful tutors: Linguistic choice and action demonstration in speech to infants and a simulated robot. Interaction Studies, 12(1), 134–161. https://doi.org/10.1075/is.12.1.06fis

2010 | Journal Article | LibreCat-ID: 11846

Krueger, A., & Haeb-Umbach, R. (2010). Model-Based Feature Enhancement for Reverberant Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 18(7), 1692–1707. https://doi.org/10.1109/TASL.2010.2049684

2010 | Conference Paper | LibreCat-ID: 11913

Tran Vu, D. H., & Haeb-Umbach, R. (2010). Blind speech separation employing directional statistics in an Expectation Maximization framework. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010) (pp. 241–244). https://doi.org/10.1109/ICASSP.2010.5495994

2010 | Journal Article | LibreCat-ID: 11892

Schmalenstroeer, J., & Haeb-Umbach, R. (2010). Online Diarization of Streaming Audio-Visual Data for Smart Environments. IEEE Journal of Selected Topics in Signal Processing, 4(5), 845–856. https://doi.org/10.1109/JSTSP.2010.2050519

2009 | Journal Article | LibreCat-ID: 11937

Windmann, S., & Haeb-Umbach, R. (2009). Approaches to Iterative Speech Feature Enhancement and Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(5), 974–984. https://doi.org/10.1109/TASL.2009.2014894

2009 | Journal Article | LibreCat-ID: 11938

Windmann, S., & Haeb-Umbach, R. (2009). Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(8), 1577–1590. https://doi.org/10.1109/TASL.2009.2023172

2009 | Conference Paper | LibreCat-ID: 17272

Vollmer, A.-L., Lohan, K. S., Fischer, K., Nagai, Y., Pitsch, K., Fritsch, J., Rohlfing, K., & Wrede, B. (2009). People modify their tutoring behavior in robot-directed interaction for action learning. Development and Learning, 2009. ICDL 2009. IEEE 8th International Conference on Development and Learning, 1–6. https://doi.org/10.1109/DEVLRN.2009.5175516

2008 | Journal Article | LibreCat-ID: 11820

Ion, V., & Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1047–1060. https://doi.org/10.1109/TASL.2008.925879

2008 | Conference Paper | LibreCat-ID: 11935

Warsitz, E., Krueger, A., & Haeb-Umbach, R. (2008). Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 73–76). https://doi.org/10.1109/ICASSP.2008.4517549

2008 | Conference Paper | LibreCat-ID: 11939

Windmann, S., & Haeb-Umbach, R. (2008). Modeling the dynamics of speech and noise for speech feature enhancement in ASR. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 4409–4412). https://doi.org/10.1109/ICASSP.2008.4518633

2008 | Conference Paper | LibreCat-ID: 17278

Lohse, M., Rohlfing, K., Wrede, B., & Sagerer, G. (2008). “Try something else!” — When users change their discursive behavior in human-robot interaction. 3481–3486. https://doi.org/10.1109/ROBOT.2008.4543743

2006 | Conference Paper | LibreCat-ID: 11824

Ion, V., & Haeb-Umbach, R. (2006). An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1659984

2006 | Journal Article | LibreCat-ID: 11825

Ion, V., & Haeb-Umbach, R. (2006). Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Communication, 48(11), 1435–1446. https://doi.org/10.1016/j.specom.2006.03.007

2006 | Conference Paper | LibreCat-ID: 11943

Windmann, S., & Haeb-Umbach, R. (2006). Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1660058

2005 | Conference Paper | LibreCat-ID: 11828

Ion, V., & Haeb-Umbach, R. (2005). A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005) (Vol. 1, pp. 333–336). https://doi.org/10.1109/ICASSP.2005.1415118

2004 | Conference Paper | LibreCat-ID: 11931

Warsitz, E., & Haeb-Umbach, R. (2004). Robust speaker direction estimation with particle filtering. In IEEE Workshop on Multimedia Signal Processing (MMSP 2004) (pp. 367–370). https://doi.org/10.1109/MMSP.2004.1436569

2004 | Conference Paper | LibreCat-ID: 39053

Müller, W., Schäfer, R., & Bleul, S. (2004). Interactive Multimodal User Interfaces for Mobile Devices. Proceedings of HICSS-37. 37th Annual Hawaii International Conference on System Sciences, Waikoloa, HI, USA. https://doi.org/10.1109/HICSS.2004.1265674

2001 | Journal Article | LibreCat-ID: 11778

Haeb-Umbach, R. (2001). Automatic generation of phonetic regression class trees for MLLR adaptation. IEEE Transactions on Speech and Audio Processing, 9(3), 299–302. https://doi.org/10.1109/89.906003

2000 | Master's Thesis | LibreCat-ID: 2433

Plessl, C., & Maurer, S. (2000). Hardware/Software Codesign in Speech Compression Applications. Computer Engineering and Networks Lab, ETH Zurich, Switzerland.

2000 | Conference Paper | LibreCat-ID: 11869

Lieb, M., & Haeb-Umbach, R. (2000). LDA derived cepstral trajectory filters in adverse environmental conditions. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000) (Vol. 2, pp. II1105–II1108). https://doi.org/10.1109/ICASSP.2000.859157

Publications at Paderborn University
