3 Publications
2020 | Conference Paper | LibreCat-ID: 20766
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation
K. Kinoshita, T.C. von Neumann, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: Proc. Interspeech 2020, 2020, pp. 2652–2656.
LibreCat
| Files available
| DOI
K. Kinoshita, T.C. von Neumann, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: Proc. Interspeech 2020, 2020, pp. 2652–2656.
2020 | Conference Paper | LibreCat-ID: 20762
End-to-End Training of Time Domain Audio Separation and Recognition
T.C. von Neumann, K. Kinoshita, L. Drude, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008.
LibreCat
| Files available
| DOI
T.C. von Neumann, K. Kinoshita, L. Drude, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008.
2020 | Conference Paper | LibreCat-ID: 20764
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR
T.C. von Neumann, C. Boeddeker, L. Drude, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, T. von Neuann, in: Proc. Interspeech 2020, 2020, pp. 3097–3101.
LibreCat
| Files available
| DOI
T.C. von Neumann, C. Boeddeker, L. Drude, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, T. von Neuann, in: Proc. Interspeech 2020, 2020, pp. 3097–3101.
3 Publications
2020 | Conference Paper | LibreCat-ID: 20766
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation
K. Kinoshita, T.C. von Neumann, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: Proc. Interspeech 2020, 2020, pp. 2652–2656.
LibreCat
| Files available
| DOI
K. Kinoshita, T.C. von Neumann, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: Proc. Interspeech 2020, 2020, pp. 2652–2656.
2020 | Conference Paper | LibreCat-ID: 20762
End-to-End Training of Time Domain Audio Separation and Recognition
T.C. von Neumann, K. Kinoshita, L. Drude, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008.
LibreCat
| Files available
| DOI
T.C. von Neumann, K. Kinoshita, L. Drude, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008.
2020 | Conference Paper | LibreCat-ID: 20764
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR
T.C. von Neumann, C. Boeddeker, L. Drude, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, T. von Neuann, in: Proc. Interspeech 2020, 2020, pp. 3097–3101.
LibreCat
| Files available
| DOI
T.C. von Neumann, C. Boeddeker, L. Drude, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, T. von Neuann, in: Proc. Interspeech 2020, 2020, pp. 3097–3101.