The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 269 | 2015 |
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise T Higuchi, N Ito, T Yoshioka, T Nakatani 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 267 | 2016 |
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017 | 128 | 2017 |
Speaker-aware neural network based beamformer for speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani Proc. Interspeech 2017, 2655-2659, 2017 | 123 | 2017 |
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 64 | 2018 |
Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming T Nakatani, N Ito, T Higuchi, S Araki, K Kinoshita 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 64 | 2017 |
Learning speaker representation for neural network based multichannel speaker extraction K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 8-15, 2017 | 55 | 2017 |
Deep clustering-based beamforming for separation with unknown number of sources T Higuchi, K Kinoshita, M Delcroix, K Žmolíková, T Nakatani Proc. Interspeech 2017, 1183-1187, 2017 | 45 | 2017 |
Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition S Araki, M Okada, T Higuchi, A Ogawa, T Nakatani 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 40 | 2016 |
Student's t multichannel nonnegative matrix factorization for blind source separation K Kitamura, Y Bando, K Itoyama, K Yoshii 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 1-5, 2016 | 31 | 2016 |
Underdetermined blind separation and tracking of moving sources based ONDOA-HMM T Higuchi, N Takamune, T Nakamura, H Kameoka 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 31 | 2014 |
Adversarial training for data-driven speech enhancement without parallel corpus T Higuchi, K Kinoshita, M Delcroix, T Nakatani 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 40-47, 2017 | 30 | 2017 |
Stacked 1D convolutional networks for end-to-end small footprint voice trigger detection T Higuchi, M Ghasemzadeh, K You, C Dhir arXiv preprint arXiv:2008.03405, 2020 | 23 | 2020 |
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ... 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017 | 20 | 2017 |
Organic EL display device Y Sato, T Higuchi US Patent 7,027,014, 2006 | 18 | 2006 |
Non-contact data carrier T Higuchi US Patent 6,600,219, 2003 | 16 | 2003 |
Unified approach for audio source separation with multichannel factorial HMM and DOA mixture model T Higuchi, H Kameoka 2015 23rd European Signal Processing Conference (EUSIPCO), 2043-2047, 2015 | 15 | 2015 |
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models J Jung, W Zhang, J Shi, Z Aldeneh, T Higuchi, BJ Theobald, AH Abdelaziz, ... arXiv preprint arXiv:2401.17230, 2024 | 14 | 2024 |
A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models T Higuchi, H Takeda, T Nakamura, H Kameoka Fifteenth Annual Conference of the International Speech Communication …, 2014 | 14 | 2014 |
Optimization of speaker-aware multichannel speech extraction with asr criterion K Zmolikova, M Delcroix, K Kinoshita, T Higuchi, T Nakatani, J Černocký 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 13 | 2018 |