Takuya Higuchi
Title
Cited by
Year
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices
T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
269 2015
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise
T Higuchi, N Ito, T Yoshioka, T Nakatani
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
267 2016
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR
T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017
128 2017
Speaker-aware neural network based beamformer for speaker extraction in speech mixtures
K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani
Proc. Interspeech 2017, 2655-2659, 2017
123 2017
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming
T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
64 2018
Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming
T Nakatani, N Ito, T Higuchi, S Araki, K Kinoshita
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
64 2017
Learning speaker representation for neural network based multichannel speaker extraction
K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 8-15, 2017
55 2017
Deep clustering-based beamforming for separation with unknown number of sources
T Higuchi, K Kinoshita, M Delcroix, K Žmolíková, T Nakatani
Proc. Interspeech 2017, 1183-1187, 2017
45 2017
Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition
S Araki, M Okada, T Higuchi, A Ogawa, T Nakatani
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
40 2016
Student's t multichannel nonnegative matrix factorization for blind source separation
K Kitamura, Y Bando, K Itoyama, K Yoshii
2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 1-5, 2016
31 2016
Underdetermined blind separation and tracking of moving sources based on DOA-HMM
T Higuchi, N Takamune, T Nakamura, H Kameoka
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
31 2014
Adversarial training for data-driven speech enhancement without parallel corpus
T Higuchi, K Kinoshita, M Delcroix, T Nakatani
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 40-47, 2017
30 2017
Stacked 1D convolutional networks for end-to-end small footprint voice trigger detection
T Higuchi, M Ghasemzadeh, K You, C Dhir
arXiv preprint arXiv:2008.03405, 2020
23 2020
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming
S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ...
2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017
20 2017
Organic EL display device
Y Sato, T Higuchi
US Patent 7,027,014, 2006
18 2006
Non-contact data carrier
T Higuchi
US Patent 6,600,219, 2003
16 2003
Unified approach for audio source separation with multichannel factorial HMM and DOA mixture model
T Higuchi, H Kameoka
2015 23rd European Signal Processing Conference (EUSIPCO), 2043-2047, 2015
15 2015
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
J Jung, W Zhang, J Shi, Z Aldeneh, T Higuchi, BJ Theobald, AH Abdelaziz, ...
arXiv preprint arXiv:2401.17230, 2024
14 2024
A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models
T Higuchi, H Takeda, T Nakamura, H Kameoka
Fifteenth Annual Conference of the International Speech Communication …, 2014
14 2014
Optimization of speaker-aware multichannel speech extraction with ASR criterion
K Zmolikova, M Delcroix, K Kinoshita, T Higuchi, T Nakatani, J Černocký
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
13 2018