Follow
Shota Horiguchi
Shota Horiguchi
Verified email at hitachi.com
Title
Cited by
Cited by
Year
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
6th International Workshop on Speech Processing in Everyday Environments …, 2020
1482020
End-to-End Neural Speaker Diarization with Self-attention
Y Fujita, N Kanda, S Horiguchi, Y Xue, K Nagamatsu, S Watanabe
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 296-303, 2019
1152019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Y Fujita, N Kanda, S Horiguchi, K Nagamatsu, S Watanabe
INTERSPEECH, 4300–4304, 2019
1142019
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu
INTERSPEECH, 269-273, 2020
782020
Significance of Softmax-based Features in Comparison to Distance Metric Learning-based Features
S Horiguchi, D Ikami, K Aizawa
IEEE Transactions on Pattern Analysis and Machine Intelligence 42 (5), 1279-1285, 2020
67*2020
Personalized Classifier for Food Image Recognition
S Horiguchi, S Amano, M Ogawa, K Aizawa
IEEE Transactions on Multimedia 20 (10), 2836-2848, 2018
622018
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
N Kanda, C Boeddeker, J Heitkaemper, Y Fujita, S Horiguchi, ...
INTERSPEECH, 1248-1252, 2019
492019
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays
N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ...
The 5th International Workshop on Speech Processing in Everyday Environments …, 2018
452018
Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches
N Kanda, Y Fujita, S Horiguchi, R Ikeshita, K Nagamatsu, S Watanabe
IEEE International Conference on Acoustics, Speech, and Signal Processing …, 2019
362019
Face-Voice Matching using Cross-modal Embeddings
S Horiguchi, N Kanda, K Nagamatsu
ACM International Conference on Multimedia (ACMMM), 1011-1019, 2018
292018
Neural speaker diarization with speaker-wise chain rule
Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu
arXiv preprint arXiv:2006.01796, 2020
272020
End-to-end neural diarization: Reformulating speaker diarization as simple multi-label classification
Y Fujita, S Watanabe, S Horiguchi, Y Xue, K Nagamatsu
arXiv preprint arXiv:2003.02966, 2020
272020
Omnidirectional Pedestrian Detection by Rotation Invariant Training
M Tamura, S Horiguchi, T Murakami
IEEE Winter Conference on Applications of Computer Vision (WACV), 1989-1998, 2019
232019
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models
N Kanda, S Horiguchi, Y Fujita, Y Xue, K Nagamatsu, S Watanabe
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 31-38, 2019
222019
End-to-End Speaker Diarization as Post-Processing
S Horiguchi, P García, Y Fujita, S Watanabe, K Nagamatsu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
192021
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
N Kanda, S Horiguchi, R Takashima, Y Fujita, K Nagamatsu, S Watanabe
INTERSPEECH, 236-240, 2019
192019
Online end-to-end neural diarization with speaker-tracing buffer
Y Xue, S Horiguchi, Y Fujita, S Watanabe, P García, K Nagamatsu
2021 IEEE Spoken Language Technology Workshop (SLT), 841-848, 2021
172021
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-vector Clustering Systems Combined by DOVER-Lap
S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ...
arXiv preprint arXiv:2102.01363, 2021
152021
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Y Takashima, Y Fujita, S Watanabe, S Horiguchi, P García, K Nagamatsu
IEEE Spoken Language Technology Workshop (SLT), 849-856, 2021
82021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
S Horiguchi, S Watanabe, P Garcia, Y Xue, Y Takashima, Y Kawaguchi
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 98-105, 2022
72022
The system can't perform the operation now. Try again later.
Articles 1–20