Volgen
Woo Hyun (Woohyun) Kang
Woo Hyun (Woohyun) Kang
Amazon Web Services (AWS)
Geverifieerd e-mailadres voor amazon.com - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Softflow: Probabilistic framework for normalizing flow on manifolds
H Kim, H Lee, WH Kang, JY Lee, NS Kim
Advances in Neural Information Processing Systems 33, 16388-16397, 2020
1212020
A multi-resolution approach to GAN-based speech enhancement
HY Kim, JW Yoon, SJ Cheon, WH Kang, NS Kim
Applied Sciences 11 (2), 721, 2021
282021
Unsupervised representation learning for speaker recognition via contrastive equilibrium learning
SH Mun, WH Kang, MH Han, NS Kim
arXiv preprint arXiv:2010.11433, 2020
262020
Two-stage noise aware training using asymmetric deep denoising autoencoder
KH Lee, SJ Kang, WH Kang, NS Kim
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
252016
CRIM’s system description for the ASVSpoof2021 challenge
WH Kang, J Alam, A Fathan
Proc. ASVspoof 2021 Workshop, 100-106, 2021
232021
Disentangled speaker and nuisance attribute embedding for robust speaker verification
WH Kang, SH Mun, MH Han, NS Kim
IEEE Access 8, 141838-141849, 2020
222020
Text-independent speaker verification employing CNN-LSTM-TDNN hybrid networks
J Alam, A Fathan, WH Kang
Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021
182021
Investigation on activation functions for robust end-to-end spoofing attack detection system
WH Kang, J Alam, A Fathan
Proc. 2021 Edition of the Automatic Speaker Verification and Spoofing …, 2021
172021
On the impact of the quality of pseudo-labels on the self-supervised speaker verification task
A Fathan, J Alam, W Kang
NeurIPS ENLSP Workshop, 2022
132022
WaveNODE: A continuous normalizing flow for speech synthesis
H Kim, H Lee, WH Kang, SJ Cheon, BJ Choi, NS Kim
arXiv preprint arXiv:2006.04598, 2020
132020
Mel-spectrogram image-based end-to-end audio deepfake detection under channel-mismatched conditions
A Fathan, J Alam, WH Kang
2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022
122022
Hybrid network with multi-level global-local statistics pooling for robust text-independent speaker recognition
WH Kang, J Alam, A Fathan
Proc. of Automatic Speech Recognition and Understanding (ASRU), 2021
112021
Real-time automatic word segmentation for user-generated text
WI Cho, SJ Cheon, WH Kang, JW Kim, NS Kim
arXiv preprint arXiv:1810.13113, 2018
112018
An analytic study on clustering-based pseudo-labels for self-supervised deep speaker verification
WH Kang, J Alam, A Fathan
International Conference on Speech and Computer, 338-348, 2022
82022
End-to-end framework for spoof-aware speaker verification.
WH Kang, J Alam, A Fathan
INTERSPEECH, 4362-4366, 2022
82022
L-mix: A latent-level instance mixup regularization for robust self-supervised speaker representation learning
WH Kang, J Alam, A Fathan
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1263-1272, 2022
82022
Integrated DNN-based model adaptation technique for noise-robust speech recognition
KH Lee, WH Kang, TG Kang, NS Kim
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
82017
Gated recurrent context: Softmax-free attention for online encoder-decoder speech recognition
H Lee, WH Kang, SJ Cheon, H Kim, NS Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 710-719, 2021
72021
Hybrid neural network with cross-and self-module attention pooling for text-independent speaker verification
J Alam, WH Kang, A Fathan
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
62023
Information Preservation Pooling for Speaker Embedding.
MH Han, WH Kang, SH Mun, NS Kim
Odyssey, 60-66, 2020
62020
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20