Wei-Ning Hsu
Facebook AI Research (FAIR)
Verified email at csail.mit.edu
Title
Cited by
Year
Unsupervised learning of disentangled and interpretable representations from sequential data
WN Hsu, Y Zhang, J Glass
Thirty-first Conference on Neural Information Processing Systems (NeurIPS), 2017
Cited by: 306; Year: 2017
An unsupervised autoregressive model for speech representation learning
YA Chung, WN Hsu, H Tang, J Glass
INTERSPEECH, 2019
Cited by: 253; Year: 2019
HuBERT: Self-supervised speech representation learning by masked prediction of hidden units
WN Hsu, B Bolte, YHH Tsai, K Lakhotia, R Salakhutdinov, A Mohamed
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3451-3460, 2021
Cited by: 242; Year: 2021
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
Seventh International Conference on Learning Representations (ICLR), 2019
Cited by: 174; Year: 2019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
Cited by: 143; Year: 2019
Learning Latent Representations for Speech Generation and Transformation
WN Hsu, Y Zhang, J Glass
INTERSPEECH, 1273-1277, 2017
Cited by: 142; Year: 2017
Active learning by learning
WN Hsu, HT Lin
Twenty-Ninth AAAI conference on artificial intelligence, 2015
Cited by: 124; Year: 2015
Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation
WN Hsu, Y Zhang, J Glass
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16-23, 2017
Cited by: 116; Year: 2017
Semi-supervised training for improving data efficiency in end-to-end speech synthesis
YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 100; Year: 2019
Disentangling correlated speaker and noise for speech synthesis via data augmentation and adversarial factorization
WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 94; Year: 2019
Unsupervised speech recognition
A Baevski, WN Hsu, A Conneau, M Auli
Advances in Neural Information Processing Systems 34, 27826-27839, 2021
Cited by: 83; Year: 2021
Multi-channel speech recognition: LSTMs all the way through
H Erdogan, T Hayashi, JR Hershey, T Hori, C Hori, WN Hsu, S Kim, ...
CHiME-4 workshop, 1-4, 2016
Cited by: 77; Year: 2016
Data2vec: A general framework for self-supervised learning in speech, vision and language
A Baevski, WN Hsu, Q Xu, A Babu, J Gu, M Auli
arXiv preprint arXiv:2202.03555, 2022
Cited by: 72; Year: 2022
HuBERT: How much can a bad teacher benefit ASR pre-training?
WN Hsu, YHH Tsai, B Bolte, R Salakhutdinov, A Mohamed
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 68; Year: 2021
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech
D Harwath, WN Hsu, J Glass
Eighth International Conference on Learning Representations (ICLR), 2020
Cited by: 68; Year: 2020
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
WN Hsu, A Sriram, A Baevski, T Likhomanenko, Q Xu, V Pratap, J Kahn, ...
INTERSPEECH, 2021
Cited by: 57; Year: 2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ...
INTERSPEECH, 2021
Cited by: 52; Year: 2021
On generative spoken language modeling from raw audio
K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ...
Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021
Cited by: 50; Year: 2021
Neural attention for learning to rank questions in community question answering
S Romeo, G Da San Martino, A Barrón-Cedeño, A Moschitti, Y Belinkov, ...
Proceedings of COLING 2016, the 26th International Conference on …, 2016
Cited by: 42; Year: 2016
Extracting domain invariant features by unsupervised learning for robust automatic speech recognition
WN Hsu, J Glass
2018 IEEE international conference on acoustics, speech and signal …, 2018
Cited by: 38; Year: 2018