Matt Shannon
Verified email at google.com
Title
Cited by
Year
Acoustic Modeling for Google Home.
B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ...
Interspeech, 399-403, 2017
Cited by: 201; Year: 2017
Recurrent neural aligner: An encoder-decoder neural network model for sequence to sequence mapping.
H Sak, M Shannon, K Rao, F Beaufays
Interspeech 8, 1298-1302, 2017
Cited by: 143; Year: 2017
Location-relative attention mechanisms for robust long-form speech synthesis
E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 122; Year: 2020
Autoregressive models for statistical parametric speech synthesis
M Shannon, H Zen, W Byrne
IEEE transactions on audio, speech, and language processing 21 (3), 587-597, 2012
Cited by: 82; Year: 2012
Optimizing expected word error rate via sampling for speech recognition
M Shannon
arXiv preprint arXiv:1706.02776, 2017
Cited by: 62; Year: 2017
Semi-supervised generative modeling for controllable speech synthesis
R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ...
arXiv preprint arXiv:1910.01709, 2019
Cited by: 57; Year: 2019
Effective use of variational embedding capacity in expressive end-to-end speech synthesis
E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ...
arXiv preprint arXiv:1906.03402, 2019
Cited by: 56; Year: 2019
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech.
GE Henter, T Merritt, M Shannon, C Mayo, S King
Interspeech, 1504-1508, 2014
Cited by: 49; Year: 2014
Improved End-of-Query Detection for Streaming Speech Recognition.
M Shannon, G Simko, SY Chang, C Parada
Interspeech, 1909-1913, 2017
Cited by: 47; Year: 2017
Speaker generation
D Stanton, M Shannon, S Mariooryad, RJ Skerry-Ryan, E Battenberg, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 29; Year: 2022
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project
M Wester, J Dines, M Gibson, H Liang, YJ Wu, L Saheer, S King, K Oura, ...
Proceedings of the 7th ISCA Speech Synthesis Workshop, 2010
Cited by: 28; Year: 2010
The effect of using normalized models in statistical speech synthesis
M Shannon, H Zen, W Byrne
ISCA (International Speech Communication Association), 2011
Cited by: 22; Year: 2011
Non-saturating GAN training as divergence minimization
M Shannon, B Poole, S Mariooryad, T Bagby, E Battenberg, D Kao, ...
arXiv preprint arXiv:2010.08029, 2020
Cited by: 18; Year: 2020
Personalising speech-to-speech translation in the EMIME project
M Kurimo, B Byrne, J Dines, PN Garner, M Gibson, Y Guan, T Hirsimäki, ...
Proceedings of the ACL 2010 System Demonstrations, 48-53, 2010
Cited by: 17; Year: 2010
Fast, low-artifact speech synthesis considering global variance
M Shannon, W Byrne
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 16; Year: 2013
Autoregressive HMMs for speech synthesis
SM Shannon, WJ Byrne
ISCA (International Speech Communication Association), 2009
Cited by: 16; Year: 2009
A formulation of the autoregressive HMM for speech synthesis
M Shannon, W Byrne
Department of Engineering, University of Cambridge, 2009
Cited by: 13; Year: 2009
Autoregressive clustering for HMM speech synthesis
M Shannon, W Byrne
Proc. Interspeech, 2011
Cited by: 8; Year: 2011
Global normalization for streaming speech recognition in a modular framework
E Variani, K Wu, MD Riley, D Rybach, M Shannon, C Allauzen
Advances in Neural Information Processing Systems 35, 4257-4269, 2022
Cited by: 6; Year: 2022
Properties of f-divergences and f-GAN training
M Shannon
arXiv preprint arXiv:2009.00757, 2020
Cited by: 6; Year: 2020