Florian Metze

Cited by

	All	Since 2019
Citations	11413	6500
h-index	55	38
i10-index	194	120

Citations per year

Year	Citations
2002	58
2003	88
2004	99
2005	107
2006	133
2007	116
2008	106
2009	137
2010	148
2011	176
2012	227
2013	350
2014	468
2015	571
2016	649
2017	642
2018	744
2019	852
2020	897
2021	1081
2022	1488
2023	1675
2024	497

Public access

45 articles available, 0 articles not available (based on funding mandates)

Co-authors

Alexander Waibel - Carnegie Mellon, KIT, Karlsruhe Institute of Technology, University of Karlsruhe - Verified email at cs.cmu.edu
Yajie Miao - Carnegie Mellon University - Verified email at cs.cmu.edu
Tanja Schultz - Professor of Computer Science, University Bremen - Verified email at uni-bremen.de
Billy li (Juncheng) - Carnegie Mellon University - Verified email at cs.cmu.edu
Alan W Black - Professor, Language Technologies Institute, Carnegie Mellon University - Verified email at cs.cmu.edu
Shruti Palaskar - Apple - Verified email at apple.com
Ramon Sanabria - The University of Edinburgh - Verified email at ed.ac.uk
Hagen Soltau - Google DeepMind - Verified email at google.com
Siddharth Dalmia - Research Scientist, Google DeepMind - Verified email at google.com
Tim Polzehl - German Research Center for Artificial Intelligence - Verified email at dfki.de
Po-Yao (Bernie) Huang - FAIR, Meta - Verified email at fb.com
Xinjian Li - Google - Verified email at google.com
Alex Hauptmann - Carnegie Mellon University - Verified email at cs.cmu.edu
Yun Wang (Maigo) - Research Scientist at Facebook - Verified email at fb.com
Shourabh Rawat - Carnegie Mellon University (CMU) - Verified email at cs.cmu.edu
Shuhui Qu - Stanford University - Verified email at stanford.edu
Xavier Anguera - ELSA Corp. - Verified email at elsanow.io
Sebastian Stüker - Zoom Video Communications Inc. - Verified email at kit.edu
Thomas Schaaf - Carnegie Mellon University - Verified email at cs.cmu.edu
laurent besacier - Professor in Computer Science - Verified email at imag.fr

Florian Metze

Carnegie Mellon University; Meta AI

Verified email at andrew.cmu.edu

speech recognition, video understanding


Title	Authors	Venue	Cited by	Year
EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding	Y Miao, M Gowayyed, F Metze	2015 IEEE workshop on automatic speech recognition and understanding (ASRU …, 2015	932	2015
Videoclip: Contrastive pre-training for zero-shot video-text understanding	H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ...	arXiv preprint arXiv:2109.14084, 2021	397	2021
Extracting deep bottleneck features using stacked auto-encoders	J Gehring, Y Miao, F Metze, A Waibel	2013 IEEE international conference on acoustics, speech and signal …, 2013	373	2013
Learning joint embedding with multimodal cues for cross-modal video-text retrieval	NC Mithun, J Li, F Metze, AK Roy-Chowdhury	Proceedings of the 2018 ACM on international conference on multimedia …, 2018	272	2018
How2: a large-scale dataset for multimodal language understanding	R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ...	arXiv preprint arXiv:1811.00347, 2018	257	2018
A one-pass decoder based on polymorphic linguistic context assignment	H Soltau, F Metze, C Fugen, A Waibel	IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU …, 2001	249	2001
Support-set bottlenecks for video-text representation learning	M Patrick, PY Huang, Y Asano, F Metze, A Hauptmann, J Henriques, ...	arXiv preprint arXiv:2010.02824, 2020	247	2020
Keeping your eye on the ball: Trajectory attention in video transformers	M Patrick, D Campbell, Y Asano, I Misra, F Metze, C Feichtenhofer, ...	Advances in neural information processing systems 34, 12493-12506, 2021	221	2021
Comparison of four approaches to age and gender recognition for telephone applications	F Metze, J Ajmera, R Englert, U Bub, F Burkhardt, J Stegmann, C Muller, ...	2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007	198	2007
A comparison of five multiple instance learning pooling functions for sound event detection with weak labeling	Y Wang, J Li, F Metze	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	196	2019
Advances in automatic meeting record creation and access	A Waibel, M Bett, F Metze, K Ries, T Schaaf, T Schultz, H Soltau, H Yu, ...	2001 IEEE International Conference on Acoustics, Speech, and Signal …, 2001	181	2001
Session independent non-audible speech recognition using surface electromyography	L Maier-Hein, F Metze, T Schultz, A Waibel	IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 331-336, 2005	165	2005
A comparison of deep learning methods for environmental sound detection	J Li, W Dai, F Metze, S Qu, S Das	2017 IEEE International conference on acoustics, speech and signal …, 2017	163	2017
Masked autoencoders that listen	PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ...	Advances in Neural Information Processing Systems 35, 28708-28720, 2022	161	2022
Speaker adaptive training of deep neural network acoustic models using i-vectors	Y Miao, H Zhang, F Metze	IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (11 …, 2015	143	2015
How2sign: a large-scale multimodal dataset for continuous american sign language	A Duarte, S Palaskar, L Ventura, D Ghadiyaram, K DeHaan, F Metze, ...	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	136	2021
Deep maxout networks for low-resource speech recognition	Y Miao, F Metze, S Rawat	2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 398-403, 2013	126	2013
Anger recognition in speech using acoustic and linguistic cues	T Polzehl, A Schmitt, F Metze, M Wagner	Speech Communication 53 (9-10), 1198-1209, 2011	124	2011
Effective dimensionality reduction for word embeddings	V Raunak, V Gupta, F Metze	Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP …, 2019	120	2019
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition	A Jansen, E Dupoux, S Goldwater, M Johnson, S Khudanpur, K Church, ...	2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	119	2013
