Volgen
Xinhao Mei
Xinhao Mei
PhD student, University of Surrey, UK
Geverifieerd e-mailadres voor surrey.ac.uk - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Audioldm: Text-to-audio generation with latent diffusion models
H Liu, Z Chen, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley
arXiv preprint arXiv:2301.12503, 2023
1202023
Audio captioning transformer
X Mei, X Liu, Q Huang, MD Plumbley, W Wang
arXiv preprint arXiv:2107.09817, 2021
532021
Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research
X Mei, C Meng, H Liu, Q Kong, T Ko, C Zhao, MD Plumbley, Y Zou, ...
arXiv preprint arXiv:2303.17395, 2023
42*2023
An encoder-decoder based audio captioning system with transfer and reinforcement learning
X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ...
arXiv preprint arXiv:2108.02752, 2021
402021
On metric learning for audio-text cross-modal retrieval
X Mei, X Liu, J Sun, MD Plumbley, W Wang
arXiv preprint arXiv:2203.15537, 2022
352022
Automated audio captioning: an overview of recent progress and new challenges
X Mei, X Liu, MD Plumbley, W Wang
EURASIP journal on audio, speech, and music processing 2022 (1), 1-18, 2022
262022
CL4AC: A contrastive loss for audio captioning
X Liu, Q Huang, X Mei, T Ko, HL Tang, MD Plumbley, W Wang
arXiv preprint arXiv:2107.09990, 2021
242021
Leveraging pre-trained BERT for audio captioning
X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kilic, ...
2022 30th European Signal Processing Conference (EUSIPCO), 1145-1149, 2022
232022
Diverse audio captioning via adversarial training
X Mei, X Liu, J Sun, MD Plumbley, W Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech andá…, 2022
192022
Separate what you describe: Language-queried audio source separation
X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang
arXiv preprint arXiv:2203.15147, 2022
192022
Language-based audio retrieval with pre-trained models
X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang
Detection and Classification of Acoustic Scenes and Events (DCASE) Challengeá…, 2022
172022
An encoder-decoder based audio captioning system with transfer and reinforcement learning for DCASE challenge 2021 task 6
X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ...
DCASE2021 Challenge, Tech. Rep, Tech. Rep, 2021
152021
AudioLDM 2: Learning holistic audio generation with self-supervised pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, ...
arXiv preprint arXiv:2308.05734, 2023
13*2023
Surrey system for dcase 2022 task 5: Few-shot bioacoustic event detection with segment-level metric learning
H Liu, X Liu, X Mei, Q Kong, W Wang, MD Plumbley
arXiv preprint arXiv:2207.10547, 2022
92022
Simple pooling front-ends for efficient audio classification
X Liu, H Liu, Q Kong, X Mei, MD Plumbley, W Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech andá…, 2023
82023
Visually-aware audio captioning with adaptive audio-visual attention
X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ...
arXiv preprint arXiv:2210.16428, 2022
82022
Deep neural decision forest for acoustic scene classification
J Sun, X Liu, X Mei, J Zhao, MD Plumbley, V Kılıš, W Wang
2022 30th European Signal Processing Conference (EUSIPCO), 772-776, 2022
82022
Automated audio captioning with keywords guidance
X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang
Detection and Classification of Acoustic Scenes and Events (DCASE) Challengeá…, 2022
82022
Segment-level metric learning for few-shot bioacoustic event detection
H Liu, X Liu, X Mei, Q Kong, W Wang, MD Plumbley
arXiv preprint arXiv:2207.07773, 2022
52022
Towards generating diverse audio captions via adversarial training
X Mei, X Liu, J Sun, MD Plumbley, W Wang
arXiv preprint arXiv:2212.02033, 2022
22022
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20