Volgen
Xubo Liu
Titel
Geciteerd door
Geciteerd door
Jaar
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
H Liu, Z Chen, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley
International Conference on Machine Learning (ICML), 2023, 2023
4072023
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
138*2024
Audio Captioning Transformer
X Mei, X Liu, Q Huang, MD Plumbley, W Wang
Proceedings of the Detection and Classification of Acoustic Scenes and …, 2021
812021
On Metric Learning for Audio-Text Cross-Modal Retrieval
X Mei, X Liu, J Sun, MD Plumbley, W Wang
INTERSPEECH 2022, 2022
642022
Separate What You Describe: Language-Queried Audio Source Separation
X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang
INTERSPEECH 2022, 2022
58*2022
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning
X Liu, T Iqbal, J Zhao, Q Huang, MD Plumbley, W Wang
2021 IEEE 31st International Workshop on Machine Learning for Signal …, 2021
532021
An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning
X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ...
Proceedings of the Detection and Classification of Acoustic Scenes and …, 2021
522021
Automated Audio Captioning: An Overview of Recent Progress and New Challenges
X Mei, X Liu, MD Plumbley, W Wang
EURASIP Journal on Audio, Speech, and Music Processing 2022 (1), 1-18, 2022
482022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
H Liu*, X Liu*, Q Kong, Q Tian, Y Zhao, DL Wang
INTERSPEECH 2022, 2022
46*2022
Separate Anything You Describe
X Liu, Q Kong, Y Zhao, H Liu, Y Yuan, Y Liu, R Xia, Y Wang, MD Plumbley, ...
arXiv preprint arXiv:2308.05037, 2023
422023
Neural Vocoder is All You Need for Speech Super-resolution
H Liu, W Choi, X Liu, Q Kong, Q Tian, DL Wang
INTERSPEECH 2022, 2022
362022
CL4AC: A Contrastive Loss for Audio Captioning
X Liu*, Q Huang*, X Mei, T Ko, HL Tang, MD Plumbley, W Wang
Proceedings of the Detection and Classification of Acoustic Scenes and …, 2021
332021
Leveraging Pre-trained BERT for Audio Captioning
X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kılıç, ...
EUSIPCO 2022, 2022
322022
Diverse Audio Captioning via Adversarial Training
X Mei, X Liu, J Sun, MD Plumbley, W Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
312022
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Q Huang, T Ko, HL Tang, X Liu, B Wu
INTERSPEECH 2021, 2021
262021
WavJourney: Compositional Audio Creation with Large Language Models
X Liu, Z Zhu, H Liu, Y Yuan, M Cui, Q Huang, J Liang, Y Cao, Q Kong, ...
arXiv preprint arXiv:2307.14335, 2023
21*2023
Language-based Audio Retrieval With Pre-trained Models
X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang
DCASE2022 Challenge, Tech. Rep, 2022
212022
Visually-Aware Audio Captioning with Adaptive Audio-Visual Attention
X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ...
INTERSPEECH 2023, 2023
20*2023
An Encoder-decoder Based Audio Captioning System with Transfer and Reinforcement Learning for DCASE Challenge 2021 Task 6
X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ...
DCASE2021 Challenge, Tech. Rep, 2021
19*2021
SynthVSR: Scaling Up Visual Speech Recognition with Synthetic Supervision
X Liu, E Lakomkin, K Vougioukas, P Ma, H Chen, R Xie, M Doulaty, ...
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR …, 2023
162023
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20