A Wavenet for speech denoising D Rethage, J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 543 | 2018 |
Fsd50k: an open dataset of human-labeled sound events E Fonseca, X Favory, J Pons, F Font, X Serra IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 829-852, 2021 | 450 | 2021 |
Freesound Datasets: a platform for the creation of open audio datasets E Fonseca, J Pons, X Favory, F Font, D Bogdanov, A Ferraro, S Oramas, ... International Society for Music Information Retrieval Conference (ISMIR), 2017 | 269 | 2017 |
End-to-end learning for music audio tagging at scale J Pons, O Nieto, M Prockup, E Schmidt, A Ehmann, X Serra International Society for Music Information Retrieval Conference (ISMIR), 2018 | 251 | 2018 |
General-purpose tagging of freesound audio with audioset labels: Task description, dataset, and baseline E Fonseca, M Plakal, F Font, DPW Ellis, X Favory, J Pons, X Serra DCASE Workshop, 2018 | 202 | 2018 |
Experimenting with musically motivated convolutional neural networks J Pons, T Lidy, X Serra International Workshop on Content-Based Multimedia Indexing (CBMI), 1-6, 2016 | 201 | 2016 |
Timbre analysis of music audio signals with convolutional neural networks J Pons, O Slizovskaia, E Gómez Gutiérrez, X Serra European Signal Processing Conference (EUSIPCO), 2813-7, 2017 | 164 | 2017 |
Randomly weighted CNNs for (music) audio classification J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 119 | 2019 |
MusiCNN: pre-trained convolutional neural networks for music audio tagging J Pons, X Serra Late breaking/demo session of the International Society for Music …, 2019 | 113 | 2019 |
End-to-end music source separation: Is it possible in the waveform domain? F Lluís, J Pons, X Serra arXiv preprint arXiv:1810.12187, 2018 | 91 | 2018 |
Universal speech enhancement with score-based diffusion J Serrà, S Pascual, J Pons, RO Araz, D Scaini arXiv preprint arXiv:2206.03065, 2022 | 88 | 2022 |
Designing efficient architectures for modeling temporal features with convolutional neural networks J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 | 85 | 2017 |
Training neural audio classifiers with few data J Pons, J Serrà, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 77 | 2019 |
Upsampling artifacts in neural audio synthesis J Pons, S Pascual, G Cengarle, J Serrà ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 70 | 2021 |
Remixing music using source separation algorithms to improve the musical experience of cochlear implant users J Pons, J Janer, T Rode, W Nogueira The Journal of the Acoustical Society of America 140 (6), 4338-4349, 2016 | 69 | 2016 |
Automatic multitrack mixing with a differentiable mixing console of neural audio effects CJ Steinmetz, J Pons, S Pascual, J Serra ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 56 | 2021 |
An empirical study of Conv-TasNet B Kadioglu, M Horgan, X Liu, J Pons, D Darcy, V Kumar International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020 | 53* | 2020 |
SESQA: semi-supervised learning for speech quality assessment J Serrà, J Pons, S Pascual ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 52 | 2021 |
On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence A Roebel, J Pons, M Liuni, M Lagrange International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015 | 43 | 2015 |
Fast timing-conditioned latent audio diffusion Z Evans, CJ Carr, J Taylor, SH Hawley, J Pons arXiv preprint arXiv:2402.04825, 2024 | 42 | 2024 |