Sayak Ray Chowdhury
Postdoctoral Researcher, Microsoft Research
Verified email at microsoft.com - Homepage
Title
Cited by
Year
On kernelized multi-armed bandits
SR Chowdhury, A Gopalan
International Conference on Machine Learning, 844-853, 2017
Cited by: 421 | Year: 2017
Misspecified linear bandits
A Ghosh, SR Chowdhury, A Gopalan
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Cited by: 62 | Year: 2017
Online learning in kernelized Markov decision processes
SR Chowdhury, A Gopalan
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by: 46 | Year: 2019
Bayesian optimization under heavy-tailed payoffs
S Ray Chowdhury, A Gopalan
Advances in Neural Information Processing Systems 32, 2019
Cited by: 25 | Year: 2019
No-regret algorithms for multi-task Bayesian optimization
SR Chowdhury, A Gopalan
International Conference on Artificial Intelligence and Statistics, 1873-1881, 2021
Cited by: 18 | Year: 2021
Shuffle private linear contextual bandits
SR Chowdhury, X Zhou
International Conference on Machine Learning, 2022
Cited by: 16 | Year: 2022
Bregman deviations of generic exponential families
SR Chowdhury, P Saux, O Maillard, A Gopalan
The Thirty Sixth Annual Conference on Learning Theory, 394-449, 2023
Cited by: 13 | Year: 2023
Differentially private regret minimization in episodic Markov decision processes
SR Chowdhury, X Zhou
Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 6375-6383, 2022
Cited by: 13 | Year: 2022
Distributed Differential Privacy in Multi-Armed Bandits
SR Chowdhury, X Zhou
ICLR 2023, 2022
Cited by: 12 | Year: 2022
On differentially private federated linear contextual bandits
X Zhou, SR Chowdhury
arXiv preprint arXiv:2302.13945, 2023
Cited by: 11 | Year: 2023
Value Function Approximations via Kernel Embeddings for No-Regret Reinforcement Learning
SR Chowdhury, R Oliveira
Asian Conference on Machine Learning, 249-264, 2023
Cited by: 9* | Year: 2023
Adaptive control of differentially private linear quadratic systems
SR Chowdhury, X Zhou, N Shroff
2021 IEEE International Symposium on Information Theory (ISIT), 485-490, 2021
Cited by: 8 | Year: 2021
Reinforcement learning in parametric MDPs with exponential families
SR Chowdhury, A Gopalan, OA Maillard
International Conference on Artificial Intelligence and Statistics, 1855-1863, 2021
Cited by: 8 | Year: 2021
Active learning of conditional mean embeddings via Bayesian optimisation
SR Chowdhury, R Oliveira, F Ramos
Conference on Uncertainty in Artificial Intelligence, 1119-1128, 2020
Cited by: 8 | Year: 2020
Model Selection in Reinforcement Learning with General Function Approximations
A Ghosh, SR Chowdhury
ECML-PKDD, 2022
Cited by: 6* | Year: 2022
On Batch Bayesian Optimization
SR Chowdhury, A Gopalan
arXiv preprint arXiv:1911.01032, 2019
Cited by: 5 | Year: 2019
Provably Sample Efficient RLHF via Active Preference Optimization
N Das, S Chakraborty, A Pacchiano, SR Chowdhury
arXiv preprint arXiv:2402.10500, 2024
Cited by: 2 | Year: 2024
GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval
D Arora, A Kini, SR Chowdhury, N Natarajan, G Sinha, A Sharma
arXiv preprint arXiv:2310.20158, 2023
Cited by: 2 | Year: 2023
Differentially Private Reward Estimation from Preference Based Feedback
SR Chowdhury, X Zhou
ICML 2023 Workshop The Many Facets of Preference-Based Learning, 2023
Cited by: 2 | Year: 2023
A game theoretic approach to robust optimization
SR Chowdhury
M.E. dissertation, Indian Institute of Science Bangalore, 2015
Cited by: 2 | Year: 2015
Articles 1–20