Publications
- Contextual Bandits with Online Neural Regression.
Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee
Accepted at 12th International Conference on Learning Representations (ICLR), 2024 | arxiv Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources.
Rohan Deb, Aadirupa Saha, Arindam Banerjee
Accepted at 27th International Conference on Artificial Intelligence and Statistics (AISTATS), 2024 | arxivDoes Momentum Help in Stochastic Optimization? A sample complexity Analysis.
Swetha Ganesh∗, Rohan Deb∗, Gugan Thoppe, Amarjit Buddhiraja
Accepted at 39th Conference on Uncertainty in Artificial Intelligence (UAI), 2023 | UAI | arxivGradient Temporal Difference with Momentum: Stability and Convergence.
Rohan Deb, Shalabh Bhatnagar
Accepted at 36th AAAI Conference on Artificial Intelligence, 2022 | arxiv | AAAISchedule Based Temporal Difference Algorithms.
Rohan Deb∗, Meet Gandhi∗, Shalabh Bhatnagar
Accepted at 58th Annual Allerton Conference on Communication, Control, and Computing, 2022 | IEEE | arxiv- N -Timescale Stochastic Approximation: Stability and Convergence.
Rohan Deb, Shalabh Bhatnagar | arxiv
(∗ Equal Contribution)