Publications

Contextual Bandits with Online Neural Regression.
Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee
Accepted at 12th International Conference on Learning Representations (ICLR), 2024 | arxiv
Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources.
Rohan Deb, Aadirupa Saha, Arindam Banerjee
Accepted at 27th International Conference on Artificial Intelligence and Statistics (AISTATS), 2024 | arxiv
Does Momentum Help in Stochastic Optimization? A Sample Complexity Analysis.
Swetha Ganesh∗, Rohan Deb∗, Gugan Thoppe, Amarjit Budhiraja
Accepted at 39th Conference on Uncertainty in Artificial Intelligence (UAI), 2023 | UAI | arxiv
Gradient Temporal Difference with Momentum: Stability and Convergence.
Rohan Deb, Shalabh Bhatnagar
Accepted at 36th AAAI Conference on Artificial Intelligence, 2022 | arxiv | AAAI
Schedule Based Temporal Difference Algorithms.
Rohan Deb∗, Meet Gandhi∗, Shalabh Bhatnagar
Accepted at 58th Annual Allerton Conference on Communication, Control, and Computing, 2022 | IEEE | arxiv
N-Timescale Stochastic Approximation: Stability and Convergence.
Rohan Deb, Shalabh Bhatnagar | arxiv

(∗ Equal Contribution)