Siddharth Chandak

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Training Efficiency & Optimization (1)

Frequent co-authors

Rahul Singh (1)Eric Moulines (1)Vivek S. Borkar (1)Nicholas Bambos (1)

Papers (1)

Feb 18, 2026

3w ago·also EPITA

Regret and Sample Complexity of Online Q-Learning via Concentration of Stochastic Approximation with Time-Inhomogeneous Markov Chains

Q-learning regret bounds can be achieved without optimism, but are highly sensitive to the suboptimality gap, motivating a new smoothed exploration strategy.

Rahul Singh, Siddharth Chandak, Eric Moulines +2

Training Efficiency & Optimization

Search

Siddharth Chandak

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)