 Mahadevan, Sridhar 

 Mahadevan, Sridhar  Quantifying Prior Determination Knowledge Using the PAC Learning Model  1994 
 Mahadevan, Sridhar  To discount or not to discount in reinforcement learning: a case study comparing R learning and Q learning  1994 
 Mahadevan, Sridhar  Sensitive discount optimality: unifying discounted and average reward reinforcement learning  1996 
 Mahadevan, Sridhar  Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results  1996 
 Mahadevan, Sridhar  Selfimproving factory simulation using continuoustime averagereward reinforcement learning  1997 
