 | Mahadevan, Sridhar |
 |
 | Mahadevan, Sridhar -- Quantifying Prior Determination Knowledge Using the PAC Learning Model - 1994 |
 | Mahadevan, Sridhar -- To discount or not to discount in reinforcement learning: a case study comparing R learning and Q learning - 1994 |
 | Mahadevan, Sridhar -- Sensitive discount optimality: unifying discounted and average reward reinforcement learning - 1996 |
 | Mahadevan, Sridhar -- Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results - 1996 |
 | Mahadevan, Sridhar -- Self-improving factory simulation using continuous-time average-reward reinforcement learning - 1997 |
|