Unifying count-based exploration and intrinsic motivation M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos Advances in neural information processing systems 29, 2016 | 1783 | 2016 |
Emergence of locomotion behaviours in rich environments N Heess, D Tb, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ... arXiv preprint arXiv:1707.02286, 2017 | 1165 | 2017 |
OpenSpiel: A framework for reinforcement learning in games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019 | 280 | 2019 |
Learning human behaviors from motion capture by adversarial imitation J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ... arXiv preprint arXiv:1707.02201, 2017 | 247 | 2017 |
Actor-critic policy optimization in partially observable multiagent environments S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ... Advances in neural information processing systems 31, 2018 | 172 | 2018 |
Emergence of locomotion behaviours in rich environments. arXiv 2017 N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ... arXiv preprint arXiv:1707.02286, 2017 | 51 | 2017 |
Segmenting web-domains and hashtags using length specific models S Srinivasan, S Bhattacharya, R Chakraborty Proceedings of the 21st ACM international conference on Information and …, 2012 | 34 | 2012 |
Domain-independent optimistic initialization for reinforcement learning MC Machado, S Srinivasan, M Bowling Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015 | 32 | 2015 |
Emergence of locomotion behaviours in rich environments (2017) N Heess, TB Dhruva, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, ... arXiv preprint arXiv:1707.02286, 2017 | 17 | 2017 |
Emergence of locomotion behaviours in rich environments. CoRR abs/1707.02286 (2017) N Heess, TB Dhruva, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, ... arXiv preprint arXiv:1707.02286, 2017 | 14 | 2017 |
Improving exploration in UCT using local manifolds S Srinivasan, E Talvitie, M Bowling Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015 | 12 | 2015 |
Learning to tokenize web domains S Srinivasan, S Bhattachaya Proceedings of the 20th international conference companion on World wide web …, 2011 | 1 | 2011 |
State Generalization in UCT S Sriram | | 2015 |
Learning Markov Networks with Bounded Inference Complexity UD Gupta, S Sriram, S Sharma, R Greiner | | 2013 |