Dynamics-aware unsupervised discovery of skills A Sharma, S Gu, S Levine, V Kumar, K Hausman International Conference on Learning Representations (ICLR), 2020, 2019 | 325 | 2019 |
Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning A Sharma, M Ahn, S Levine, V Kumar, K Hausman, S Gu Robotics: Science and Systems (RSS), 2020 | 39 | 2020 |
Variational empowerment as representation learning for goal-based reinforcement learning J Choi, A Sharma, H Lee, S Levine, SS Gu arXiv preprint arXiv:2106.01404, 2021 | 33* | 2021 |
Direct Preference Optimization: Your Language Model is Secretly a Reward Model R Rafailov, A Sharma, E Mitchell, S Ermon, CD Manning, C Finn arXiv preprint arXiv:2305.18290, 2023 | 25 | 2023 |
Autonomous Reinforcement Learning via Subgoal Curricula A Sharma, A Gupta, S Levine, K Hausman, C Finn Thirty-Fifth Conference on Neural Information Processing Systems, 2021 | 22 | 2021 |
Autonomous Reinforcement Learning: Formalism and Benchmarking A Sharma, K Xu, N Sardana, A Gupta, K Hausman, S Levine, C Finn arXiv preprint arXiv:2112.09605, 2021 | 15 | 2021 |
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning A Sharma, R Ahmad, C Finn arXiv preprint arXiv:2205.05212, 2022 | 10 | 2022 |
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback K Tian, E Mitchell, A Zhou, A Sharma, R Rafailov, H Yao, C Finn, ... arXiv preprint arXiv:2305.14975, 2023 | 7 | 2023 |
A flexible probabilistic framework for large-margin mixture of experts A Sharma, S Saxena, P Rai Machine Learning 108 (8-9), 1369-1393, 2019 | 5 | 2019 |
When to ask for help: Proactive interventions in autonomous reinforcement learning A Xie, F Tajwar, A Sharma, C Finn Advances in Neural Information Processing Systems 35, 16918-16930, 2022 | 4 | 2022 |
Dynamics-aware unsupervised skill discovery A Sharma, S Gu, S Levine, V Kumar, K Hausman Proceedings of the International Conference on Learning Representations (ICLR), 2019 | 4 | 2019 |
You Only Live Once: Single-Life Reinforcement Learning A Chen, A Sharma, S Levine, C Finn Advances in Neural Information Processing Systems 35, 14784-14797, 2022 | 3 | 2022 |
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning A Sharma, AM Ahmed, R Ahmad, C Finn arXiv preprint arXiv:2303.01488, 2023 | 1 | 2023 |
Discriminator Augmented Model-Based Reinforcement Learning B Haghgoo, A Zhou, A Sharma, C Finn arXiv preprint arXiv:2103.12999, 2021 | 1 | 2021 |
Waypoint-Based Imitation Learning for Robotic Manipulation LX Shi, A Sharma, TZ Zhao, C Finn arXiv preprint arXiv:2307.14326, 2023 | | 2023 |