Follow
Boris Belousov
Title
Cited by
Cited by
Year
Self-paced contextual reinforcement learning
P Klink, H Abdulsamad, B Belousov, J Peters
Conference on Robot Learning, 513-529, 2020
342020
Catching heuristics are optimal control policies
B Belousov, G Neumann, CA Rothkopf, J Peters
NIPS, 1426-1434, 2016
302016
Entropic risk measure in policy search
D Nass, B Belousov, J Peters
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
282019
HJB optimal feedback control with deep differential value functions and action constraints
M Lutter, B Belousov, K Listmann, D Clever, J Peters
Conference on Robot Learning, 640-650, 2020
172020
Learn2assemble with structured representations and search for robotic architectural construction
N Funk, G Chalvatzaki, B Belousov, J Peters
Conference on Robot Learning, 1401-1411, 2022
162022
Entropic Regularization of Markov Decision Processes
B Belousov, J Peters
Entropy 21 (7), 2019
152019
Neural posterior domain randomization
F Muratore, T Gruner, F Wiese, B Belousov, M Gienger, J Peters
Conference on Robot Learning, 1532-1542, 2022
132022
Receding horizon curiosity
M Schultheis, B Belousov, H Abdulsamad, J Peters
Conference on robot learning, 1278-1288, 2020
122020
A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning.
P Klink, H Abdulsamad, B Belousov, C D'Eramo, J Peters, J Pajarinen
J. Mach. Learn. Res. 22, 182:1-182:52, 2021
112021
Building a Library of Tactile Skills Based on FingerVision
B Belousov, A Sadybakasov, B Wibranek, F Veiga, O Tessmann, J Peters
2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), 2019
112019
Robotic architectural assembly with tactile skills: Simulation and optimization
B Belousov, B Wibranek, J Schneider, T Schneider, G Chalvatzaki, ...
Automation in Construction 133, 104006, 2022
92022
f-Divergence constrained policy improvement
B Belousov, J Peters
arXiv preprint arXiv:1801.00056, 2017
92017
Reinforcement Learning Algorithms: Analysis and Applications
B Belousov, H Abdulsamad, P Klink, S Parisi, J Peters
Springer Nature, 2021
72021
Reinforcement Learning for Sequential Assembly of SL-Blocks: Self-Interlocking Combinatorial Design Based on Machine Learning
B Wibranek, Y Liu, N Funk, B Belousov, J Peters, O Tessmann
Proceedings of the 39th eCAADe Conference 1, 27-36, 2021
52021
Distributionally robust trajectory optimization under uncertain dynamics via relative-entropy trust regions
H Abdulsamad, T Dorau, B Belousov, JJ Zhu, J Peters
arXiv preprint arXiv:2103.15388, 2021
42021
Interactive Structure: Robotic Repositioning of Vertical Elements in Man-Machine Collaborative Assembly through Vision-Based Tactile Sensing
B Wibranek, B Belousov, A Sadybakasov, J Peters, O Tessmann
37th eCAADe and 23rd SIGraDi Conference 2, 705-713, 2019
32019
Continuous-time fitted value iteration for robust policies
M Lutter, B Belousov, S Mannor, D Fox, A Garg, J Peters
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
22022
Underactuated waypoint trajectory optimization for light painting photography
C Eilers, J Eschmann, R Menzenbach, B Belousov, F Muratore, J Peters
2020 IEEE International Conference on Robotics and Automation (ICRA), 1505-1510, 2020
22020
Belief space model predictive control for approximately optimal system identification
B Belousov, H Abdulsamad, M Schultheis, J Peters
4th Multidisciplinary Conference on Reinforcement Learning and Decision Making, 2019
2*2019
Active exploration for robotic manipulation
T Schneider, B Belousov, G Chalvatzaki, D Romeres, DK Jha, J Peters
arXiv preprint arXiv:2210.12806, 2022
12022
The system can't perform the operation now. Try again later.
Articles 1–20