Следене
Henryk Michalewski
Henryk Michalewski
Потвърден имейл адрес: google.com
Заглавие
Позовавания
Позовавания
Година
Palm: Scaling language modeling with pathways
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
Journal of Machine Learning Research 24 (240), 1-113, 2023
51562023
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
22362023
Program synthesis with large language models
J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ...
arXiv preprint arXiv:2108.07732, 2021
12472021
Model-based reinforcement learning for atari
L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ...
arXiv preprint arXiv:1903.00374, 2019
10302019
Gemma: Open models based on gemini research and technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
7462024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
7202024
Rt-2: Vision-language-action models transfer web knowledge to robotic control
A Brohan, N Brown, J Carbajal, Y Chebotar, X Chen, K Choromanski, ...
arXiv preprint arXiv:2307.15818, 2023
6812023
Solving quantitative reasoning problems with language models
A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...
Advances in Neural Information Processing Systems 35, 3843-3857, 2022
6072022
Show your work: Scratchpads for intermediate computation with language models
M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ...
arXiv preprint arXiv:2112.00114, 2021
5502021
Multi-game decision transformers
KH Lee, O Nachum, MS Yang, L Lee, D Freeman, S Guadarrama, ...
Advances in Neural Information Processing Systems 35, 27921-27936, 2022
2212022
Reinforcement learning of theorem proving
C Kaliszyk, J Urban, H Michalewski, M Olšák
Advances in Neural Information Processing Systems 31, 2018
1902018
Rt-2: Vision-language-action models transfer web knowledge to robotic control
B Zitkovich, T Yu, S Xu, P Xu, T Xiao, F Xia, J Wu, P Wohlhart, S Welker, ...
Conference on Robot Learning, 2165-2183, 2023
1752023
Simulation-based reinforcement learning for real-world autonomous driving
B Osiński, A Jakubowski, P Zięcina, P Miłoś, C Galias, S Homoceanu, ...
2020 IEEE international conference on robotics and automation (ICRA), 6411-6418, 2020
1652020
Promptbreeder: Self-referential self-improvement via prompt evolution
C Fernando, D Banarse, H Michalewski, S Osindero, T Rocktäschel
arXiv preprint arXiv:2309.16797, 2023
1232023
Palm: Scaling language modeling with pathways. arXiv 2022
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311 10, 2022
1152022
Learning to run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments
Ł Kidziński, SP Mohanty, CF Ong, Z Huang, S Zhou, A Pechenko, ...
The NIPS'17 Competition: Building Intelligent Systems, 121-153, 2018
1002018
Focused transformer: Contrastive training for context scaling
S Tworkowski, K Staniszewski, M Pacek, Y Wu, H Michalewski, P Miłoś
Advances in Neural Information Processing Systems 36, 2024
912024
Sparse is enough in scaling transformers
S Jaszczur, A Chowdhery, A Mohiuddin, L Kaiser, W Gajewski, ...
Advances in Neural Information Processing Systems 34, 9895-9907, 2021
902021
Language model cascades
D Dohan, W Xu, A Lewkowycz, J Austin, D Bieber, RG Lopes, Y Wu, ...
arXiv preprint arXiv:2207.10342, 2022
792022
Program synthesis with large language models. CoRR abs/2108.07732 (2021)
J Austin, A Odena, MI Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ...
arXiv preprint arXiv:2108.07732, 2021
552021
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20