Victoria Krakovna


Citations

	All	Since 2019
Citations	1530	1441
h-index	12	12
i10-index	15	14

Citations per year

Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	9	61	108	157	162	186	255	569

Public access

Based on funding mandates: 1 article available, 0 articles not available

Co-authors

Tom Everitt, Staff Research Scientist at Google DeepMind. Verified email at google.com
Laurent Orseau, Research Scientist at Google DeepMind. Verified email at google.com
Ramana Kumar, DeepMind. Verified email at cl.cam.ac.uk
Miljan Martic, DeepMind. Verified email at google.com
Jonathan Uesato. Verified email at mit.edu
Marcus Hutter, Researcher@DeepMind & Professor at ANU. Verified email at anu.edu.au
Zachary Kenton, Google DeepMind. Verified email at google.com
Pedro A. Ortega, Artificial Intelligence & Machine Learning. Verified email at adaptiveagents.org
Jan Leike, OpenAI. Verified email at openai.com
Richard Ngo, OpenAI. Verified email at openai.com
Matthew Rahtz, Google DeepMind. Verified email at google.com
Finale Doshi-Velez, Professor, Harvard. Verified email at seas.harvard.edu
Vladimir Mikulik, DeepMind. Verified email at google.com
Rohin Shah, Research Scientist, Google DeepMind. Verified email at deepmind.com
Vikrant Varma, DeepMind. Verified email at deepmind.com
Mary Phuong, IST Austria. Verified email at ist.ac.at
Janos Kramar, DeepMind. Verified email at google.com
Gerald Penn, Professor of Computer Science, University of Toronto. Verified email at cs.toronto.edu
Jun S Liu, Professor of statistics, Harvard University. Verified email at stat.harvard.edu
Andis Draguns, IMCS UL, MATS. Verified email at lumii.lv


Victoria Krakovna

Other names: Viktoriya Krakovna

Senior Research Scientist at DeepMind

Verified email at google.com

AI Alignment, Agent Incentives, Interpretability, Reinforcement Learning, Machine Learning


Title	Citations	Year
Gemini: a family of highly capable multimodal models. G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	520	2023
AI safety gridworlds. J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ... arXiv preprint arXiv:1711.09883, 2017	330	2017
Reinforcement Learning with a Corrupted Reward Channel. T Everitt, V Krakovna, L Orseau, M Hutter, S Legg. IJCAI AI & Autonomy, 2017	116	2017
Specification gaming: the flip side of AI ingenuity. V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ... https://deepmind.com/blog/article/Specification-gaming-the-flip-side-of-AI …, 2020	100*	2020
Reward tampering problems and solutions in reinforcement learning: A causal influence diagram perspective. T Everitt, M Hutter, R Kumar, V Krakovna. Synthese 198 (Suppl 27), 6435-6467, 2021	83	2021
Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models. V Krakovna, F Doshi-Velez. ICML Workshop on Human Interpretability (WHI 2016), arXiv preprint arXiv …, 2016	82	2016
Penalizing side effects using stepwise relative reachability. V Krakovna, L Orseau, R Kumar, M Martic, S Legg. arXiv preprint arXiv:1806.01186, 2018	58	2018
Goal misgeneralization: why correct specifications aren't enough for correct goals. R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton. arXiv preprint arXiv:2210.01790, 2022	46	2022
Avoiding Side Effects By Considering Future Tasks. V Krakovna, L Orseau, R Ngo, M Martic, S Legg. NeurIPS 2020, arXiv preprint arXiv:2010.07877, 2020	43	2020
Specification gaming examples in AI. V Krakovna. tinyurl.com/specification-gaming, 2018	37*	2018
Modeling AGI safety frameworks with causal influence diagrams. T Everitt, R Kumar, V Krakovna, S Legg. arXiv preprint arXiv:1906.08663, 2019	23	2019
Measuring and avoiding side effects using relative reachability. V Krakovna, L Orseau, M Martic, S Legg. arXiv preprint arXiv:1806.01186, 2018	18	2018
REALab: An embedded perspective on tampering. R Kumar, J Uesato, R Ngo, T Everitt, V Krakovna, S Legg. arXiv preprint arXiv:2011.08820, 2020	12	2020
Power-seeking can be probable and predictive for trained agents. V Krakovna, J Kramar. arXiv preprint arXiv:2304.06528, 2023	10*	2023
Memory-bounded left-corner unsupervised grammar induction on child-directed input. C Shain, W Bryce, L Jin, V Krakovna, F Doshi-Velez, T Miller, W Schuler, ... Proceedings of COLING 2016, the 26th International Conference on …, 2016	10*	2016
Avoiding tampering incentives in deep RL via decoupled approval. J Uesato, R Kumar, V Krakovna, T Everitt, R Ngo, S Legg. arXiv preprint arXiv:2011.08827, 2020	7	2020
Interpretable selection and visualization of features and interactions using Bayesian forests. V Krakovna, J Du, JS Liu. Statistics and its Interface 2018 (Volume 11 Number 3), arXiv preprint arXiv …, 2015	6*	2015
A generalized-zero-preserving method for compact encoding of concept lattices. M Skala, V Krakovna, J Kramár, G Penn. Proceedings of the 48th annual meeting of the Association for Computational …, 2010	6	2010
A Minimalistic Approach to Sum-Product Network Learning for Real Applications. V Krakovna, M Looks. ICLR 2016 workshop, arXiv preprint arXiv:1602.04259, 2016	5	2016
Building interpretable models: From Bayesian networks to neural networks. V Krakovna	4	2016
