Следене
Hannah Rose Kirk
Hannah Rose Kirk
Потвърден имейл адрес: oii.ox.ac.uk - Начална страница
Заглавие
Позовавания
Позовавания
Година
Bias out-of-the-box: An empirical analysis of intersectional occupational biases in popular generative language models
HR Kirk, Y Jun, F Volpin, H Iqbal, E Benussi, F Dreyer, A Shtedritski, ...
Advances in neural information processing systems 34, 2611-2624, 2021
43*2021
Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate
HR Kirk, B Vidgen, P Röttger, T Thrush, SA Hale
Proceedings of the 2022 Conference of the North American Chapter of the …, 2021
162021
A prompt array keeps the bias away: Debiasing vision-language models with adversarial learning
H Berg, SM Hall, Y Bhalgat, W Yang, HR Kirk, A Shtedritski, M Bain
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the …, 2022
102022
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset
HR Kirk, Y Jun, P Rauba, G Wachtel, R Li, X Bai, N Broestl, M Doff-Sotta, ...
Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021), 2021
102021
Handling and Presenting Harmful Text in NLP
HR Kirk, A Birhane, B Vidgen, L Derczynski
EMNLP Findings, 2022
5*2022
The nuances of Confucianism in technology policy: An inquiry into the interaction between cultural and political systems in Chinese digital ethics
HR Kirk, K Lee, C Micallef
International Journal of Politics, Culture, and Society, 1-24, 2020
52020
Auditing large language models: a three-layered approach
J Mökander, J Schuett, HR Kirk, L Floridi
arXiv preprint arXiv:2302.08500, 2023
32023
Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements
C Borchers, DS Gala, B Gilburt, E Oravkin, W Bounsi, YM Asano, HR Kirk
Proceedings of the 4th workshop on gender bias in natural language …, 2022
32022
Is More Data Better? Re-thinking the Importance of Efficiency in Abusive Language Detection with Transformers-Based Active Learning
HR Kirk, B Vidgen, SA Hale
Proceedings of the Third Workshop on Threat, Aggression and Cyberbullying …, 2022
22022
Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback
HR Kirk, B Vidgen, P Röttger, SA Hale
arXiv preprint arXiv:2303.05453, 2023
2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
HR Kirk, W Yin, B Vidgen, P Röttger
arXiv preprint arXiv:2303.04222, 2023
2023
Proceedings of the First Workshop on Dynamic Adversarial Data Collection
M Bartolo, H Kirk, P Rodriguez, K Margatina, T Thrush, R Jia, P Stenetorp, ...
Proceedings of the First Workshop on Dynamic Adversarial Data Collection, 2022
2022
The mediation of matchmaking: a comparative study of gender and generational preference in online dating websites and offline blind date markets in Chengdu
HR Kirk, S Gupta
The Journal of Chinese Sociology 9 (1), 2, 2022
2022
China’s AI Policy: An NLP Approach to Assessing China’s Priorities and Governance
H Bailey, HR Kirk, P Howard
2022
Cooperation and Creed: An Experimental Study of Religious Affiliation in Strategic and Societal Interactions
H Kirk
2019
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–15