Anya Belz
Professor of Computer Science, ADAPT Research Centre, Dublin City University, Ireland
Verified email at adaptcentre.ie
Title
Cited by
Year
Comparing automatic and human evaluation of NLG systems
A Belz, E Reiter
11th conference of the european chapter of the association for computational …, 2006
Cited by 281, 2006
Automatic generation of weather forecast texts using comprehensive probabilistic generation-space models
A Belz
Natural Language Engineering 14 (4), 431-455, 2008
Cited by 266, 2008
An investigation into the validity of some metrics for automatically evaluating natural language generation systems
E Reiter, A Belz
Computational Linguistics 35 (4), 529-558, 2009
Cited by 251, 2009
Twenty Years of Confusion in Human Evaluation: NLG needs evaluation sheets and standardised definitions
D Howcroft, A Belz, D Gkatzia, S Hasan, S Mahamood, S Mille, M Clinciu, ...
International Natural Language Generation Conference 2020 (INLG'20), 2020
Cited by 200, 2020
The first surface realisation shared task: Overview and evaluation results
A Belz, M White, D Espinosa, E Kow, D Hogan, A Stent
Proceedings of the 13th European workshop on natural language generation …, 2011
Cited by 109, 2011
The TUNA-REG Challenge 2009: Overview and evaluation results
A Gatt, A Belz, E Kow
Association for Computational Linguistics, 2009
Cited by 81, 2009
A Systematic Review of Reproducibility Research in Natural Language Processing
A Belz, S Agarwal, A Shimorina, E Reiter
EACL'21, 2021
Cited by 80, 2021
Introducing shared tasks to NLG: The TUNA shared task evaluation challenges
A Gatt, A Belz
Conference of the European Association for Computational Linguistics, 264-293, 2009
Cited by 79, 2009
Intrinsic vs. extrinsic evaluation measures for referring expression generation
A Belz, A Gatt
Proceedings of ACL-08: HLT, Short Papers, 197-200, 2008
Cited by 72, 2008
The First Multilingual Surface Realisation Shared Task (SR'18): Overview and Evaluation Results
S Mille, A Belz, B Bohnet, Y Graham, E Pitler, L Wanner
Proceedings of the ACL'18 Workshop on Multilingual Surface Realisation …, 2018
Cited by 68, 2018
Disentangling the Properties of Human Evaluation Methods: A Classification System to Support Comparability, Meta-Evaluation and Reproducibility Testing
A Belz, S Mille, D Howcroft
International Natural Language Generation Conference 2020 (INLG'20), 2020
Cited by 67, 2020
The TUNA challenge 2008: Overview and evaluation results
A Gatt, A Belz, E Kow
Association for Computational Linguistics, 2008
Cited by 62, 2008
The attribute selection for GRE challenge: Overview and evaluation results
A Belz, A Gatt
Proceedings of the Workshop on Using corpora for natural language generation, 2007
Cited by 58, 2007
That's nice… what can you do with it?
A Belz
Computational Linguistics 35 (1), 2009
Cited by 52*, 2009
ITRI-02-04 PILLS: Multilingual generation of medical information documents with overlapping content
N Bouayad-Agha, R Power, D Scott, A Belz
Proceedings of LREC 2002, 22-31, 2002
Cited by 52, 2002
Generating referring expressions in context: The GREC task evaluation challenges
A Belz, E Kow, J Viethen, A Gatt
Conference of the European Association for Computational Linguistics, 294-327, 2009
Cited by 47, 2009
The Second Multilingual Surface Realisation Shared Task (SR'19): Overview and Evaluation Results
S Mille, A Belz, B Bohnet, Y Graham, L Wanner
Proceedings of the 2nd Workshop on Multilingual Surface Realisation, 2019
Cited by 44, 2019
Missing information, unresponsive authors, experimental flaws: The impossibility of assessing the reproducibility of previous human evaluations in NLP
A Belz, C Thomson, E Reiter, G Abercrombie, JM Alonso-Moral, M Arvan, ...
arXiv preprint arXiv:2305.01633, 2023
Cited by 42*, 2023
The ReproGen Shared Task on Reproducibility of Human Evaluations in NLG: Overview and Results
A Belz, A Shimorina, S Agarwal, E Reiter
Proceedings of the 14th International Natural Language Generation Conference …, 2021
Cited by 41, 2021
The human evaluation datasheet 1.0: A template for recording details of human evaluation experiments in NLP
A Shimorina, A Belz
arXiv preprint arXiv:2103.09710, 2021
Cited by 41, 2021