- Recent talks:
- 7 December 2020: Referential Cohesion: A Challenge for MT Evaluation, keynote talk at the Second International Workshop of Discourse Processing at AACL-IJCNLP 2020.
- 3 December 2020: Referential Cohesion: A Challenge for MT Evaluation, Lectures on Language Technology and Translation, Uppsala University, Department of Linguistics and Philology.
- 2 September 2020: Good Pronoun Translation: What is it and how do we recognise it?, IT University of Copenhagen, Department of Computer Science.
- 13 July 2020: Characterising Cross-Lingual Variation in Pronoun Use: Methods and Approaches. University of Luxembourg, Department of Humanities.
- 2 April 2020: Good Pronoun Translation: What is it and how do we recognise it?, University of Edinburgh, School of Informatics.
- I’m on the organisation committee of the following two workshops in 2021:
- Second Workshop on Gender Bias in Natural Language Processing (GeBNLP 2021) at COLING 2020 in Bangkok, 5/6 August 2021.
- First Workshop on Computational Approaches to Discourse (CODI 2021) at EMNLP 2021 in Punta Cana, November 2021.
I’m an Associate Professor of Computer Science at the IT University of Copenhagen, where I’m a part of the NLPnorth natural language processing group. I’m also affiliated with the Computational Linguistics Group at Uppsala University as a Researcher and Associate Professor (Docent) in Computational Linguistics. I’m currently staying in Edinburgh, where I’ve spent 1.5 years as a visitor and Senior Researcher at the School of Informatics of the University of Edinburgh. From 2009 to 2011, I was a member of the machine translation group at Fondazione Bruno Kessler in Trento.
I hold a PhD in Computational Linguistics from Uppsala University. My PhD thesis was on Discourse in Statistical Machine Translation and received the Best Thesis Award of the European Association for Machine Translation in 2015. My supervisors were Joakim Nivre, Jörg Tiedemann and Marcello Federico. I also have an MA in Nordic Philology from the University of Basel.
I work in computational linguistics, and my research is at the intersection of statistical natural language processing, machine translation, translation studies and text linguistics. My goal is to create NLP systems with a higher awareness of linguistic context and non-linguistic aspects of communicative situations. I am also interested in studying high-level problems in translation using methods from statistical NLP and machine translation.
I’m particularly curious about how referring expressions such as pronouns (like she, it, they or this) and lexical noun phrases (like a cat, the house, scrambled eggs or my research) are used across languages, how human translators treat them, what machine translation systems should do with them and how we can use multilingual data to help us interpret them automatically.
Here are some of the angles from which I’ve approached these problems in the last few years:
- Discourse-level MT and its evaluation (1, 2, 3)
- Automatic discovery of discourse-related language contrasts in human-translated parallel corpora (4, 5)
- Cross-lingual studies of the generation and interpretation of referring expressions with human subjects (6, 7)
- Coreference annotation of multilingual corpora (8, 9)
- Neural language modelling (10) and cross-lingual pronoun prediction (11, 12)
I’m a member of the organisation committee of the Workshop on Gender Bias in Natural Language Processing (GeBNLP) at ACL 2019, COLING 2020 and ACL-IJCNLP 2021 and of the Workshop on Computational Approaches to Discourse (CODI) at EMNLP 2020 and EMNLP 2021. I also co-organised the Workshop on Discourse in MT (DiscoMT), last held at EMNLP-IJCNLP 2019.