About me

News

  • 12 December 2019: As of today, I’m a Docent in Computational linguistics at Uppsala University.
  • Recent talks:
    • 3 December 2019: Referring Expressions and Multilingual Language Technology, Chalmers University of Technology, Gothenburg, Data Science and AI Division.
    • 22 November 2019: Discourse Phenomena in Machine Translations, Saarland University, Department of Language Science and Technology.
    • 19 November 2019: Discourse Phenomena in Machine Translations, SMART-Select Workshop on Document-Level MT Evaluation, Luxembourg.
    • 14 November 2019: Gender Bias as an Issue in Natural Language Processing, Uppsala University, Faculty of Languages (Docent lecture).
  • I’m one of the organisers of the following two workshops in 2020:

Bio

I’m a Researcher and Associate Professor (Docent) in Computational Linguistics at Uppsala University. From summer 2019 until the end of 2020, I’m working at the University of Edinburgh, and during some of that time, I was also a Senior Researcher at the School of Informatics at the University of Edinburgh. Before I came to Uppsala, I was in the machine translation group at Fondazione Bruno Kessler in Trento.

I hold a PhD in Computational Linguistics from Uppsala University. My PhD thesis was on Discourse in Statistical Machine Translation and received the Best Thesis Award of the European Association for Machine Translation in 2015. My supervisors were Joakim Nivre, Jörg Tiedemann and Marcello Federico. I also have an MA in Nordic Philology from the University of Basel.

Research

I work in computational linguistics, and my research is at the intersection of statistical natural language processing, machine translation, translation studies and text linguistics. My goal is to study high-level problems in translation using methods from statistical NLP and machine translation and use those insights to enable progress in MT and multilingual NLP.

I’m particularly interested in how referring expressions such as pronouns (like she, it, they or this) and lexical noun phrases (like a cat, the house, scrambled eggs or my research) are used across languages, how human translators treat them, what machine translation systems should do with them and how we can use multilingual data to help us interpret them automatically.

Here are some of the angles from which I’ve approached these problems in the last few years:

  • Discourse-level MT and its evaluation (1, 2, 3)
  • Automatic discovery of discourse-related language contrasts in human-translated parallel corpora (4, 5)
  • Cross-lingual studies of the generation and interpretation of referring expressions with human subjects (6, 7)
  • Coreference annotation of multilingual corpora (8, 9)
  • Neural language modelling (10) and cross-lingual pronoun prediction (11, 12)

More recently, I’ve done some work on gender bias in MT and NLP more generally, a problem that often manifests itself in the interpretation and generation of referring expressions (13, 14).

In 2020, I’m one of the organisers of the 2nd Workshop on Gender Bias in Natural Language Processing (GeBNLP) at COLING in Barcelona and the 1st Workshop on Computational Approaches to Discourse (CODI) at EMNLP in Punta Cana. In 2019, I helped organise the 1st Workshop on Gender Bias in Natural Language Processing (GeBNLP) and the 4th Workshop on Discourse in MT (DiscoMT).