publications

2024

  1. EMNLP
    BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training
    Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, and 1 more author
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  2. EAMT
    Estonian-Centric Machine Translation: Data, Models, and Challenges
    Elizaveta Korotkova and Mark Fishel
    In Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024
  3. LREC-COLING
    Multilinguality or Back-translation? A Case Study with Estonian
    Elizaveta Korotkova, Taido Purason, Agnes Luhtaru, and 1 more author
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024
  4. EACL
    No Error Left Behind: Multilingual Grammatical Error Correction with Pre-trained Translation Models
    Agnes Luhtaru, Elizaveta Korotkova, and Mark Fishel
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

  1. NoDaLiDa
    Distilling Estonian Text Domains for Production-Oriented Machine Translation
    Elizaveta Korotkova and Mark Fishel
    In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), 2023
  2. preprint
    Beyond Toxic: Toxicity Detection Datasets are Not Enough for Brand Safety
    Elizaveta Korotkova and Isaac Chung
    2023

2021

  1. WMT
    Translation Transformers Rediscover Inherent Data Domains
    Maksym Del*, Elizaveta Korotkova*, and Mark Fishel
    In Proceedings of the Sixth Conference on Machine Translation, 2021

2019

  1. preprint
    Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation
    Elizaveta Korotkova, Agnes Luhtaru, Maksym Del, and 3 more authors
    2019
  2. WMT
    University of Tartu’s Multilingual Multi-domain WMT19 News Translation Shared Task Submission
    Andre Tättar, Elizaveta Korotkova, and Mark Fishel
    In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), 2019