publications

2023

  1. Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation
    Meriem Boubdir, Edward Kim, Beyza Ermis, Marzieh Fadaee, and Sara Hooker
    2023
  2. When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
    2023
  3. InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval
    Vitor Jeronymo, Luiz Bonifacio, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Jakub Zavrel, and Rodrigo Nogueira
    2023

2022

  1. In Defense of Cross-Encoders for Zero-Shot Retrieval
    Guilherme Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, and Rodrigo Nogueira
    2022
  2. InPars: Data Augmentation for Information Retrieval using Large Language Models
    Luiz Henrique Bonifacio, Hugo Abonizio, Marzieh Fadaee, and Rodrigo Nogueira
    In SIGIR, Feb 2022
  3. No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
    Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, and Rodrigo Nogueira
    In arXiv, Feb 2022

2021

  1. mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
    Luiz Bonifacio, Vitor Jeronymo, Hugo Queiroz Abonizio, Israel Campiotti, Marzieh Fadaee, Roberto Lotufo, and Rodrigo Nogueira
    In arXiv, Feb 2021

2020

  1. Understanding and Enhancing the Use of Context for Machine Translation
    Marzieh Fadaee
    Oct 2020
  2. A New Neural Search and Insights Platform for Navigating and Organizing AI Research
    Marzieh Fadaee, Olga Gureenkova, Fernando Rejon Barrera, Carsten Schnober, Wouter Weerkamp, and Jakub Zavrel
    In Proceedings of the First Workshop on Scholarly Document Processing, Nov 2020
  3. The Unreasonable Volatility of Neural Machine Translation Models
    Marzieh Fadaee and Christof Monz
    In Proceedings of the Fourth Workshop on Neural Generation and Translation, Jul 2020

2018

  1. Back-Translation Sampling by Targeting Difficult Words in Neural Machine Translation
    Marzieh Fadaee and Christof Monz
    In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Jul 2018
  2. Examining the Tip of the Iceberg: A Data Set for Idiom Translation
    Marzieh Fadaee, Arianna Bisazza, and Christof Monz
    In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 2018

2017

  1. Data Augmentation for Low-Resource Neural Machine Translation
    Marzieh Fadaee, Arianna Bisazza, and Christof Monz
    In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), Jul 2017
  2. Learning Topic-Sensitive Word Representations
    Marzieh Fadaee, Arianna Bisazza, and Christof Monz
    In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), Jul 2017

2013

  1. Automatic WordNet Construction Using Markov Chain Monte Carlo
    Marzieh Fadaee, Hamidreza Ghader, Heshaam Faili, and Azadeh Shakery
    Polibits, Jul 2013