| mtedx/valid |
mTEDx evaluation data, valid: URL |
| mtedx/test |
mTEDx evaluation data, test: URL |
| wmt23 |
Official evaluation and system data for WMT23. |
| wmt22 |
Official evaluation and system data for WMT22. |
| wmt21/systems |
WMT21 system output. |
| wmt21/dev |
Development data for WMT21,if multiple references are available, the first one is used. |
| wmt21/D |
Official evaluation data for WMT21 with reference D |
| wmt21/C |
Official evaluation data for WMT21 with reference C |
| wmt21/B |
Official evaluation data for WMT21 with reference B. |
| wmt21/AC |
Official evaluation data for WMT21 with references A and C |
| wmt21/AB |
Official evaluation data for WMT21 with references A and B. |
| wmt21 |
Official evaluation data for WMT21. |
| wmt20/robust/set1 |
WMT20 robustness task, set 1 |
| wmt20/robust/set2 |
WMT20 robustness task, set 2 |
| wmt20/robust/set3 |
WMT20 robustness task, set 3 |
| wmt20/tworefs |
WMT20 news test sets with two references |
| wmt20 |
Official evaluation data for WMT20 |
| mtnt2019 |
Test set for the WMT 19 robustness shared task |
| mtnt1.1/test |
Test data for the Machine Translation of Noisy Text task: URL |
| mtnt1.1/valid |
Validation data for the Machine Translation of Noisy Text task: URL |
| mtnt1.1/train |
Training data for the Machine Translation of Noisy Text task: URL |
| wmt20/dev |
Development data for tasks new to 2020. |
| wmt19 |
Official evaluation data. |
| wmt19/dev |
Development data for tasks new to 2019. |
| wmt19/google/ar |
Additional high-quality reference for WMT19/en-de. |
| wmt19/google/arp |
Additional paraphrase of wmt19/google/ar. |
| wmt19/google/wmtp |
Additional paraphrase of the official WMT19 reference. |
| wmt19/google/hqr |
Best human selected-reference between wmt19 and wmt19/google/ar. |
| wmt19/google/hqp |
Best human-selected reference between wmt19/google/arp and wmt19/google/wmtp. |
| wmt19/google/hqall |
Best human-selected reference among original official reference and the Google reference and paraphrases. |
| wmt18 |
Official evaluation data. |
| wmt18/test-ts |
Official evaluation sources with extra test sets interleaved. |
| wmt18/dev |
Development data (Estonian<>English). |
| wmt17 |
Official evaluation data. |
| wmt17/B |
Additional reference for EN-FI and FI-EN. |
| wmt17/tworefs |
Systems with two references. |
| wmt17/improved |
Improved zh-en and en-zh translations. |
| wmt17/dev |
Development sets released for new languages in 2017. |
| wmt17/ms |
Additional Chinese-English references from Microsoft Research. |
| wmt16 |
Official evaluation data. |
| wmt16/B |
Additional reference for EN-FI. |
| wmt16/tworefs |
EN-FI with two references. |
| wmt16/dev |
Development sets released for new languages in 2016. |
| wmt15 |
Official evaluation data. |
| wmt14 |
Official evaluation data. |
| wmt14/full |
Evaluation data released after official evaluation for further research. |
| wmt13 |
Official evaluation data. |
| wmt12 |
Official evaluation data. |
| wmt11 |
Official evaluation data. |
| wmt10 |
Official evaluation data. |
| wmt09 |
Official evaluation data. |
| wmt08 |
Official evaluation data. |
| wmt08/nc |
Official evaluation data (news commentary). |
| wmt08/europarl |
Official evaluation data (Europarl). |
| iwslt17 |
Official evaluation data for IWSLT. |
| iwslt17/tst2016 |
Development data for IWSLT 2017. |
| iwslt17/tst2015 |
Development data for IWSLT 2017. |
| iwslt17/tst2014 |
Development data for IWSLT 2017. |
| iwslt17/tst2013 |
Development data for IWSLT 2017. |
| iwslt17/tst2012 |
Development data for IWSLT 2017. |
| iwslt17/tst2011 |
Development data for IWSLT 2017. |
| iwslt17/tst2010 |
Development data for IWSLT 2017. |
| iwslt17/dev2010 |
Development data for IWSLT 2017. |
| multi30k/2016 |
2016 flickr test set of Multi30k dataset |
| multi30k/2017 |
2017 flickr test set of Multi30k dataset |
| multi30k/2018 |
2018 flickr test set of Multi30k dataset. See URL for evaluation. |