Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incomparable results in the WMT 2014 EN-DE table for machine translation #412

Open
rihardsk opened this issue Jan 29, 2020 · 1 comment
Open

Comments

@rihardsk
Copy link

I noticed that some of the results reported in the WMT 2014 EN-DE table are obtained by models trained on data from newer WMT datasets (but they report results on newstest2014), e.g, Edunov et al. (2018) uses WMT’18 and Wu et al. (2019) uses WMT’16 for training.

The few results on WMT 2014 EN-FR that i checked were fine though. Here are the papers i checked

Paper en-de data en-fr data
Transformer (Vaswani et al., 2017) WMT’2014 WMT’2014
AdvSoft + Transformer Big (Wang et al., 2019) WMT’2014
MUSE (Zhao et al., 2019) WMT’2014 WMT’2014
DynamicConv (Wu et al., 2019) WMT’2016 WMT’2014
Transformer Big + BT (Edunov et al., 2018) WMT’2018 WMT’2014
@sebastianruder
Copy link
Owner

Thanks for this observation. Do you think you could an asterisk (*) to the models that use newer data for training?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants