## New test sets. At long last, you can get WMT22 data from sacrebleu.
sacrebleu -t wmt22 -l en-de --echo src
## Metadata fields
The `echo` option will now output other fields, such as “origlang” and “docid”. These are available even to old formats, e.g.,
sacrebleu -t wmt15 -l en-fr --echo origlang docid src
# list all fields
sacrebleu -t wmt15 --list
This makes it easy to, say, grep out non-source-language data, or group sentences by document.
## System outputs
Since the matrix is no longer with us (RIP), there has been no good way to get scores for system submissions. Thanks to the new XML format that Barry developed and deployed last year, this is now easy. You can get wmt21 systems using the wmt21/systems target, as well as wmt22.
To see a list of systems, give some invalid output to `--echo`:
sacrebleu -t wmt21/systems -l en-de --echo i-miss-the-matrix
sacrebleu -t wmt22 -l uk-cs --echo CharlesTranslator
You can then easily score them by piping the output right back into sacrebleu, e.g.,
$ sacrebleu -t wmt22 -l en-cs --echo Online-B | sacrebleu -t wmt22 -l en-cs -tok flores101
{
"signature": "nrefs:1|case:mixed|eff:no|tok:flores101|smooth:exp|version:2.3.0",
"verbose_score": "72.5/57.6/48.1/41.0 (BP = 0.992 ratio = 0.992 hyp_len = 52182 ref_len = 52591)",
(Add “-f text” if you prefer the old text-based signature). The default remains 13a. Why? Because we’ve always done it that way!