QTLaunchPad MQM Annotated Corpora (version 2.0)

This resource consists of eight separate corpora (two each for EN>ES, EN>DE, ES>EN, and DE>EN) composed of machine-translated (MT, various systems) and human-translated segments. The segments are annotated with a subset of the issue types available in the Multidimensional Quality Metrics (MQM) framework, with some custom extensions for more analytic detail. The segments were annotated by between 1 and 5 annotators (expert human linguists from commercial language service providers) using the translate5 environment. The annotations were then collated and transformed into HTML resources with advanced filtering capabilities.

All data sets in version 2.0 are available as XML as well as HTML.

