WMT 2016 Automatic Post-editing and Quality Estimation data set
ID:
http://hdl.handle.net/11372/LRT-2390
Training, development and test data consist of English-German triplets (source, target and post-edit) from the Information Technology domain, already tokenised. The training and development sets contain 12,000 and 1,000 triplets respectively, while the test set contains 2,000 instances. Target sentences are machine-translated with the KIT system. Post-edits are collected by Text & Form from professional translators.
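As an illustration of the triplet structure described above, the following is a minimal Python sketch for reading one split into aligned (source, target, post-edit) records. It assumes the three sides are stored as separate line-aligned UTF-8 text files with one tokenised sentence per line; that layout, the `Triplet` type, the `read_triplets` function and the file names in the usage note are hypothetical and should be checked against the actual archive contents.

```python
from typing import List, NamedTuple


class Triplet(NamedTuple):
    """One (source, target, post-edit) record; each side is a list of tokens."""
    source: List[str]
    target: List[str]
    post_edit: List[str]


def _read_lines(path: str) -> List[str]:
    # Read one sentence per line, dropping only the trailing newline.
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]


def read_triplets(src_path: str, mt_path: str, pe_path: str) -> List[Triplet]:
    """Load three line-aligned files into a list of triplets.

    Assumption: line i of each file belongs to the same triplet, and the
    text is already tokenised, so splitting on whitespace recovers tokens.
    """
    src = _read_lines(src_path)
    mt = _read_lines(mt_path)
    pe = _read_lines(pe_path)
    if not (len(src) == len(mt) == len(pe)):
        raise ValueError(
            f"Files are not line-aligned: {len(src)} source, "
            f"{len(mt)} target, {len(pe)} post-edit lines"
        )
    return [
        Triplet(s.split(), m.split(), p.split())
        for s, m, p in zip(src, mt, pe)
    ]
```

With hypothetical file names, `read_triplets("train.src", "train.mt", "train.pe")` would be expected to return 12,000 records for the training split if the files follow the assumed layout. The length check guards against pairing sentences from misaligned files.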
IMPORTANT LEGAL NOTICE (This dataset is provided under the following terms of use)
TAUS Terms of Use (https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21).
TAUS grants to QT21 User access to the WMT Data Set with the following rights:
i) the right to use the target side of the translation units into a commercial product, provided that QT21 User may not resell the WMT Data Set as if it is its own new translation;
ii) the right to make Derivative Works; and
iii) the right to use or resell such Derivative Works commercially and for the following goals:
i) research and benchmarking;
ii) piloting new solutions; and
iii) testing of new commercial services.