The TaraXÜ Corpus of Human-Annotated Machine Translations - 4rth evaluation round

153 Last view: 2024-05-04

2 Last update: 2014-10-21

33 Last download: 2021-11-07

The TaraXÜ Corpus of Human-Annotated Machine Translations - 4rth evaluation round

TaraXÜ corpus round 4

http://www.qt21.eu/launchpad

The corpus was created in the framework of the TaraXU project. The approach rises from the need to detach Machine Translation (MT) evaluation from a pure research-oriented development scenario and to bring it closer to the end users. Therefore, three evaluation rounds were performed in close co-operation with translation industry. The evaluation process has been designed in order to answer particular questions closely related with the applicability of MT within a real-time professional translation environment. All evaluation tasks have been performed by qualified professional translators.

The evaluation rounds, resulting in the corpus discussed in this paper, built on one another in a logical procession: the first round created baseline results, whereas each further round was
concerned with more elaborated measuring methods and more specific factors impacting translation quality. Findings of evaluating the results from these rounds have been published in Avramidis et. al 2012 and Popovic et. al. 2013. Parts of the corpus have more recently been used in the QTLaunchPad project [http://www.qt21.eu/launchpad] where they served as the basis for a more detailed error analysis.

Note that this is one part of the corpus. More parts (to) appear in separate entries.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Unrestricted Use

Licence

CC - BY

Restrictions: Attribution

Fee: 0

Download location: hidden

Distribution Access/Medium: Downloadable

IPR Holder

German Research Center for Artificial Intelligence

Contact Person

Eleftherios Avramidis

text

Bilingual text corpusLanguages

Czech German English French Spanish; Castilian

Linguality

Linguality type: Bilingual

Multi-linguality type: Parallel

Size

2,000 Sentences

Modalities

Written Language

Resource Creation

Resource Creator

Deutsches Forschungszentrum für Künstliche Intelligenz

Creation lasted: 05/01/2010 - 09/01/2013

Funding Project

TaraXÜ

URL: http://taraxu.dfki.de

Funding Type: Other

Funder: Technologiestiftung Berlin

Funding Country: Germany

Project duration: 05/01/2010 - 09/30/2013

QTLaunchPad

URL: http://www.qt21.eu/l...

Funding Type: Eu Funds

Funder: European Union FP7

Project duration: 07/01/2012 - 06/30/2014

Metadata

Created: 03/24/2014

Last Updated: 10/21/2014

Version

Version: 0.1

Last Updated: 06/01/2014

Documentation

Document Type: In Proceedings

Eleftherios Avramidis and Aljoscha Burchardt and Sabine Hunsicker and Maja Popovic and Cindy Tscherwinka and David Vilar Torres and Hans Uszkoreit, The taraXU Corpus of Human-Annotated Machine Translations, http://www.lrec-conf... , pp. 2679-2682 , Ninth International Conference on Language Resources and Evaluation (LREC'14) , 2014

Publisher: European Language Resources Association (ELRA)

ISBN: 978-2-9517408-8-4

Keywords: human evaluation, SMT, Moses, rule-based

Document Language: English

People who looked at this resource also viewed the following:

Resources from the same project

The TaraXÜ Corpus of Human-Annotated Machine Translations - 4rth evaluation round

TaraXÜ corpus round 4

http://taraxu.dfki.de,

http://www.qt21.eu/launchpad