CLUE Cross-Language Unit Elicitation Alignments




This corpus was referred to as the PTSTAR Golden Collection in the project's deliverables and reports.

It consists of a set of manual alignments of 400 parallel sentences from the Europarl corpus [1] in four languages (pt, en, es, fr), covering the following language pairs: en-es, en-fr, en-pt, es-fr, pt-es. This work substantially extends the corpus detailed in [2].

