INTERA Corpus - the Greek-English part
The Greek-English part of the INTERA corpus: a written, domain-specific (law, education, environment, health and tourism) parallel subcorpus of 4 million words (2 million words per language) in TMX format.
Distribution
Availability: Available - Restricted Use
Licence: CC-BY
Restrictions: Attribution
Distribution Access/Medium: Downloadable
Attribution Details: The INTERA Corpus - Greek-English part of ILSP/RC Athena licensed under CC-BY as accessed via META-SHARE
Contact Person
Bilingual text corpus
Languages
Greek, Modern (1453-) (2,000,000 words)
English (2,000,000 words)
Linguality
Linguality type: Bilingual
Multi-linguality type: Parallel
Text Format
Character encoding: UTF-8
Domains
law
education
environment
tourism
health
Modalities
Annotation
Alignment
StandOff: False
Segmentation level: Sentence
Format: application/x-tmx+xml
Standard practices conformance: TMX
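Because the corpus is distributed as sentence-aligned TMX, the aligned pairs can be read with a standard XML parser. The following is a minimal Python sketch and not part of the resource's documentation. The function name `read_tmx_pairs` and the language codes `el` and `en` are assumptions, since this record does not state which codes the files use.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="el", tgt_lang="en"):
    """Yield (source, target) segment pairs from a TMX file.

    `source` is a path or a binary file-like object. The default
    language codes are assumptions; adjust them to the actual files.
    """
    tree = ET.parse(source)
    for tu in tree.getroot().iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # TMX 1.4 uses xml:lang; older versions use a plain lang attribute.
            lang = tuv.get(XML_LANG) or tuv.get("lang") or ""
            seg = tuv.find("seg")
            if seg is not None:
                # Reduce codes such as "EL-GR" to the primary subtag "el".
                key = lang.lower().split("-")[0]
                # itertext() also collects text inside inline markup elements.
                segs[key] = "".join(seg.itertext()).strip()
        # Skip translation units that lack either language.
        if src_lang in segs and tgt_lang in segs:
            yield segs[src_lang], segs[tgt_lang]
```

A typical call would be `for el, en in read_tmx_pairs("corpus.tmx"): ...`, where `corpus.tmx` is a hypothetical file name. The sketch loads the whole file into memory, which should be acceptable for a corpus of this size.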
Creation
Creation mode details: web crawling; manual selection; semi-automatic conversion to the desired formats
Creation mode: Mixed
Resource Creation
Creation lasted: 01/01/2003 - 12/31/2004
Funding
Project: Integrated European language data Repository Area (INTERA - e-content EDC-22076 INTERA / 27924)
Funding Type: EU Funds
Funder: eContent
Project duration: 01/01/2003 - 12/31/2004
Metadata
Created: 02/02/2012
Last Updated: 11/26/2015
Usage
Foreseen Use: NLP Applications
Use NLP Specific: Machine Translation
Actual Use: NLP Applications
Use NLP Specific: Terminology Extraction
Relation
Related Resource: INTERA corpus
Relation Type: isPartOf
Documentation
Document Type: In Proceedings
Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Voula Giouli et al.
Building parallel corpora for eContent professionals.
COLING 2004, 2004.
Book Title: Proceedings of COLING 2004
Document Type: In Proceedings
Maria Gavrilidou, Penny Labropoulou, Stelios Piperidis et al.
Language resources production models: the case of INTERA multilingual corpus and terminology.
5th International Conference on Language Resources and Evaluation (LREC-2006), 2006.
Book Title: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC-2006)
Document Type: In Proceedings
Maria Gavrilidou, Penny Labropoulou, Monica Monachini, Stelios Piperidis and Claudia Soria.
Building Multilingual Terminological Resources.
RANLP 2005 International Workshop on Language and Speech Infrastructure for Information Access in the Balkan Countries, 2005.
Book Title: Proceedings of the RANLP 2005 International Workshop on Language and Speech Infrastructure for Information Access in the Balkan Countries
Document Type: Tech Report
Maria Gavrilidou, Voula Giouli, Elina Desipri, Penny Labropoulou, Monica Monachini et al.
D5.2 - Report on the multilingual resources production.
http://www.elda.org/...
2004.