Orwell 1984 Croatian ![Corpus](/site_media/css/sexybuttons/images/icons/silk/database_yellow.png)
hr1984![](/site_media/images/trenner.png)
ID:
319
The Croatian Orwell 1984 is a Croatian contribution to the MULTEXT-East resources, a multilingual dataset for language engineering research and development. This dataset contains linguistically annotated translations of Orwell's novel 1984 in Bulgarian, Czech, English, Estonian, Hungarian, Macedonian, Persian, Polish, Romanian, Serbian, Slovak, Slovene. This corpus adds the Croatian version to the set. The texts in this corpus are lemmatized and MSD-tagged following MTE v4.0 specifications.
People who looked at this resource also viewed the following:
- Original Short-Message Data Collation II in Chinese (named entities)
- PANACEA English-French and English-Greek parallel corpus acquired for Labour Legislation domain
- Mandarin Chinese Speech Recognition Corpus (telephone channel) - digit string (100 people)
- Mandarin Chinese Telephone Speech Recognition Corpus - Stock