WMT 2015 News Crawl
This data set consists of text crawled from online news, with the html stripped out and sentences shuffled. The source data are crawled from online news sites and carry the respective licensing conditions. English, German, Czech plus variable guest languages. 2015 - http://www.statmt.org/wmt15/training-monolingual-news-2014.v2.tgz
People who looked at this resource also viewed the following: