PELCRA EN Lemmatizer
PELCRA EN Lemmatizer is a lemma dictionary derived from the British National Corpus (BNC) for the Java-based Morfologik stemming library (see http://morfologik.blogspot.com/). It lists the unique words that appear in the BNC together with their lemmas and the BNC tags that encode part-of-speech information (see http://www.natcorp.ox.ac.uk/docs/gramtag.html). Both the bncLemmatizer.dict and bncLemmatizer.info files are required for the tool to run. Documentation on using the lemmatizer is available at http://pelcra.pl/res/en_lemmatizer.