Abstract
In this thesis I will present an investigation of Natural Language Toolkit (NLTK) and its support for Norwegian Natural Language Processing (NLP). I display what NLTK has to offer for Norwegian NLP, then move on to evaluate and improving some of the offers NLTK has for Norwegian. I will evaluate and improve NLTK’s sentences tokenizer and word tokenizer, I will also compare the tokenizers to other available options for Norwegian tokenization. The improvements will be committed to NLTK for possible integration. Then I will integrate a Norwegian corpus with a corpus reader to NLTK.