Python scripts to process german wiki dump. This is to generate a german text corpus for unsupervised word representation learning. Especially for training an BILM: https://github.com/allenai/bilm-tf
More about this see the wiki: https://eniak.de/it/training_of_german_word_embedding_for_nlp