parkitny / german_transliterate Goto Github PK
View Code? Open in Web Editor NEWThis project forked from repodiac/german_transliterate
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
License: Creative Commons Attribution 4.0 International