Giter Site home page Giter Site logo

pylangdetector's Introduction

pyLangDetector

super simple language detector written in python using https://pypi.org/project/langdetect/.

  • counting of *.txt files per language
  • printing out filename with language comma separated to easly use as csv
  • supports 55 languages

Supported languages

hi, hr, hu, id, it, ja, kn, ko, lt, lv, mk, ml, mr, ne, nl, no, pa, pl,
pt, ro, ru, sk, sl, so, sq, sv, sw, ta, te, th, tl, tr, uk, ur, vi, zh-cn, zh-tw```

Execution

count file per language in a folder

python3.7 detectLanguage.py count {directory}
en : 171
de : 12
fr : 19

count file per language in a folder and print out language, filename tuple

python2 detectLanguage.py print {directory}
en : 171
en,7 August 2003_0_0.txt
en,20 January 2016_0_0.txt
en,1 August 2002_0_0.txt
en,7 February 1998_0_0.txt
en,15 June 1998_0_0.txt
en,21 November 2015_0_0.txt
en,24 April 2000_0_0.txt
en,18 March 2001_0_0.txt
en,19 July 2004_0_0.txt
en,21 March 2011_0_0.txt
en,23 July 2012_0_0.txt
en,19 June 2018_0_0.txt
en,25 April 2002_0_0.txt
en,26 May 2012_0_0.txt
en,9 August 2013_0_0.txt
en,1 February 2011_0_0.txt
...

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.