captcha-python-test

--

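Inspired by the series 《验证码破解技术四部曲》 ("Captcha Cracking Techniques in Four Parts", https://github.com/nladuo), this project explores techniques for captcha recognition, including OpenCV, Tesseract, and machine learning algorithms (kNN and SVM), and ports the original author's algorithms to Python.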

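The hard part of captcha recognition is handling the interference in the image. Once the interference is removed correctly and the characters are segmented, machine learning or an artificial neural network can reach a fairly high recognition rate. The image processing for complex captchas still needs improvement.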

--

1. Dependencies

  • opencv2
  • tesseract, pytesseract
  • numpy
  • libsvm

2. tesseract

Simple captchas can be recognized directly with pytesseract.
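As a rough illustration of the direct approach, the sketch below wraps a single pytesseract call. The function names, the digit whitelist, and the choice of page segmentation mode 7 (treat the image as one text line) are assumptions made for this example and are not taken from the repository.

```python
def build_tesseract_config(whitelist="0123456789", psm=7):
    """Build a Tesseract config string: one text line, restricted character set."""
    return f"--psm {psm} -c tessedit_char_whitelist={whitelist}"


def recognize_simple(path, whitelist="0123456789"):
    """OCR a clean captcha image and return the recognized text (hypothetical helper)."""
    # Imported lazily so this module still loads where Tesseract is not installed.
    from PIL import Image
    import pytesseract

    img = Image.open(path).convert("L")  # convert to grayscale
    text = pytesseract.image_to_string(img, config=build_tesseract_config(whitelist))
    return "".join(text.split())  # drop spaces and trailing newlines
```

Restricting the character set is optional; it tends to help when the captcha alphabet is known in advance.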

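3. csdn1

Digits-only captchas with little interference. After simple background removal, binarization, and thresholding, the characters are recognized with the kNN algorithm.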
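The binarize-then-kNN idea can be sketched with NumPy alone. This is a minimal illustration, not the repository's code: the fixed threshold of 128, the Euclidean distance, and the function names are all assumptions, and the project itself lists OpenCV as a dependency for this kind of processing.

```python
import numpy as np


def binarize(gray, threshold=128):
    """Return a 0/1 array: 1 where a pixel is darker than `threshold` (ink)."""
    return (np.asarray(gray) < threshold).astype(np.uint8)


def knn_predict(train_x, train_y, sample, k=3):
    """Classify `sample` by majority vote among its k nearest training vectors."""
    train_x = np.asarray(train_x, dtype=np.float64)
    sample = np.asarray(sample, dtype=np.float64).ravel()
    dists = np.linalg.norm(train_x - sample, axis=1)  # Euclidean distance per row
    nearest = np.argsort(dists, kind="stable")[:k]    # indices of the k closest
    labels, counts = np.unique(np.asarray(train_y)[nearest], return_counts=True)
    return labels[np.argmax(counts)]                  # most frequent label wins
```

Each training row is one flattened, binarized character image, so every glyph must be cut to the same size before it is flattened.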

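4. csdn2

Letters and digits, with background interference and slight distortion in character position. After background removal, binarization, and thresholding, the characters are recognized with the kNN algorithm. Compared with csdn1, the main addition is a glyph normalization step; after that, the pipeline differs little from csdn1.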
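One plausible form of the normalization step is to crop each glyph to its bounding box and resample it to a fixed grid, so that small shifts in position no longer change the feature vector. The sketch below assumes that interpretation; the function name, the 8x8 default, and the nearest-neighbour resampling are illustrative choices, not the repository's.

```python
import numpy as np


def normalize_glyph(binary, size=(8, 8)):
    """Crop a 0/1 glyph to its bounding box and resample it to `size` (rows, cols)."""
    binary = np.asarray(binary)
    rows = np.flatnonzero(binary.any(axis=1))  # rows that contain ink
    cols = np.flatnonzero(binary.any(axis=0))  # columns that contain ink
    if rows.size == 0:                         # blank input: nothing to crop
        return np.zeros(size, dtype=binary.dtype)
    crop = binary[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]
    # Nearest-neighbour resampling: map each output cell back to a source cell.
    r_idx = np.arange(size[0]) * crop.shape[0] // size[0]
    c_idx = np.arange(size[1]) * crop.shape[1] // size[1]
    return crop[np.ix_(r_idx, c_idx)]
```

Two copies of the same glyph at different offsets produce the same output, which is what lets the csdn1-style kNN step be reused unchanged.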

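5. weibo.cn

Captchas with heavy background interference (color blotches, interference lines, noise, and so on) and distorted characters. The key to recognition is removing the interference and extracting the characters. As long as the interference is removed well and enough sample data is used for SVM (support vector machine) training, the recognition rate is quite high (>90%).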
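The two preprocessing steps named above, removing interference and extracting characters, might look roughly like the following on an already binarized image. This is a simplified stand-in under stated assumptions: small connected blobs are treated as noise, and characters are assumed to be separated by empty columns. The function names and the `min_area` default are hypothetical, and the SVM training itself is not shown.

```python
from collections import deque

import numpy as np


def remove_small_components(binary, min_area=4):
    """Erase 8-connected ink blobs smaller than `min_area` pixels."""
    binary = np.asarray(binary)
    out = binary.copy()
    seen = np.zeros(binary.shape, dtype=bool)
    h, w = binary.shape
    for r0 in range(h):
        for c0 in range(w):
            if not binary[r0, c0] or seen[r0, c0]:
                continue
            # Breadth-first search collects one connected blob.
            blob, queue = [], deque([(r0, c0)])
            seen[r0, c0] = True
            while queue:
                r, c = queue.popleft()
                blob.append((r, c))
                for dr in (-1, 0, 1):
                    for dc in (-1, 0, 1):
                        nr, nc = r + dr, c + dc
                        if (0 <= nr < h and 0 <= nc < w
                                and binary[nr, nc] and not seen[nr, nc]):
                            seen[nr, nc] = True
                            queue.append((nr, nc))
            if len(blob) < min_area:  # too small to be a character stroke
                for r, c in blob:
                    out[r, c] = 0
    return out


def split_columns(binary):
    """Return (start, end) column ranges of ink separated by empty columns."""
    has_ink = np.asarray(binary).any(axis=0)
    spans, start = [], None
    for x, ink in enumerate(has_ink):
        if ink and start is None:
            start = x                  # a character begins
        elif not ink and start is not None:
            spans.append((start, x))   # a character ends (end is exclusive)
            start = None
    if start is not None:
        spans.append((start, len(has_ink)))
    return spans
```

Each column range can then be cropped, flattened, and passed to the classifier as one feature vector. Interference lines that touch the characters will survive this filter and need a stronger method.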

Contributors: hanc00l
