Giter Site home page Giter Site logo

decaptcha's Introduction

Convolutional neural network for captcha recognizing

The project goal is to recognize captcha images with use of conv net. For network creation and trainning Tensorflow library was used.

Project is a playground for testing different architectures, training techniques, activation functions and optimizers.

Data

Each image has 57x300 dim, but while loading it is padded with 0 to 64x304. It is better size for further processing by conv net.

The image label is simply file name.

Trainning labes

Captchas' words has differnet lengts, starting from 3 chars up to 20. In order to have fixed vector size the labels were encoded as fixed size vectors. We assume that label contains max 20 characters, each character could be '0...9A...Za...z_' char 0 has an index 0 char A has an index 10 etc. The vector contains only 0 and 1, each char position in word is encoded by 63 continous vector positions, index 0 encodes occurence of char '0' at firs postion, index 63 encodes occurence of char '0' at second position in the word

eg. word = 'at' looks like

vec[36]=1 vec[118]=1

because, at first position we have char 'a' so vec[0:9]=0 it is reserved for digits vec[10:35] is reserved for big latin letters vec[36:61] is reserved for small latin letters, firs in alphabet is 'a' so it is at 36 position vec[62] - reserved for '_'

character at second place 't' vec[63:72] - digits vec[73:98] - big letters vec[99:124] - small letters, 't' is 19 letter in alphabet so 99+19=118 vec[125] - '_'

Network architecure

The network is 6 layer Convolutional network:

  • 3x3 conv - 1-> 32 (filter depth)
  • 3x3 conv - 32-> 32
  • max pool
  • dropout
  • 3x3 conv - 32-> 64 (filter depth)
  • 3x3 conv - 64-> 64
  • max pool
  • dropout
  • 3x3 conv - 64-> 64 (filter depth)
  • max pool
  • dropout
  • fully connected 1024
  • output - 20*63 (20 positions times 63 different chars )

decaptcha's People

Contributors

ksopyla avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.