bukalapak / ktpextractor

This service takes a KTP image as input and extracts the data on the KTP as output. It is part of an open source project by the Data Scientists of Bukalapak.

ktpextractor's Introduction

KTPextractor

This is a service that extracts data from a KTP (Indonesian identity card) image. It is part of an open source project by the Data Scientists of Bukalapak. Other open source projects: https://github.com/bukalapak?q=data

Config File

Fill in the configuration in the file kyc_config.py:

gcv_api_key_path = path to the GCV (Google Cloud Vision) API key. To get an API key, see https://cloud.google.com/vision/docs/setup
json_loc = path where the OCR output from GCV is saved
output_loc = path where the extracted KTP data is saved
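As a rough illustration, a filled-in kyc_config.py might look like the sketch below. Only the three variable names come from this README; every value is a placeholder, and whether the scripts expect a directory or a file path (or a particular key file format) is an assumption to check against the code.

```python
# kyc_config.py (illustrative sketch; all values below are placeholders)

# Path to the file holding the Google Cloud Vision (GCV) API key.
# The file name and format here are assumptions.
gcv_api_key_path = "/path/to/gcv_api_key.txt"

# Location where the OCR output from GCV is saved.
json_loc = "/path/to/ocr_output/"

# Location where the extracted KTP data (CSV) is saved.
output_loc = "/path/to/ktp_output/"
```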

OCR Text Extractor

To extract texts from an image (OCR), use the following command:

python ocr_text_extractor.py <image_path>

The OCR output file is saved in json_loc (see the config file).
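For readers unfamiliar with GCV, the sketch below shows one way to request text detection through the Cloud Vision REST endpoint (images:annotate) with an API key. It is not taken from ocr_text_extractor.py, which may work differently; the function names build_ocr_request and run_ocr are hypothetical.

```python
import base64
import json
import urllib.request

# Public REST endpoint for Cloud Vision image annotation.
GCV_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_ocr_request(image_bytes: bytes) -> dict:
    """Build the JSON body for a single TEXT_DETECTION request."""
    return {
        "requests": [
            {
                # The image is sent inline as base64-encoded content.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                "features": [{"type": "TEXT_DETECTION"}],
            }
        ]
    }


def run_ocr(image_path: str, api_key: str) -> dict:
    """Send the image to GCV and return the parsed JSON response.

    Hypothetical helper: requires network access and a valid API key.
    """
    with open(image_path, "rb") as f:
        body = json.dumps(build_ocr_request(f.read())).encode("utf-8")
    req = urllib.request.Request(
        f"{GCV_ENDPOINT}?key={api_key}",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

The request body is built in a separate function so it can be inspected without making a network call.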

KTP Entity Extractor

To extract attributes from the KTP based on the OCR output, use the following command:

python ktp_entity_extractor.py <path of ocr output file>

The extracted KTP data is saved in CSV format in output_loc (see the config file).
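To give a feel for what entity extraction from OCR text involves, here is a minimal sketch that pulls three fields out of plain OCR text with regular expressions and serializes them as CSV. It is not the logic of ktp_entity_extractor.py: the patterns, the output column names, and the sample text (including the made-up NIK) are all assumptions for illustration.

```python
import csv
import io
import re

# Hypothetical patterns for three KTP fields; real OCR output is noisier.
FIELD_PATTERNS = {
    "nik": re.compile(r"NIK\s*:?\s*(\d{16})"),
    "nama": re.compile(r"Nama\s*:?\s*(.+)"),
    "tempat_tgl_lahir": re.compile(r"Tempat/Tgl Lahir\s*:?\s*(.+)"),
}


def extract_fields(ocr_text: str) -> dict:
    """Return the first match of each pattern, or an empty string if absent."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        fields[name] = match.group(1).strip() if match else ""
    return fields


def to_csv(fields: dict) -> str:
    """Serialize one record as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(fields))
    writer.writeheader()
    writer.writerow(fields)
    return buf.getvalue()


# Made-up sample text, not a real KTP.
sample = (
    "NIK : 3171234567890123\n"
    "Nama : BUDI SANTOSO\n"
    "Tempat/Tgl Lahir : JAKARTA, 17-08-1990"
)
record = extract_fields(sample)
print(to_csv(record))
```

Each pattern here captures only the rest of a single line, so a value that wraps onto a second line would be cut off or end up in a neighbouring field.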

KTP Data Extractor

To extract KTP data directly from a KTP image, use the following command:

python KTPextractor_main.py <image_path>

The extracted KTP data is saved in CSV format in output_loc (see the config file).

ktpextractor's People

Contributors

hendrahc, saktipratama

ktpextractor's Issues

Last Name included in Birth Place

Hi, thanks for this great project.

I have been trying it with some KTP images (downloaded from the internet) and found an issue: the last name (or degree) ends up in the birth place when the name value spans more than one line (in this case, a long full name).

Attached are a sample KTP and the result:
ktp4
ktp4_result

Any advice on this?

Many thanks.
