Giter Site home page Giter Site logo

brain-facens / fieldvision-ai Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 128.4 MB

API for processing text on invoices, with the aim of identifying relevant fields on an invoice and optimizing bonus or validation systems.

License: Apache License 2.0

Python 92.47% Dockerfile 3.64% Shell 3.89%
paddlepaddle python text-classification devops ocds ocr

fieldvision-ai's Introduction

FieldVision AI

API_demo.mp4

API for processing text on invoices, with the aim of identifying relevant fields on an invoice and optimizing bonus or validation systems. Making life easier for logisticians, merchants and managers, the application has an interface that captures images from the webcam, processes the image using OCR and provides a visualization of the results obtained.

Docs

Visit our wiki

First Steps

This project was developed to run a web interface, where the user will have access to the image capture system and the data processed from that image. Follow the steps below to use this project, the stable version is in the main branch.

Requirements

The following requirements should ideally be met for proper operation:

  • Ubuntu 20.04 (Original development environment, but compatible with 18.04 and 22.04)

  • Docker Engine or Docker Desktop

  • Python 3.8

  • Git

  • Anaconda/Miniconda

  • Create a virtual environment for the project if you want to work without conda;

    conda create --name fielvision python=3.8 --channel https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
    
    # Activate the environment
    conda activate fielvision
    

Warning! We recommend that you use the Docker container developed for this project, as the application is stable. This avoids the risk of incorrect installations, path errors and library versions.


Installation

Local:

# Activate the environment
conda activate fielvision

# Clone repository
git clone https://github.com/brain-facens/FieldVision-AI.git

# Install requirements
cd FieldVision-AI/
pip install -r requirements.txt

Docker:

# Pull image
docker pull brain20/ocr-notas

Usage

Local:

# Applicable filter of up to 3 words 
# python src/field_vision_API/main.py <fist, second, third>

# Run API
python src/field_vision_API/main.py

Docker:

# Running the container with the application
docker run -it --rm -p 8080:8080 brain20/ocr-notas

Warning! Running the container will start the API, which is the interface between OCR processing and the end user. The API is documented in a Swagger, where you can test it.


Demo

API_demo_2.mp4

🤝 Collaborators

We would like to thank the following people who contributed to this project:

Foto do Natanael Vitorino no GitHub
Natanael Vitorino
Foto do Natanael Vitorino no GitHub
Lucas Oliveira
Foto do Pedro Gabriel no GitHub
Pedro Gabriel

📝 License

This project is under license. See the file LICENSE for more details.


fieldvision-ai's People

Contributors

lucaslco avatar nata-vito avatar pedroogaab avatar

fieldvision-ai's Issues

Documentação

Realizar a atualização da documentação do código e aplicação de padrões de qualidade de software.

  • Atualizar o README
  • Docstrings
  • Padrão de código
  • Review

Validação da inserção de string para filtragem

@pedroogaab e @LucasLCO Ao testar o código, me deparei com um erro de implementação por usuário. Este erro é simplismente enviar a imagem pelo método POST e não declarar nenhuma palavra como filtro pelo método PUT, resultando assim no erro de objeto não interativo. Abaixo demonstro os passos para replicar o bug.

POST da imagem para processamento
image

PUT do filtro
image

Console da API
image
image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.