Giter Site home page Giter Site logo

textract-demo's Introduction

AWS Textract Project

New V2 Demo

textract-demo-v2.mp4

This project aims to create a robust application that utilizes AWS Textract to extract information from structured and semi-structured PDF documents, and OpenAI's ChatGPT to interact with the application and analyze the results. These documents may contain both handwritten and printed text.

About AWS Textract

AWS Textract is a service that automatically extracts text and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify, understand, and extract data from forms and tables.

About ChatGPT

ChatGPT is a language model developed by OpenAI. In this application, we use it to interact with users and analyze the results of AWS Textract.

Getting Started

To use this application, follow the steps below:

Prerequisites

You will need an AWS account, OpenAI account, the AWS CLI (Command Line Interface) installed on your local machine, and setup your OpenAI and AWS credentials to authenticate with their services. Here is a guide to set up AWS. For OpenAI setup, please refer to the OpenAI API documentation.

Installing

  1. Install Anaconda on your machine. Visit the Anaconda website for installation instructions.

  2. Create a new conda environment and install the required dependencies by running the following commands:

$ conda env create -f environment.yml
$ conda activate aws-text

Running the Application

  1. Make sure you have activated the aws-text conda environment.

  2. Run the application.py file to start the application. This file sets up a local server that accepts PDF uploads, runs AWS Textract on the uploaded PDFs, then uses ChatGPT to interact with the application and analyze the results.

$ python application.py
  1. Access the application by opening your web browser and navigating to http://localhost:5000.

  2. Upload your PDF files through the web interface. The results of AWS Textract will be displayed on the page, and you can also interact with ChatGPT to analyze the results. You can download the results in JSON, PDF (with bounding boxes), and CSV formats.

View results

The outputs are located in app/results/textract_results, app/results/bounding_box_results, and app/results/table_results.

Built With

Author

textract-demo's People

Contributors

bmitch44 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.