Giter Site home page Giter Site logo

brailledetector-filnal's Introduction

Brailledetector filnal

A CNN approach to detect braille in an image

The Braille detector is a group project for Artificial Intelligence coursework. The project aims to aid accessibility of visually impaired individuals and aid integration into the society.

Braille Detection and Transcription

The project uses convolutional neural network to recognize Braille characters from images and individually translates each braille character.

Dataset

The dataset is for the braille detection is in my google drive which can be found at (https://drive.google.com/drive/folders/1vHuEg5pr3S8fvYu0RCi27C4NsFPrMV-x?usp=sharing) while the transcription dataset can be found in a public kaaggle dataset (https://www.kaggle.com/datasets/shanks0465/braille-character-dataset) The braille detection dataset consist of augmented braille images.

Installation

To run successfully this project, you would need to install the dependencies found in requirements.txt file:

Usage

To use this project, follow these steps:

Web app - Streamlit

  1. Clone the repository to your local machine
  2. Navigate to the project directory
  3. navigate and open main.py
  4. To run the file successfully, install the dependencies in the requirements.txt by entering [pip install -r requirements.txt] in the terminal
  5. Also in the terminal, enter and run the script using [streamlit run main.py]
  6. Ensure the trained Model (brailledetect2.h5) should be in the same folder
  7. running the app, the app redirects to your local browser

Live tracker

  1. As above, ensure installation of the dependencies in the requirements.txt
  2. In a code editor, preferably pycharm, run the script.
  3. If the camera window does not pop up, change the following code: # Starts camera cap = cv2.VideoCapture(0) #<< change 0 to 1 or 2

Models

A VGG16 model for braille detection and Sequential model for transcription which are in the Brai Detection - Copy.ipynb and Braille-character-transcription-cnn.ipynb respectively.

Credits

The Braille character dataset used in this project was obtained from Kaggle and https://github.com/IlyaOvodov/AngelinaDataset. The neural network architecture was inspired by https://github.com/nicknochnack/FaceDetection and https://www.kaggle.com/code/amanrosekaursethi/braille-character-detection-using-cnn.

brailledetector-filnal's People

Contributors

deyem1 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.