Giter Site home page Giter Site logo

sanzharid / speak-extension Goto Github PK

View Code? Open in Web Editor NEW

This project forked from charliegerard/speak-extension

0.0 0.0 0.0 10.31 MB

[WIP] - Chrome extension with speech-to-text functionality to make your presentations more accessible with live captions

JavaScript 32.47% CSS 19.82% HTML 47.71%

speak-extension's Introduction

Speak - Chrome extension

Speak is a Chrome extension using the Speech Recognition API to provide a speech-to-text interface as an overlay on a website. The main goal is to allow people to make their presentations more accessible for the deaf and hard-of-hearing.

Important note:

As any speech recognition system, it is not working perfectly but hopefully it can still be somehow useful and will probably improve over time.

screenshot of the extension's interface

Current status:

So far, seems to be working on:

How to install:

Dev version:

  • Download or clone this repo.
  • Go to the Chrome extension interface.
  • Activate Developer mode (top-right corner).
  • Click on the "Load unpacked" button in the menu bar.
  • Select the folder where you cloned the project.

You should now see the icon of the extension in your browser extension bar.

How to use:

  • Click on the extension in your browser's menu bar.

  • Select your spoken language.

  • Click on the "Start" button.

  • Allow access to your microphone if you haven't already.

  • Detection:

    • If the website you are currently trying to use this extension on is blocking access to AudioCapture, you will see an error message displayed.
    • If you do not see an error message, you should be able to start speaking and see an overlay appear on the page with the caption.
  • Click on the extension logo again in the menu bar to close the extension.

  • To stop the tracking, open the extension and click on the "Stop" button.

If the Speech Recognition API does not recognise what you are trying to say, it will default to English (US).

Example:

Speak demo

Watch a video example

Languages supported:

(I found this list in a tutorial so not 100% sure it's accurate)

Languages Region Language code
Afrikaans Default af-ZA
Bahasa Indonesia Default id-ID
Bahasa Melayu Default ms-MY
Català Default ca-ES
Čeština Default cs-CZ
Dansk Default da-DK
Deutsch Default de-DE
English Australia en-AU
English Canada en-CA
English India en-IN
English New Zealand en-NZ
English South Africa en-ZA
English United Kingdom en-GB
English United States en-US
Español Argentina es-AR
Español Bolivia es-BO
Español Chile es-CL
Español Colombia es-CO
Español Costa Rica es-CR
Español Ecuador es-EC
Español El Salvador es-SV
Español España es-ES
Español Estados Unidos es-US
Español Guatemala es-GT
Español Honduras es-HN
Español México es-MX
Español Nicaragua es-NI
Español Panamá es-PA
Español Paraguay es-PY
Español Perú es-PE
Español Puerto Rico es-PR
Español República Dominicana es-DO
Español Uruguay es-UY
Español Venezuela es-VE
Euskara Default eu-ES
Filipino Default fil-PH
Français Default fr-FR
Galego Default gl-ES
Hrvatski Default hr_HR
IsiZulu Default zu-ZA
Íslenska Default is-IS
Italiano Italia it-IT
Lietuvių Default lt-LT
Magyar Default hu-HU
Nederlands Default nl-NL
Norsk bokmål Default nb-NO
Polski Default pl-PL
Português Brasil pt-BR
Português Portugal pt-PT
Română Portugal ro-RO
Slovenščina Default sl-SI
Slovenčina Default sk-SK
Suomi Default fi-FI
Svenska Default sv-SE
Tiếng Việt Default vi-VN
Türkçe Default tr-TR
Ελληνικά Default el-GR
български Default bg-BG
Pусский Default ru-RU
Српски Default sr-RS
Українська Default uk-UA
한국어 Default ko-KR
中文 普通话 (**大陆) cmn-Hans-CN
中文 普通话 (香港) cmn-Hans-HK
中文 中文 (台灣) cmn-Hant-TW
中文 粵語 (香港) yue-Hant-HK
日本語 Default ja-JP
हिन्दी Default hi-IN
ภาษาไทย Default th-TH

Tech stack:

  • HTML/CSS/JS
  • Speech Recognition API

Current known issues:

  • The recognition is not always very accurate
  • Sometimes the tracking stops unexpectedly

To do:

  • Fix UI so only displays a certain number of words so it doesn't take too much space on the screen.
  • Support multi-language?
  • Allow translation in other languages than english with Google translate? (Might be too slow...)
  • Redesign

speak-extension's People

Contributors

charliegerard avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.