Giter Site home page Giter Site logo

speech_bot's Introduction

Speech_Bot

an attempt at creating a python program to imitate human speach

The Original Idea for the Algorithm

Read in textual data. create a graph where each node is a word and there are directed edges from that word to any word which has comes after it in any of the textual data. Start with a seed word. To generate the next word read in the current chain of words. Assign weights to the edges to the next words based on how many time the word preceded the next word. When calculating weights also include more heavily weighted those words which came after the same n length chain, with longer n corresponding to higher weighting. With some max n chosen. Statistically chose which edge to follow based on the edge weights.

for the ngram section i got the data from http://www.ngrams.info/download_coca.asp. I am using the free 2-grams.

Things to work on

  • [x]Take in sentance history for more precise edge weights
  • []Consider using the NLTK To check sentance validity
  • []Pull text from facebook/gmail (working on it)
  • []better random weight selection algorithm
  • []Create graph class for simpler future work
  • []Include external graph library
  • []Vizualizing word network for debugging/cuz it would be cool
  • []Add puntuation(in progress)
  • []How to include beginings/ends of sentances???
  • []Add ability to export word graph to JSON for interactive d3 visual
  • [] include n-gram analysis in text analysis instead of multilevel?
  • a python autocorrect module to fix sentances?

gold readings have not desisted From the inevitable place weary of evil to comply either with offers to leader of the loan etc The Scop s handwork I swung as ferocious as ealles and his sorrow that were With weapon Him who complished it The men do it The Project Gutenberg 1 Though high helmet till he at the vapor The horn anon sang the king and Beowulf s son she injured his worth and reported to the weeping and my journey Wild beasts and his hand and beaker battle the daughter named it 40 The Helmingish lady then sadly in

speech_bot's People

Contributors

jacksonhenry3 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.