Text-Prediction

This program implements a statistical trigram language model with NLTK for text prediction based on the Alice in Wonderland corpus.

Getting started

  1. clone or download this repository
     git clone https://github.com/jadessechan/Text-Prediction.git
  2. run main.py
  3. once prompted by the program, enter a phrase related to the corpus

Demo

Lines 80-86 display n-gram statistics of the corpus and are commented out by default.

Here is a frequency distribution plot of the 30 most common trigrams:
[image: frequency distribution of the top 30 trigrams]
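
For readers without the plot, the same kind of statistic can be sketched in a few lines. This is a minimal illustration, not the code from main.py: it uses the standard library's collections.Counter as a stand-in for an NLTK frequency distribution, and the function name top_trigrams and the sample sentence are made up for the example.

```python
from collections import Counter


def top_trigrams(tokens, n=30):
    """Return the n most frequent trigrams as ((w1, w2, w3), count) pairs."""
    # zip over three staggered views of the token list yields every trigram
    trigrams = zip(tokens, tokens[1:], tokens[2:])
    return Counter(trigrams).most_common(n)


# Hypothetical sample text; the real program reads the Alice in Wonderland corpus
sample = "the cat sat on the mat and the cat sat on the rug".split()

for trigram, count in top_trigrams(sample, n=3):
    print(" ".join(trigram), count)
```

Passing n=30 on the full corpus would give the data behind a plot like the one above.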

Here is an example of the program output:
[image: demo of the running program]

Final output of the demo:

User input: alice said to the
Prediction: alice said to the table, half hoping she might find another (comma was added for readability)
What did Alice want to find again?? The suspense...😖

Implementation

I used NLTK's probability module to store the frequency of each candidate word, which serves as its probability weight:

ConditionalFreqDist()

The program then makes a weighted random choice among the candidates to decide which prediction to append to the given phrase:

random.choices()
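
To make the idea concrete, here is a rough sketch of a trigram model with weighted sampling. It is not the code from main.py. To stay dependency-free it uses a defaultdict of Counter objects in place of NLTK's ConditionalFreqDist, and it assumes the context is the last two words of the phrase. The names build_model and predict_next and the sample tokens are hypothetical.

```python
import random
from collections import Counter, defaultdict


def build_model(tokens):
    """Map each (w1, w2) context to a Counter of the words that follow it."""
    model = defaultdict(Counter)
    for w1, w2, w3 in zip(tokens, tokens[1:], tokens[2:]):
        model[(w1, w2)][w3] += 1
    return model


def predict_next(model, phrase, rng=random):
    """Sample the next word in proportion to its trigram count, or return None."""
    words = phrase.lower().split()
    if len(words) < 2:
        return None  # a trigram model needs two words of context
    followers = model.get(tuple(words[-2:]))
    if not followers:
        return None  # context never seen in the corpus
    candidates = list(followers)
    weights = list(followers.values())
    # random.choices draws with probability proportional to the weights
    return rng.choices(candidates, weights=weights, k=1)[0]


# Hypothetical toy corpus; the real program uses Alice in Wonderland
tokens = ("alice said to the cat and alice said to the queen "
          "and alice said to the cat").split()
model = build_model(tokens)
print(predict_next(model, "alice said to the"))
```

In this toy corpus "cat" follows "to the" twice and "queen" once, so "cat" is drawn about two times out of three. That is the sense in which frequent continuations are favored without being the only possible output.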

The user decides when to stop the program by choosing whether or not to predict the next word.

"Do you want to generate another word? (type 'y' for yes or 'n' for no): "
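
The prompt loop could look roughly like the sketch below. This is an assumption about the control flow, not the actual code in main.py: it predicts one word, prints the extended phrase, then asks whether to continue. run_session, predict and ask are hypothetical names, and the demo replaces input() with scripted answers so it runs without a keyboard.

```python
PROMPT = "Do you want to generate another word? (type 'y' for yes or 'n' for no): "


def run_session(phrase, predict, ask=input):
    """Extend phrase one predicted word at a time until the user stops."""
    while True:
        word = predict(phrase)
        if word is None:
            break  # no prediction available for this context
        phrase = f"{phrase} {word}"
        print(phrase)
        if ask(PROMPT).strip().lower() != "y":
            break
    return phrase


# Demo with a canned predictor and scripted answers instead of real user input
canned = {"the": "table", "table": "half"}
answers = iter(["y", "n"])
result = run_session(
    "alice said to the",
    predict=lambda p: canned.get(p.split()[-1]),
    ask=lambda _prompt: next(answers),
)
```

Leaving ask at its default of input gives the interactive behavior; injecting it as a parameter just makes the loop easy to try out and test.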
