Giter Site home page Giter Site logo

redis-developer / rediner Goto Github PK

View Code? Open in Web Editor NEW
2.0 3.0 1.0 122 KB

Named Entity Recognition for Redis

Home Page: https://redis-developer.github.io/rediner/

License: MIT License

Python 97.59% Dockerfile 0.48% Shell 1.93%
nlp redis spacy-nlp named-entity-recognition flask serverless graph property-graph redisgraph

rediner's Introduction

rediner

Named Entity Recognition (NER) analysis in RedisGraph.

Overview

This project contains an example of using the SpaCy NER model to harvest named entities from blog posts or other web pages. The crawler program produces graph structures that can be loaded into RedisGraph for analysis via Cypher Queries.

Demo

A demo is available online. Please be kind!

A Quick Start

  1. Create an environment an load the requirements:

    pip install -r requirements.txt
    python -m spacy download en_core_web_sm
    
  2. Collect some entries:

    mkdir out
    python -m rediner.blog  --same --store --dir out --verbose https://www.milowski.com/
    
  3. Run the SpaCy model:

    python -m rediner --yaml -r ner out > milowski-com-terms.yaml
    
  4. Start RedisGraph:

    docker run -p 6379:6379 redislabs/redisgraph:latest
    
  5. Setup the indexes:

    python demo/setupdb.py milowski.com
    
  6. Ingest the data:

    python -m rediner load -r --graph milowski.com out
    python -m rediner load -r --graph milowski.com milowski-com-terms.yaml
    
  7. Run the demo application:

    cd demo
    python view.py
    
  8. View the application locally at http://localhost:5000/

Once the application is running, you can examine the dataset in various ways. For starters, try loading the graphs with the defaults by clicking on the 'Load' button. This will load the entities that match the minimum counts and show a graph of cooccurrences.

You can:

  • use the "same" checkbox to enable entities that co-occur on the same article.
  • use the "multiple" checkbox to enable entities that co-occur on more than one article.
  • use search to access the full text search. This will display a subgraph of entities from the search results.
  • clicking on a node in the graph will show the articles which contain the entities with a link back to the original resource.

Next steps

You can view more extensive document on the website.

rediner's People

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

bsbodden

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.