Giter Site home page Giter Site logo

deeperlib's Introduction

DeepER - Deep Entity Resolution

Travis David

A web data integration tool, A novel framework to overcome limitations, Easy for configuration, Fully functional, Smooth interface.

which aims to find pairs of records that describe the same entity between a local database and a hidden database and has many applications in data enrichment and data cleaning.

API Support

DeepER is ready for the following API:

  • DBLP(DataBase systems and Logic Programming)
  • YELP(Yelp Fusion API)
  • AMiner(arnetminer)

Custom

implement a subclass of deeper.api.simapi and pass it to deeper.core.smartcrawl and you would integrate a new api to collect more data.

Documentation

Fantastic documentation is available at https://sfu-db.github.io/deeperlib/

Requirements

  • pqdict>=1.0.0
  • requests>=2.18.4
  • simplejson>=3.11.1
  • rauth>=0.7.3

Requests officially supports Python 2.7.13, and runs great on PyPy.

Installation and Update

pip install deeperlib
pip install --upgrade deeperlib

Changelog

v0.2a

  • 2017/09/19 support Windows-32bit/64bit, Linux-32bit/64bit, MacOs-64bit, csv and pickle input

v0.1a

  • 2017/09/14 deeper's birthday

Team

  • Jiannan Wang, Assistant Professor at Simon Fraser University
  • Eugene Wu, Assistant Professor at Columbia University
  • Ryan Shea, Research Associate at Simon Fraser University
  • Pei Wang, Ph.D. Student at Simon Fraser University
  • Yongjun He, Undergraduate Student at Nanjing University

Discussing

Maintainer email
Yongjun He [email protected]

deeperlib's People

Contributors

togethergenai avatar peiwangdb avatar jnwang avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.