Giter Site home page Giter Site logo

dynamic-link-prediction's Introduction

Link Prediction: Reddit posts Suggestion

Dataset

The Reddit dataset is a Public graph dataset from Reddit posts made in the month of September, 2014. The node label in this case is the community, or “subreddit”, that a post belongs to. 50 large communities have been sampled to build a post-to-post graph, connecting posts if the same user comments on both. In total this dataset contains 232,965 posts with an average degree of 492. The first 20 days are used for training and the remaining days for testing (with 30% used for validation). For features, off-the-shelf 300-dimensional GloVe CommonCrawl word vectors are used.

The Downstream Task: Link prediction

Train a Graph Neural Network model to predict whether there are links between users and subreddit

Details about the project

  • Reimplement the MLP-mixer - based Graph model called GraphMixer
  • Implement the Downstream task - Link prediction on the Graph model
  • Process data and feed data to model for training, validating, and testing
  • Using DVC tool to build a simple pipelines for reproducibility and version dataset
  • Build a CI/CD workflow with Github Actions

Train the model

Train the model by using this command:

python train_link_prediction.py dataset_name reddit --model_name GraphMixer

Or use the reproduce the DVC pipeline that is set up in dvc.yaml

dvc repro

References:

dynamic-link-prediction's People

Contributors

phamthanhtu310702 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.