Giter Site home page Giter Site logo

rickardkarl / learningusingprivilegedtimeseries Goto Github PK

View Code? Open in Web Editor NEW
3.0 3.0 1.0 12.38 MB

This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

License: MIT License

Python 2.56% Jupyter Notebook 97.44%

learningusingprivilegedtimeseries's Introduction

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Abstract

We study prediction of future outcomes with supervised models that use privileged information during learning. The privileged information comprises samples of time series observed between the baseline time of prediction and the future outcome; this information is only available at training time which differs from the traditional supervised learning. Our question is when using this privileged data leads to more sample-efficient learning of models that use only baseline data for predictions at test time. We give an algorithm for this setting and prove that when the time series are drawn from a non-stationary Gaussian-linear dynamical system of fixed horizon, learning with privileged information is more efficient than learning without it. On synthetic data, we test the limits of our algorithm and theory, both when our assumptions hold and when they are violated. On three diverse real-world datasets, we show that our approach is generally preferable to classical learning, particularly when data is scarce. Finally, we relate our estimator to a distillation approach both theoretically and empirically.

Requirements

Required libraries found in requirements.txt

Models

Baseline and LUPTS are implemented using sklearn, the code is found in /src/model/

Evaluation

Synthethic

To re-produce experiments, run /notebooks/synthetic.ipynb Necessary experiment code is found in /src/synthetic/

Forecasting Air Quality

To re-produce experiments, run /notebooks/fivecities.ipynb Necessary experiment code is found in /src/fivecities/

The data is found in /data/fivecities/, but can also be downloaded from here.

Modeling Progression of Chronic Disease

Note: For the Alzheimer’s and Multiple myeloma progression modeling tasks, the data is not publicly available, but the code which produced the results is still found in this repository.

Alzheimer's progression modelling

Code is found in /notebooks/ADNI.ipynb and /src/adni/

Multiple myeloma progression modelling

Code is found in /notebooks/mm-prfs.ipynb and /notebooks/mm-tr.ipynb

learningusingprivilegedtimeseries's People

Contributors

rickardkarl avatar martinwillbo avatar zeshanmh avatar

Stargazers

 avatar  avatar Nguyen Trong Nghia avatar

Watchers

 avatar  avatar  avatar

Forkers

valeman

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.