Giter Site home page Giter Site logo

automated-electronic-health-record-based-detection-of-ards's Introduction

Automated Electronic Health Record Based detection of ARDS

This repository provides the instructions of how to apply the logistic regression model trained on patients hospitalized with acute hypoxic respiratory failure at a single center in January 1, 2016 through December 31, 2017 to your data to automatically detect ARDS.

Getting Started

Prerequisites

Jupyter Notebook https://jupyter.org/install

Python 3 https://www.python.org/downloads/

Apache cTAKES 4.0 https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+4.0+User+Install+Guide

Installing dependencies

Use pip installation to install dependencies from requirements.txt

$ pip install -r requirements.txt

Preparing the datasets

Step 1: Structured Data

  • Create a structured dataset named as "structured_data.csv" with all the variables listed in Structured Data Variable.docx.

  • Variable names have to be exactly the same as listed. Do not carry forward the values unless instructed.

  • Data time needs to be within 10 days since admit date.

Step 2: cTAKES Features

  • Create two cTAKES Features datasets with names "radiology_report_ctakes_features.csv" and "clinical_notes_ctakes_features.csv".

  • To make these two datasets we need to have a radiology report dataset and a clinical notes dataset to start with. Both datasets should have the following columns:

    โ€‹ 'EncounterID','time','NOTE_TEXT','NOTE_ID'

  • Data time needs to be within 10 days since admit date.

  • When these two datasets are ready, follow the instruction in the "process ctakes_publication" notebook to create "radiology_report_ctakes_features.csv" and "clinical_notes_ctakes_features.csv".

Step 3: Final dataset

  • After step1 and step2 are done, run script "Pre-processing data-publication" to create the final datasets "structured_2Htest.csv" and "structured+radiology50+clinical_notes250_2Htest.csv"

Prediction

Run script "Prediction" to get the predicted probabilities of ARDS.

Authors

  • XXX
  • XXX
  • XXX

License

License

Acknowledgments

Acknowledgements

automated-electronic-health-record-based-detection-of-ards's People

Contributors

sjodinglab avatar

Stargazers

 avatar  zach jerzy avatar

Watchers

 avatar

Forkers

zy20030535

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.