-
Organization
- There are two directories, each correspond to a simulation in the experiments section of the paper: sim_class_1 (linear decision boundary) and sim_class_2 (quadratic decision boundary).
- Each directory has three sub directories: code, data, and qsub.
- Directory 'code' has files of all the R source code that was used in the analysis.
- Directory 'data' has (if any) simulated data that was used in the analysis. This directory may be empty or absent.
- Directory 'qsub' has SGE files (.q) that were used to submit jobs on a SGE cluster.
-
Files
- 'sim_data_corr.R' contains the code to simulate and partition the data for classification in first simulation.
- 'sim_data_nolin_corr.R' contains the code to simulate and partition the data for classification in second simulation.
- 'sim_data_nonlin_corr_reg.R' contains the code to simulate and partition the data for regression in second simulation.
- 'analyze_result_class.R' contains the code for analyzing the results for classification of GP with the SE kernel, logistic regression and ridge regression and lasso regression with and without SE kernel-based covariates, SVM and KRR with spectrum and boundrange kernels, and competing methods and making tables.
- 'analyze_result_reg.R' contains the code for analyzing the results for regression of GP with the SE kernel, logistic regression and ridge regression and lasso regression with and without SE kernel-based covariates, SVM and KRR with spectrum and boundrange kernels, and competing methods and making tables.
- 'gp_logistic.R' contains the code for the GP classification with the SE kernel.
- 'submit.R' contains the code for the R code for submitting a job on the cluster. The files in 'qsub' directory use this file for running simulations.
-
Citation: If you use the code, then please cite the following paper:
- Sanvesh Srivastava, Zongyi Xu, Yunyi Li, Nick Street, and Stephanie Gilbertson-White (2020+). Gaussian Process Regression and Classification using International Disease Classification Codes as Covariates. Submitted to Biometrics.
-
Contact: Please email Sanvesh Srivastava ([email protected]) if you have any questions related to the code.
-
Acknowledgment Office of Naval Research (ONR-BAA N000141812741) and the National Science Foundation (DMS-1854667/1854662).
-
NOTE: We are unable to provide the code for real data analysis due to privacy concerns.
ehr-data-analysis's Introduction
ehr-data-analysis's People
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.