This repository contains python and R implementation of the algorithms proposed in "XGBLC: An Improved Survival Prediction Model Based on XGBoost".
If you find our method is useful, please cite our paper:Baoshan Ma, Ge Yan, Bingjie Chai, Xiaoyu Hou. XGBLC: An Improved Survival Prediction Model Based on XGBoost.Bioinformatics, 2021, https://doi.org/10.1093/bioinformatics/btab675. Contact email: [email protected] (Ge Yan) and [email protected] (Baoshan Ma).
Python version=3.8
R version=4.0.3
survivalsvm version=0.0.5
glmnet version=4.0-2
xgboost version=1.2.0.1
randomForestSRC version=2.9.3
The data used in this research are collected from The Cancer Genome Atlas (TCGA) project and that are publicly available at https://portal.gdc.cancer.gov.
The program is divided into three sections saved in this repository.
-
Data preprocessing: preprocessed data can be used to test the models, the programs can directly process the data downloaded by TCGA.
-
Other_machine_learning_models: several popular survival analysis algorithms are compared with XGBLC. The files in the repository can be used to construct different predictive models to analyze the patients' survival of 20 cancer datasets.
-
XGBLC_model: survival prediction model based on XGBLC was presented to analyze the patients' survival of 20 cancer datasets patients.