Giter Site home page Giter Site logo

deeptype's Introduction

DeepType

Deep Learning Approach to Identifying Breast Cancer Subtypes Using High-Dimensional Genomic Data

Code Organization

This software contains these codes:

  • DeepType.py
  • data.py
  • eval.py
  • flags.py
  • model.py
  • training.py
  • utils.py

Requirements

  • python
  • tensorflow

Implement and Activate Tensorflow Environment under Conda

  • Implement: conda create -n tensorflow_env tensorflow

  • Activation: conda activate tensorflow_env

Use the software

1. Data format: filename.mat file

2. Variables:

Data: D*N numerical matrix. Each row is a gene, and each column is a sample. The genes should be ranked in the descending order by variances across samples.

targets: N*1 numerical vector. The ith element denotes the class that the ith sample belongs to.

3. Set parameters in flags.py:

NUM_GENES_1: the number of input genes.

NUM_CLUSTERS: the number of clusters K.

NUM_HIDDEN: the number of hidden layers.

NUM_NODES: numerical vector, the numbers of nodes in the hidden layers.

NUM_CLASSES: the number of unique classes of samples.

NUM_TRAIN_SIZE: the number of samples in the training set.

NUM_VALIDATION_SIZE: the number of samples in the validation set.

NUM_TEST_SIZE: the number of samples in the test set.

NUM_SAMPLE_SIZE: the number of samples in the whole dataset.

NUM_BATCH_SIZE: batch size.

NUM_LEARNING_RATE: learning rate.

NUM_SUPERVISED_BATCHES: the number of training steps in the supervised initialization.

NUM_TRAIN_BATCHES: the number of training steps in each epoch.

LAMBDA: sparsity penalty coefficient.

ALPHA: K-means loss coefficient.

DATA_DIR: Directory to put the training data.

RESULT_DIR: Directory to put the results.

4. Run the program

python DeepType.py

5. Data available

Due to the file size limit of Github, the breast cancer dataset is available at https://drive.google.com/file/d/1ao1zu3DS8GkYF-tHxpQ-1ev2psxXL-fx/view?usp=sharing

deeptype's People

Contributors

runpuchen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.