
Layrub

Introduction

Layrub provides a runtime data placement strategy for extreme-scale DNN training. It achieves layer-centric memory reuse to reduce memory consumption, enabling extreme-scale DNN models that previously could not run on a single GPU. Layrub is built on top of BVLC Caffe (https://github.com/BVLC/caffe).

Features

  • Intra-layer strategy: exploits memory reuse at the intra-layer level by using the memory space of activation data as the space for gradient data; it works effectively for wide networks (see the sketch after this list).
  • Inter-layer strategy: exploits memory reuse at the inter-layer level; it works effectively for deep networks.
  • Hybrid strategy: combines the intra-layer and inter-layer strategies and works well for most DNNs.
  • Compared to the original Caffe, Layrub cuts memory usage by an average of 58.2% and by up to 98.9%, at the moderate cost of 24.1% longer training time on average.
  • Layrub on Caffe is user-friendly: the changes to the source code are transparent to users, and the C++ interface is used exactly as in Caffe.
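
The following is a minimal, self-contained C++ sketch of the intra-layer idea. It is an illustration only, not Layrub's actual implementation: the Buffer and Blob types and the ReuseDataForDiff helper are hypothetical stand-ins for Caffe's SyncedMemory and Blob.

```cpp
#include <memory>
#include <vector>

// Illustrative stand-in for Caffe's SyncedMemory: one raw buffer.
struct Buffer {
  explicit Buffer(size_t n) : data(n, 0.0f) {}
  std::vector<float> data;
};

// Illustrative stand-in for a Caffe Blob: activations ("data") and
// gradients ("diff") conventionally occupy two separate buffers.
struct Blob {
  std::shared_ptr<Buffer> data;  // activations from the forward pass
  std::shared_ptr<Buffer> diff;  // gradients from the backward pass

  explicit Blob(size_t n) : data(std::make_shared<Buffer>(n)) {}

  // Conventional behavior: allocate a dedicated gradient buffer.
  void AllocateDiff() { diff = std::make_shared<Buffer>(data->data.size()); }

  // Intra-layer reuse: once the backward computation has consumed the
  // activations, gradients can be written into the same buffer instead
  // of a freshly allocated one.
  void ReuseDataForDiff() { diff = data; }
};

int main() {
  Blob act(1 << 20);       // 1M floats of activations
  act.ReuseDataForDiff();  // gradients alias the activation buffer,
                           // halving this blob's peak memory footprint
  // act.diff->data[i] = ...;  // backward pass overwrites activations
  return 0;
}
```

Because a layer's gradient has the same shape as its activation, aliasing the two buffers halves the per-blob footprint; this pays off most in wide networks, where individual layers hold large activations.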

Installation

Please refer to the installation steps for BVLC Caffe (http://caffe.berkeleyvision.org/installation.html).
Note: only the C++ code has been modified and tested in this version.
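
Since the C++ interface is unchanged, training code written for stock Caffe should work as-is on Layrub. Below is a minimal usage sketch against the standard Caffe C++ API; the prototxt and caffemodel file names are placeholders.

```cpp
#include <caffe/caffe.hpp>

int main() {
  caffe::Caffe::set_mode(caffe::Caffe::GPU);

  // Load the network definition exactly as in stock Caffe;
  // Layrub's layer-centric memory reuse is applied transparently.
  caffe::Net<float> net("train_val.prototxt", caffe::TRAIN);
  net.CopyTrainedLayersFrom("weights.caffemodel");  // optional warm start

  float loss = 0.0f;
  net.Forward(&loss);  // forward pass
  net.Backward();      // backward pass
  net.Update();        // apply gradients (a Solver normally scales them first)

  return 0;
}
```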

Reference Paper

Bo Liu, Wenbin Jiang, Hai Jin, Xuanhua Shi, and Yang Ma. Layrub: Layer-centric GPU memory reuse and data migration in extreme-scale deep learning systems. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '18, poster paper), Vienna, Austria, February 24–28, 2018.

Hai Jin, Bo Liu, Wenbin Jiang, Yang Ma, Xuanhua Shi, Bingsheng He, and Shaofeng Zhao. Layer-centric memory reuse and data migration for extreme-scale deep learning on many-core architectures. ACM Transactions on Architecture and Code Optimization 15, 3 (2018), 37:1–37:26.

Support Or Contact

If you have any questions, please contact Yang Ma ([email protected]).
