Giter Site home page Giter Site logo

hephaex / mnn Goto Github PK

View Code? Open in Web Editor NEW

This project forked from alibaba/mnn

0.0 1.0 0.0 62.45 MB

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

CMake 0.84% Ruby 0.03% Shell 0.29% C++ 79.72% C 6.20% Python 3.72% Objective-C 0.06% Objective-C++ 2.60% Assembly 3.86% Metal 2.26% GLSL 0.39% PowerShell 0.03% Batchfile 0.01%

mnn's Introduction

MNN

中文版本

Intro

MNN is a highly efficient and lightweight deep learning framework. It supports inference and training of deep learning models, and has industry leading performance for inference and training on-device. At present, MNN has been integrated in more than 20 apps of Alibaba Inc, such as Taobao, Tmall, Youku, Dingtalk, Xianyu and etc., covering more than 70 usage scenarios such as live broadcast, short video capture, search recommendation, product searching by image, interactive marketing, equity distribution, security risk control. In addition, MNN is also used on embedded devices, such as IoT.

The design principles and performance data of MNN has been published in an MLSys 2020 paper here. Please cite MNN in your publications if it helps your research:

@inproceedings{alibaba2020mnn,
  author = {Jiang, Xiaotang and Wang, Huan and Chen, Yiliu and Wu, Ziqi and Wang, Lichuan and Zou, Bin and Yang, Yafeng and Cui, Zongyang and Cai, Yu and Yu, Tianhang and Lv, Chengfei and Wu, Zhihua},
  title = {MNN: A Universal and Efficient Inference Engine},
  booktitle = {MLSys},
  year = {2020}
}

Documentation

MNN's docs are in placed in Yuque docs here.

Key Features

High performance

  • Implements core computing with lots of optimized assembly code to make full use of the ARM CPU.
  • For iOS, GPU acceleration (Metal) can be turned on, which is faster than Apple's native CoreML.
  • For Android, OpenCL, Vulkan, and OpenGL are available and deep tuned for mainstream GPUs (Adreno and Mali).
  • Convolution and transposition convolution algorithms are efficient and stable. The Winograd convolution algorithm is widely used to better symmetric convolutions such as 3x3 -> 7x7.
  • Twice speed increase for the new architecture ARM v8.2 with FP16 half-precision calculation support.

Lightweight

  • Optimized for devices, no dependencies, can be easily deployed to mobile devices and a variety of embedded devices.
  • iOS platform: static library size for armv7+arm64 platforms is about 5MB, size increase of linked executables is about 620KB, and metallib file is about 600KB.
  • Android platform: core so size is about 400KB, OpenCL so is about 400KB, Vulkan so is about 400KB.

Versatility

  • Supports Tensorflow, Caffe, ONNX, and supports common neural networks such as CNN, RNN, GAN.
  • MNN model converter supports 149 Tensorflow OPs, 58 TFLite OPs, 47 Caffe OPs and 74 ONNX OPs; Number of OPs by different MNN hardware backends: 111 for CPU, 6 for ARM V8.2, 55 for Metal, 43 for OpenCL, and 32 for Vulkan.
  • Supports iOS 8.0+, Android 4.3+ and embedded devices with POSIX interface.
  • Supports hybrid computing on multiple devices. Currently supports CPU and GPU.

Ease of use

  • Efficient image processing module, speeding up affine transform and color space transform without libyuv or opencv.
  • Provides callbacks throughout the workflow to extract data or control the execution precisely.
  • Provides options for selecting inference branch and paralleling branches on CPU and GPU.
  • (BETA) MNN Python API helps ML engineers to easily use MNN to build a model, train it and quantize it, without dipping their toes in C++ code.

Architecture

architecture

MNN can be divided into two parts: Converter and Interpreter.

Converter consists of Frontends and Graph Optimize. The former is responsible for supporting different training frameworks. MNN currently supports Tensorflow, Tensorflow Lite, Caffe and ONNX (PyTorch/MXNet); the latter optimizes graphs by operator fusion, operator substitution, and layout adjustment.

Interpreter consists of Engine and Backends. The former is responsible for the loading of the model and the scheduling of the calculation graph; the latter includes the memory allocation and the Op implementation under each computing device. In Engine and Backends, MNN applies a variety of optimization schemes, including applying Winograd algorithm in convolution and deconvolution, applying Strassen algorithm in matrix multiplication, low-precision calculation, Neon optimization, hand-written assembly, multi-thread optimization, memory reuse, heterogeneous computing, etc.

How to Discuss and Get Help From MNN Community

Scan the following QR codes to join Dingtalk discussion group. The group discussions are predominantly Chinese. But we welcome and will help English speakers.

Group #1 (Full)

Group #2:

License

Apache 2.0

Acknowledgement

MNN participants: Taobao Technology Department, Search Engineering Team, DAMO Team, Youku and other Alibaba Group employees.

MNN refers to the following projects:

mnn's People

Contributors

mnnteam avatar jxt1234 avatar li-qing avatar naville avatar proydakov avatar interfish avatar daquexian avatar zzz197 avatar krayzemli avatar yyfcc17 avatar chrisyooh avatar guanmoyu avatar howave avatar muare avatar stanleywang8888 avatar czy2014hust avatar nihui avatar zjd1988 avatar codingboo avatar chosungmann avatar sugaryou avatar sunbohong avatar hush-alibaba avatar lldong avatar zhijl avatar yisongsong avatar twmht avatar smallt-tao avatar qunluo avatar maybeshewill-cv avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.