paddle-mobile

This project aims to make it simple to deploy deep learning on mobile and embedded devices, with low complexity and high speed. Its former name was mobile-deep-learning.


Chinese Version

Paddle Lite

Documentation Status
License

Paddle Lite is an updated version of Paddle-Mobile, an open-source deep learning framework designed to make it easy to perform inference on mobile, embedded, and IoT devices. It is compatible with PaddlePaddle and with pre-trained models from other sources.

For tutorials, please see the PaddleLite Documentation.

Key Features

Light Weight

On mobile devices, the execution module can be deployed without third-party libraries, because our execution module and analysis module are decoupled.

The dynamic libraries provided by Paddle Lite, which include 80 operators and 85 kernels, take up only 800 KB on ARMv7 and 1.3 MB on ARMv8.

Paddle Lite enables immediate inference without extra optimization.
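
As a minimal sketch of such deployment, the snippet below runs inference through the Paddle Lite C++ light API (MobileConfig); the model file name and the input shape are placeholders for illustration, not part of this document.

```cpp
#include <vector>
#include "paddle_api.h"  // Paddle Lite light API header

using namespace paddle::lite_api;

int main() {
  // Load a model already optimized into the naive-buffer format
  // ("mobilenet_v1.nb" is a placeholder name for illustration).
  MobileConfig config;
  config.set_model_from_file("mobilenet_v1.nb");

  // Create a predictor; only the execution module is involved on device.
  auto predictor = CreatePaddlePredictor<MobileConfig>(config);

  // Fill the input tensor (shape assumed to be 1x3x224x224 for this sketch).
  auto input = predictor->GetInput(0);
  input->Resize({1, 3, 224, 224});
  auto* data = input->mutable_data<float>();
  for (int i = 0; i < 1 * 3 * 224 * 224; ++i) data[i] = 1.0f;

  // Run inference and read the output tensor.
  predictor->Run();
  auto output = predictor->GetOutput(0);
  const float* out_data = output->data<float>();
  (void)out_data;
  return 0;
}
```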

High Performance

Paddle Lite enables device-optimized kernels, maximizing ARM CPU performance.

It also supports INT8 quantization with the PaddleSlim model compression tools, reducing model size and improving performance.

On Huawei NPU and FPGA, the performance is also boosted.

The latest benchmark results are available at benchmark.

High Compatibility

Hardware compatibility: Paddle Lite supports a variety of hardware, including ARM CPU, Mali GPU, Adreno GPU, Huawei NPU, and FPGA. In the near future, we will also support AI chips from Cambricon and Bitmain.

Model compatibility: The ops of Paddle Lite are fully compatible with those of PaddlePaddle. The accuracy and performance of 18 models (mostly CV and OCR models) and 85 operators have been validated. In the future, we will support more models.

Framework compatibility: In addition to models trained with PaddlePaddle, models trained with Caffe and TensorFlow can also be converted for use with Paddle Lite via X2Paddle. In the future, we will also support models in the ONNX format.

Architecture

Paddle Lite is designed to support a wide range of hardware and devices. It enables mixed execution of a single model across multiple devices, optimization at various phases, and lightweight deployment on devices.

[Architecture diagram]

As shown in the figure above, the analysis phase includes a machine IR module and enables optimizations such as op fusion and redundant computation pruning. The execution phase only involves kernel execution, so it can be deployed on its own to keep on-device deployment as lightweight as possible.
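
To illustrate this decoupling, the sketch below runs the analysis phase offline with the full C++ API (CxxConfig) and saves an optimized naive-buffer model that the light API shown earlier can then execute on device; the model paths and the single ARM Place are assumptions made for this example.

```cpp
#include <vector>
#include "paddle_api.h"

using namespace paddle::lite_api;

int main() {
  // Analysis phase: the full API loads the original PaddlePaddle model,
  // applies optimizations such as op fusion and redundant computation
  // pruning, and selects kernels for the requested targets
  // ("./mobilenet_v1" is a placeholder model directory).
  CxxConfig config;
  config.set_model_dir("./mobilenet_v1");
  config.set_valid_places({Place{TARGET(kARM), PRECISION(kFloat)}});

  auto predictor = CreatePaddlePredictor<CxxConfig>(config);

  // Persist the optimized model in the naive-buffer format so that, on
  // device, only the lightweight execution module is needed to run it.
  predictor->SaveOptimizedModel("./mobilenet_v1_opt",
                                LiteModelType::kNaiveBuffer);
  return 0;
}
```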

Key Info about the Update

The earlier Paddle-Mobile was designed to be compatible with PaddlePaddle and multiple hardware platforms, including ARM CPU, Mali GPU, Adreno GPU, FPGA, ARM-Linux, and Apple's GPU Metal. Within Baidu, Inc., many product lines have been using Paddle-Mobile. For more details, please see: mobile/README.

As an update of Paddle-Mobile, Paddle Lite has incorporated many of the older capabilities into the new architecture. For the time being, the code of Paddle-Mobile will be kept under the directory mobile/ until the transfer to Paddle Lite is complete.

For Apple's GPU Metal and web front-end inference, please see ./metal and ./web. These two modules will continue to be developed and maintained.

Special Thanks

Paddle Lite has referenced the following open-source projects:

  • ARM compute library
  • Anakin. The optimizations under Anakin have been incorporated into Paddle Lite, so there will not be any further updates of Anakin. As another high-performance inference project under PaddlePaddle, Anakin has been forward-looking and helpful in the making of Paddle Lite.

Feedback and Community Support

  • Questions, reports, and suggestions are welcome through GitHub Issues!
  • Forum: Opinions and questions are welcome at our PaddlePaddle Forum
  • WeChat Official Account: PaddlePaddle
  • QQ Group Chat: 696965088

Key Metrics

Overview
  • Name & owner: PaddlePaddle/Paddle-Lite
  • Primary programming language: C++
  • Languages: CMake (language count: 12)
  • Platform:
  • License: Apache License 2.0

Owner Activity
  • Created: 2017-09-20 11:41:42
  • Last pushed: 2025-05-22 12:07:25
  • Last commit:
  • Releases: 39
  • Latest release: v2.14-rc (released )
  • First release: v0.9 (released )

User Engagement
  • Stars: 7.1k
  • Watchers: 333
  • Forks: 1.6k
  • Commits: 7.5k
  • Issues enabled?
  • Issues: 2536
  • Open issues: 60
  • Pull requests: 6398
  • Open pull requests: 16
  • Closed pull requests: 1706

Project Settings
  • Wiki enabled?
  • Archived?
  • Is a fork?
  • Locked?
  • Is a mirror?
  • Is private?