mobile-semantic-segmentation

Real-Time Semantic Segmentation in Mobile device

Github星跟踪图

Real-Time Semantic Segmentation in Mobile device

This project is an example project of semantic segmentation for mobile real-time app.

The architecture is inspired by MobileNetV2 and U-Net.

LFW, Labeled Faces in the Wild, is used as a Dataset.

The goal of this project is to detect hair segments with reasonable accuracy and speed in mobile device. Currently, it achieves 0.89 IoU.

About speed vs accuracy, more details are available at my post.

Example of predicted image.

Example application

  • iOS
  • Android (TODO)

Requirements

  • PyTorch 0.4
  • CoreML for iOS app.

About Model

At this time, there is only one model in this repository, MobileUNet.py. As a typical U-Net architecture, it has encoder and decoder parts, which consist of depthwise conv blocks proposed by MobileNets.

Input image is encoded to 1/32 size, and then decoded to 1/2. Finally, it scores the results and make it to original size.

Steps to training

Data Preparation

Data is available at LFW. To get mask images, refer issue #11 for more. After you got images and masks, put the images of faces and masks as shown below.

data/
  raw/
    images/
      0001.jpg
      0002.jpg
    masks/
      0001.ppm
      0002.ppm

Training

If you use 224 x 224 as input size, pre-trained weight of MobileNetV2 is available. Download it from A PyTorch implementation of MobileNetV2 and put weight file under weights directory.

python train_unet.py \
  --img_size=224 \
  --pre_trained='weights/mobilenet_v2.pth.tar'

If you use other input sizes, the model will be trained from scratch.

python train_unet.py --img_size=192

Dice coefficient is used as a loss function.

Pretrained model, Input size, IoU, Download, ----, ----, ---, 224, 0.89, Google Drive, ## Converting

As the purpose of this project is to make model run in mobile device, this repository contains some scripts to convert models for iOS and Android.

TBD

  • Report speed vs accuracy in mobile device.
  • Convert pytorch to Android using TesorFlow Light

主要指标

概览
名称与所有者hb-chen/echo-web
主编程语言Go
编程语言Python (语言数: 4)
平台
许可证MIT License
所有者活动
创建于2016-03-09 11:00:30
推送于2019-05-29 01:14:34
最后一次提交2019-05-29 09:08:16
发布数1
最新版本名称v3.3.10 (发布于 )
第一版名称v3.3.10 (发布于 )
用户参与
星数415
关注者数26
派生数98
提交数169
已启用问题?
问题数15
打开的问题数1
拉请求数0
打开的拉请求数0
关闭的拉请求数0
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?