ChineseAddress_OCR

Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。

Github stars Tracking Chart

ChineseAddress_OCR

环境不可控场景下拍照文档地址文字识别

Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction.

This is a project of the 2018 Deecamp 25th group (DRPRG). Thanks to my team members!
这个是 2018年 Deecamp 25组 (深度受限抠图小组) 的项目,非常非常的感谢每一位队友!

Our Demo: https://www.bilibili.com/video/av30081208
Our Wechat Program (微信小程序): OCRdeecamp

Reference Paper: Detecting Text in Natural Image with Connectionist Text Proposal Network
Reference Code: https://github.com/YCG09/chinese_ocr (Thanks to Yang Chenguang)

Method

Text Detection : CTPN
Text Recognition: CTC+DenseNet
Address Judgment: Light GBM or textgrocery
Address Correction: Fuzzy matching based on address library

About Code

demo_final.py
You can simply run demo_final.py for inference. Input a picture and output the Chinese address string.
run_flask.py
Communication between server and Wechat program with flask
ocr_whole.py
Text detection with CTPN, and text recognition with Densenet
stupid_addrs_rev.py
Address correction using fuzzy-matching based on address library
ctpn
If you want to know more details about CTPN codes, please check https://github.com/eragonruan/text-detection-ctpn
wechat_program
Some files of Wechat program (微信小程序的一些文件)

Results

In our dataset, the accuracy of exactly correct is 83%, the accuracy of edit distance less than 3 is 97%.
Our program has high accuracy at identifying very fuzzy multi-line addresses.

If you want to know more details, please read ChineseAddress_OCR_Report.pdf(中文).

Main metrics

Overview
Name With OwnerWalleclipse/ChineseAddress_OCR
Primary LanguagePython
Program languagePython (Language Count: 5)
Platform
License:
所有者活动
Created At2018-10-05 06:27:30
Pushed At2020-01-23 05:36:55
Last Commit At2020-01-23 13:36:54
Release Count0
用户参与
Stargazers Count347
Watchers Count15
Fork Count131
Commits Count32
Has Issues Enabled
Issues Count23
Issue Open Count2
Pull Requests Count0
Pull Requests Open Count0
Pull Requests Close Count0
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private