E2E-MLT

Name and owner	MichalBusta/E2E-MLT
Primary language	C++
Languages	Python (6 languages in total)
Platforms	Linux, Mac, Windows
License	MIT License

Created	2018-10-16 21:12:19
Pushed	2025-01-08 17:27:36
Last commit	2022-08-21 01:15:29
Releases	0

Stars	296
Watchers	14
Forks	83
Commits	39
Issues	76
Open issues	31
Pull requests	3
Open pull requests	0
Closed pull requests	0

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Code base for the paper: https://arxiv.org/abs/1801.09919

@inproceedings{buvsta2018e2e,
  title={E2E-MLT-an unconstrained end-to-end method for multi-language scene text},
  author={Bu{\v{s}}ta, Michal and Patel, Yash and Matas, Jiri},
  booktitle={Asian Conference on Computer Vision},
  pages={127--143},
  year={2018},
  organization={Springer}
}

Requirements

python3.x with
opencv-python
pytorch 0.4.1
torchvision
warp-ctc (https://github.com/SeanNaren/warp-ctc/)
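
Before running the demo or training, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository. The import names are assumptions: opencv-python is normally imported as cv2, and the SeanNaren warp-ctc PyTorch binding is assumed to install as warpctc_pytorch. The helper check_requirements is a hypothetical name.

```python
import importlib.util
import sys

# Import names are assumptions: opencv-python is imported as `cv2`, and the
# SeanNaren warp-ctc PyTorch binding is assumed to install as `warpctc_pytorch`.
REQUIRED_MODULES = ("cv2", "torch", "torchvision", "warpctc_pytorch")


def check_requirements(modules=REQUIRED_MODULES):
    """Return a dict mapping each module name to True if it can be located."""
    return {name: importlib.util.find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    print("python", sys.version.split()[0])
    for name, found in check_requirements().items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

The check only locates the modules and does not import them, so it does not verify versions (for example pytorch 0.4.1).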

Pretrained Models

e2e-mlt, e2e-mlt-rctw

wget http://ptak.felk.cvut.cz/public_datasets/SyntText/e2e-mlt.h5

Running Demo

python3 demo.py -model=e2e-mlt.h5

Data

ICDAR MLT Dataset
ICDAR 2015 Dataset
RCTW-17
Synthetic MLT Data (Arabic, Bangla, Chinese, Japanese, Korean, Latin, Hindi)
and ground truth converted to the ICDAR MLT format (see: http://rrc.cvc.uab.es/?ch=8&com=tasks) (Arabic, Bangla, Chinese, Japanese, Korean, Latin, Hindi)
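
To make the ground-truth format more concrete, here is a small parsing sketch. It assumes each line of an ICDAR MLT ground-truth file has the form x1,y1,x2,y2,x3,y3,x4,y4,script,transcription, and that a transcription of ### marks a "don't care" region; check the task page linked above before relying on either assumption. TextBox, parse_mlt_gt_line and is_dont_care are hypothetical names, not functions from this repository.

```python
from typing import List, NamedTuple, Tuple


class TextBox(NamedTuple):
    points: List[Tuple[int, int]]  # four (x, y) corners of the quadrilateral
    script: str                    # e.g. "Latin", "Arabic"
    text: str                      # transcription, may itself contain commas


def parse_mlt_gt_line(line: str) -> TextBox:
    """Parse one ground-truth line in the assumed ICDAR MLT layout."""
    # Drop the line ending and a possible UTF-8 byte order mark.
    cleaned = line.rstrip("\r\n").lstrip("\ufeff")
    # Split at most 9 times so commas inside the transcription are preserved.
    fields = cleaned.split(",", 9)
    if len(fields) != 10:
        raise ValueError(f"expected 10 comma-separated fields, got {len(fields)}")
    coords = [int(float(value)) for value in fields[:8]]
    points = list(zip(coords[0::2], coords[1::2]))
    return TextBox(points=points, script=fields[8], text=fields[9])


def is_dont_care(box: TextBox) -> bool:
    """Assumption: '###' marks regions that should be ignored."""
    return box.text == "###"
```

For example, parse_mlt_gt_line("10,20,110,20,110,60,10,60,Latin,Hello, world") keeps "Hello, world" as a single transcription because only the first nine commas are treated as separators.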

MLT SynthSet

Synthetic text has been generated using Synthetic Data for Text Localisation in Natural Images, with minor changes for Arabic and Bangla script rendering.

What we have found useful:

for generating Arabic Scene Text: https://github.com/mpcabd/python-arabic-reshaper
for generating Bangla Scene Text: PyQt4
having somebody who can read non-Latin scripts: we would like to thank Ali Anas for reviewing the generated Arabic scene text.
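
As an illustration of the Arabic point above: Arabic letters change shape depending on their position in a word, and renderers that draw characters one by one need the text converted to those contextual forms first. The sketch below uses the reshape function from python-arabic-reshaper. How the authors wired it into their SynthText changes is not described here, so this is only an assumed usage. shape_arabic is a hypothetical helper, and the fallback branch exists only so the sketch runs without the package installed.

```python
def shape_arabic(text: str) -> str:
    """Return `text` with Arabic letters in their contextual forms.

    If the arabic_reshaper package is not installed, the text is returned
    unchanged (a fallback for this sketch only, not a real substitute).
    """
    try:
        import arabic_reshaper  # pip install arabic-reshaper
    except ImportError:
        return text
    return arabic_reshaper.reshape(text)


if __name__ == "__main__":
    # Right-to-left reordering for display is a separate step, not shown here.
    print(shape_arabic("سلام"))
```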

Training

python3 train.py -train_list=sample_train_data/MLT/trainMLT.txt -batch_size=8 -num_readers=5 -debug=0 -input_size=512 -ocr_batch_size=256 -ocr_feed_list=sample_train_data/MLT_CROPS/gt.txt
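
The command above passes options in a single-dash -name=value style. As a rough sketch of how such flags can be declared with argparse (this is not the repository's train.py; the flag names are copied from the command, while the types and defaults are assumptions taken from the values shown):

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare flags mirroring the training command; types are assumptions."""
    parser = argparse.ArgumentParser(description="Sketch of train.py-style flags")
    parser.add_argument("-train_list", default="sample_train_data/MLT/trainMLT.txt")
    parser.add_argument("-ocr_feed_list", default="sample_train_data/MLT_CROPS/gt.txt")
    parser.add_argument("-batch_size", type=int, default=8)
    parser.add_argument("-ocr_batch_size", type=int, default=256)
    parser.add_argument("-num_readers", type=int, default=5)
    parser.add_argument("-input_size", type=int, default=512)
    parser.add_argument("-debug", type=int, default=0)
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(vars(args))
```

argparse accepts both -batch_size=8 and -batch_size 8 for options declared this way, so either spelling of the command would parse to the same values.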

Acknowledgments

Code borrows from EAST and DeepTextSpotter
