text-detection-ctpn

Github地址

这是我开源在github上的一个场景文本检测的模型，主要基于CTPN，可以用来检测水平的文本，如身份证之类的。详见github

text detection mainly based on ctpn (connectionist text proposal network). It is implemented in tensorflow. I use id card detect as an example. the origin paper can be found here. Also, the origin repo can be found in here. This repo is mainly based on faster rcnn framework, so there remains tons of useless code. I’m still working on it.

prepare

First, download the pre-trained model of VGG net and put it in data/pretrain/VGG_imagenet.npy. you can download it from google drive.

Second, prepare the training data as referred in paper, or you can download the data I prepared in here. Modify the path and gt_path in prepare_training_data/split_label.py according to your dataset. And run

cd prepare_training_data
python split_label.py

it will generate the prepared data in current folder, and then run

to convert the prepared training data into voc format. It will generate a folder named TEXTVOC. move this folder to data/ and then run

cd ../data
ln -s TEXTVOC VOCdevkit2007

train

Simplely run

you can modify some hyper parameters in ctpn/text.yml, or just used the parameters I set.

demo

put your images in data/demo, the results will be saved in data/results, and run

some results

NOTICE:

all the photos used below are collected from the internet. If it affects you, please contact me to delete them.

场景文本检测，CTPN tensorflow版本text-detection-ctpnpreparetraindemosome results

场景文本检测，CTPN tensorflow版本text-detection-ctpnpreparetraindemosome results

text-detection-ctpn

prepare

train

demo

some results

继续阅读

考证大全 | 证券从业资格考试

敲黑板！2021年证券从业考试考点预测

2021年银行从业考试考情介绍,果断收藏!

证券从业合格证书什么时候打印？有哪些注意事项？

【干货满满】初级银行从业考试《个人理财》重点梳理

2020年经济师考试，难吗？

初级银行从业资格证有什么用？

MBA提前面试纯干货分享

MBA值得学么

吴恩达logistic回归实现

【人工智能行业大师访谈1】吴恩达采访 Geoffery Hinton

深度学习模型分析人类复杂疾病的准确性

【趋高机器视觉】机器视觉技术原理解析及解决方案

解码器用于语义分割：数据依赖的解码可以实现灵活的特征聚合

cs231n斯坦福基于卷积神经网络的CV学习笔记（一）KNN和线性分类器/分类器损失/反向传播一，KNN图像分类算法二，线性分类器三，线性分类器损失四，反向传播五，神经网络

开源按键组件Multi_Button的使用,含测试工程