tf识别非固定长度图片ocr（数字+字母 n位长度可变）- CNN+RNN+CTC-白红宇

tf识别非固定长度图片ocr（数字+字母 n位长度可变）- CNN+RNN+CTC

阅读量：740 次

发布时间：2019-03-22

本文共 593 字，大约阅读时间需要 1 分钟。

安装所需库，确保TensorFlow和OpenCV等依赖已满足。下载并解压数据集，注意数据集体积较大，解压完成可能需要一天时间。

使用bash脚本处理图片列表，生成训练集和验证集：

find ./mnt/ | xargs ls -d | grep jpg > image_list_all.txt  image_list_all.txt | head -n 1000 > image_list.txt

将图片文件作为输入，通过TensorFlow生成tfrecord文件：

python create_synth90k_tfrecord.py --image_dir C:\Users\McKay\PycharmProjects\test8\tfdemo\data --anno_file ./image_list.txt --char_map_json_file ../char_map/char_map.json

训练参数设置：

--data_dir ../data/tfrecords/ --model_dir ./model/ --batch_size 32 --char_map_json_file ../char_map/char_map.json

训练过程中遇到GPU资源限制，目前仅获得了7%的准确率。由于时间和资源限制，决定中断后续训练，等待后续GPU资源再进行训练。

转载地址：http://mckwk.baihongyu.com/

你可能感兴趣的文章