Giter Site home page Giter Site logo

zfxxfeng / cnn_lstm_ctc_ocr_for_icpr Goto Github PK

View Code? Open in Web Editor NEW
64.0 64.0 34.0 2.01 MB

Forked from weinman/cnn_lstm_ctc_ocr for the ICPR MTWI 2018 challenge 1

License: GNU General Public License v3.0

Python 99.17% Makefile 0.83%
cnn-lstm-ctc ocr recognition

cnn_lstm_ctc_ocr_for_icpr's People

Contributors

zfxxfeng avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

cnn_lstm_ctc_ocr_for_icpr's Issues

denseNet

Maybe the model denseNet is not convergent. I think denseNet need some improvement.

validate.py使用问题

请问当初你是如何使用validate.py对整个测试集进行遍历呢?因为我看到输入图像要求维度是(31,?,1)而显然需要对测试数据进行resize,那么请问你用什么方法对测试数据进行resize呢?我直接用了image.resize((image.size[0], 31),最终效果并不好,请问你是用什么方法呢?

crop出来的文本图片不正常

你好,你的生成文本图片的函数,生成的文本图片感觉都是做了镜像?
本来水平文本都镜像了?
这样回影响训练吗?

How many steps do you run when training denseNet on IPCR dataset?

Much thanks to your work!

I haved trained the network for 70k steps and the effect is still bad. As your uploaded model's step is 1405532, I really need a long way to go, right?

Another question is, model.ckpt-1405532 will predict 5530 characters, but the word_dict.py contains 16389 characters. Where's the corresponding word_dict for the demo model?

如何利用jpg文件和txt文件生成tfrecord?

我下载了标注文件,有jpg图片和txt标注,怎么生成tfrecord呢?用mjsynth-tfrecord.py的话,里面需要的annotation_val.txt文件是什么内容,我看后续处理是文件名 xxx,这后面xxx是什么iamge_texts?是干嘛的

target_cut.py 可能有点问题

首先感谢分享.

Image_list = os.listdir(Img_dataset_dir)

Label_list = os.listdir(Label_dataset_dir)
直接读取文件名可能会用顺序不一样的问题, macos python3下以及centos python3下都会有, windows下好像不会

然后处理结果都是镜像翻转正常么...

There is some problems in demo, which some weights in checkpoint not found.

NotFoundError (see above for traceback): Key rnn/bdrnn2/fw/lstm_cell/weights not found in checkpoint
[[Node: save/RestoreV2_39 = RestoreV2[dtypes=[DT_FLOAT], _device="/job:localhost/replica:0/task:0/cpu:0"](_recv_save/Const_0, save/RestoreV2_39/tensor_names, save/RestoreV2_39/shape_and_slices)]]

Can you know how to solve it?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.