ooooverflow / chinese-ocr Goto Github PK
View Code? Open in Web Editor NEW基于CTPN(tensorflow)+CRNN(pytorch)+CTC的不定长文本检测和识别
基于CTPN(tensorflow)+CRNN(pytorch)+CTC的不定长文本检测和识别
def __getitem__(self, index):
label = [int(x) for x in keys]
请问label是要识别的str,怎么转成int呢?
(w,h) = Data.size
size_h = 32
ratio = 32 / float(h)
size_w = int(w * ratio)
transform = resizeNormalize((size_w,size_h))
请问你是直接用这段代码形成的比例进行的缩放吗a,这样是不是相当于没有缩放
我看作者里的测试图片是身份证,请问训练的时候用的样本是身份证样本吗?还是网上的官方样本呢?
我的环境安装不了百度云盘,谢谢
crnn数据集均为数字,训练时精准度全为零,有训练成功的大佬吗?可以知道一下如何拿博主提供的数据集训练
我看到364万训练集中,英文的单词比较少,请问下模型对英文的识别效果如何
hello,请问你的不定长训练是什么意思呢,直接w不设置吗?求指教
can not open checkpoint.zip, can you reload checkpoint please? Thanks!
change:
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=1.0)
[via]
into :
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.9)
config = tf.ConfigProto(allow_soft_placement=True, gpu_options=gpu_options)
config.gpu_options.allow_growth = True
Expected in: flat namespace
in /Users/*/CHINESE-OCR-master/ctpn/lib/utils/bbox.so
有人知道怎么解决吗
你好,我使用了360万数据集进行crnn训练,训练精度很高,但是验证精度却非常低,模型任何文字都识别不出来,请问这是哪里出了问题,我该如何修改?谢谢
请问博主您是否训练成功,我这里一直在报错,我尝试修改很多次,但是依旧没有解决!
报错信息:
RuntimeError: invalid argument 0: Sizes of tensors must match except in dimension 0. Got 57 and 213 in dimension 3 at /pytorch/aten/src/TH/generic/THTensorMath.cpp:3616
请您指导下,谢谢!
看您的代码,似乎这个数据集连接中包含了中文数据集,可否私发一份下载连接,我的邮箱:[email protected],谢谢
您好,请问可以识别非简体的汉字吗,比如说篆体等等,如果能够识别能请您大体说一下如何做吗,非常感谢了
ImportError:cannot import name 'bbox'
安装还是不成功,请问 要怎样修改?
生成项目“test_gpu.vcxproj”的操作 - 失败。 生成项目“test_cpu.vcxproj”的操作 - 失败。 这2个不影响吧?
warp-ctc是不是只支持linux?不支持win10安装吗? 谢谢
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.