Giter Site home page Giter Site logo

Comments (2)

lovelyzzc avatar lovelyzzc commented on April 23, 2024 1

我前面的问题都已经解决了

loss:在datasets/DetDataSet.py 72行
这一行针对讯飞的数据存在bug
讯飞的数据没有text,所以每一个gt都被跳过了,导致最后训练部分loss为0

其次在datasets/det_modules/random_crop_data.py 57行附近
data['text_polys'] = np.float32(text_polys_crop)
会出现错误
这个错误是因为原本的这一部分针对的是EAST模型的数据来做的,在EAST模型中始终使用4个点来做标记,但是在讯飞的数据里面,点是自适应的,存在大于4个点的情况,直接使用np.float32()就会存在错误,

在这里我的改法是在前面5行
text_polys_crop.append(poly)处,改为
text_polys_crop.append(np.float32(poly))

data['text_polys'] = np.float32(text_polys_crop) 处改为
data['text_polys'] = np.array(text_polys_crop)

至于第二个问题是因为我模型并没有训练好
很感谢作者提供的模型,让我对论文能够理解更深,
谢谢谢谢

from pytorchocr.

novioleo avatar novioleo commented on April 23, 2024

glad to hear that. @lovelyzzc

from pytorchocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.