Giter Site home page Giter Site logo

Comments (9)

dengfenglai321 avatar dengfenglai321 commented on April 25, 2024

使用的是绝对路径。图片读取,label读取是正确的。。。,

from pytorchocr.

novioleo avatar novioleo commented on April 25, 2024

麻烦查看下你的卷积层中哪个环节数据错了。你这个大概率就是图片加载错误,或者其他类型的错误。简而言之输入到网络中的图大概率有nan

from pytorchocr.

dengfenglai321 avatar dengfenglai321 commented on April 25, 2024

麻烦查看下你的卷积层中哪个环节数据错了。你这个大概率就是图片加载错误,或者其他类型的错误。简而言之输入到网络中的图大概率有nan

数据集加载没有问题,此外,换了数据集,换成IIIT5K数据集,训练准确率也是一直都是000000000

from pytorchocr.

novioleo avatar novioleo commented on April 25, 2024

那你能debug看下是哪个环节不正常的吗?你这样说我们没法判断。 @cendelian

from pytorchocr.

WenmuZhou avatar WenmuZhou commented on April 25, 2024

建议先使用数据数据集看看,IIIT5K数据集太小,欠拟合也正常

from pytorchocr.

dengfenglai321 avatar dengfenglai321 commented on April 25, 2024

建议先使用数据数据集看看,IIIT5K数据集太小,欠拟合也正常

我打印输出如下:

class RecMetric:
    def __init__(self, converter):
        """
        文本识别相关指标计算类

        :param converter: 用于label转换的转换器
        """
        self.converter = converter

    def __call__(self, predictions, labels):
        n_correct = 0
        norm_edit_dis = 0.0
        predictions = predictions.softmax(dim=2).detach().cpu().numpy()
        # print('prediction is {}'.format(predictions))
        preds_str = self.converter.decode(predictions)
        print('preds_str is {}'.format(preds_str))

训练一开始 pred_str有数据,过了两三次就输出都是空
image

我的digit.txt 一共有62个字符(IIIT5K 62个字符)如下:
image

from pytorchocr.

dengfenglai321 avatar dengfenglai321 commented on April 25, 2024

那你能debug看下是哪个环节不正常的吗?你这样说我们没法判断。 @cendelian

大佬,rec_train.py是不是有bug?。。
梯度清零可以在模型推理前吗?

我把 # 清零梯度及反向传播
optimizer.zero_grad()
移至
loss_dict['loss'].backward()
torch.nn.utils.clip_grad_norm_(net.parameters(), 5)

训练就不是准确率一直为0了,,,,
image

from pytorchocr.

Aionrichman avatar Aionrichman commented on April 25, 2024

那你能debug看下是哪个环节不正常的吗?你这样说我们没法判断。 @cendelian

大佬,rec_train.py是不是有bug?。。
梯度清零可以在模型推理前吗?

我把 # 清零梯度及反向传播
optimizer.zero_grad()
移至
loss_dict['loss'].backward()
torch.nn.utils.clip_grad_norm_(net.parameters(), 5)

训练就不是准确率一直为0了,,,,
image

请问移动后代码是怎样的?可以说详细点吗

from pytorchocr.

dengfenglai321 avatar dengfenglai321 commented on April 25, 2024

那你能debug看下是哪个环节不正常的吗?你这样说我们没法判断。 @cendelian

大佬,rec_train.py是不是有bug?。。
梯度清零可以在模型推理前吗?
我把 # 清零梯度及反向传播
optimizer.zero_grad()
移至
loss_dict['loss'].backward()
torch.nn.utils.clip_grad_norm_(net.parameters(), 5)

训练就不是准确率一直为0了,,,,
image

请问移动后代码是怎样的?可以说详细点吗
我发现跟这个没有关系。。。。。训练几轮准确率就有变化,,,,准确率很高,但是评估的时候很不准。。。。。。。。。

from pytorchocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.