Giter Site home page Giter Site logo

Comments (5)

qiaoliang6 avatar qiaoliang6 commented on September 2, 2024

公开出来的版本在后处理的时候确实没有使用gpma的分割结果。主要是考虑到:
论文中也提到gpma的分割结果只是用来辅助判断“空白单元格”是否应合并或分割开,但是在实际的表格识别场景中,空单元格的分开与否对于表格理解的意义不是很大,整体对于评测最终的性能提升也比较有限,在推理阶段加入这部分反而会额外增加一些计算开销。但是这个模块在训练阶段的加入可以使模型收敛的效果更好,因此在训练阶段是加入了全局分割的分支。
如果想看gpma的预测mask效果,可以在推理的代码中进行相应的修改即可。

from davar-lab-ocr.

rcx986635 avatar rcx986635 commented on September 2, 2024

@qiaoliang6 感谢回复。明白,我理解是英文的空白单元格其实没啥用,但是中文文档中空白单元格其实还挺重要的,特别是对于某一列只有空白单元格的情况。如果我想利用gpma的mask来 生成空白单元格的bbox大概改哪里可以呢

from davar-lab-ocr.

qiaoliang6 avatar qiaoliang6 commented on September 2, 2024

@qiaoliang6 感谢回复。明白,我理解是英文的空白单元格其实没啥用,但是中文文档中空白单元格其实还挺重要的,特别是对于某一列只有空白单元格的情况。如果我想利用gpma的mask来 生成空白单元格的bbox大概改哪里可以呢

在lgpma.py的主流程文件lgpma.py中, L162-L167中是有将分割结果打包一起存放在results中的,在后处理的时候把这部分取出来使用就可以。

不过我个人感觉单凭分割学习到的结果其实准确率提升有限,因为空格子里面没有明显的像素差异,而且标注起来都会很有歧义。我觉得反而可以试试一些基于虚拟表格线的方法。

from davar-lab-ocr.

rcx986635 avatar rcx986635 commented on September 2, 2024

@qiaoliang6 我把gpma的mask写出去看了确实在空白单元格的地方有一点但是较弱。另外能具体讲下虚拟表格线的做法和标注吗,我搜了下没找到相关参考。

from davar-lab-ocr.

qiaoliang6 avatar qiaoliang6 commented on September 2, 2024

@qiaoliang6 我把gpma的mask写出去看了确实在空白单元格的地方有一点但是较弱。另外能具体讲下虚拟表格线的做法和标注吗,我搜了下没找到相关参考。

类似于腾讯的方案,或者是参考这篇文章.

from davar-lab-ocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.