Comments (5)

kiran90429 avatar kiran90429 commented on August 17, 2024

I'm getting the below result.

Highlights

  • Page 1: (XXX: missing text!)

  • Page 1: (XXX: missing text!)

  • Page 1: (XXX: missing text!)

from pdfannots.

0xabu avatar 0xabu commented on August 17, 2024

This indicates that the PDF text extraction failed to find any characters within the bounds of the highlight. I would suggest trying the pdf2txt.py sample program that's part of the pdfjam library, to see whether it produces any output for the relevant text. If not, then it's possibly a bug or limitation in pdfjam and should be reported there. If you do get text for this PDF from pdf2txt, then I can look into why we can't extract it if you upload the relevant PDF file.
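To illustrate how a highlight can end up with no text, here is a minimal sketch of matching characters to a highlight rectangle. It is not pdfannots' actual code: the `Char` type, the `center_inside` rule (a character counts if its center point lies in the rectangle), and the function names are all assumptions made for illustration. Only the placeholder string is taken from the output reported above.

```python
from dataclasses import dataclass
from typing import List, Tuple

# (x0, y0, x1, y1) in page coordinates; a hypothetical representation.
Box = Tuple[float, float, float, float]


@dataclass
class Char:
    """One extracted character and its bounding box (hypothetical type)."""
    text: str
    box: Box


def center_inside(char_box: Box, region: Box) -> bool:
    """Return True if the center of char_box lies within region."""
    cx = (char_box[0] + char_box[2]) / 2
    cy = (char_box[1] + char_box[3]) / 2
    return region[0] <= cx <= region[2] and region[1] <= cy <= region[3]


def text_in_highlight(chars: List[Char], highlight: Box) -> str:
    """Join the characters that fall inside the highlight rectangle.

    If the extractor produced no characters in that area (for example a
    scanned page with no text layer), fall back to the placeholder.
    """
    hits = [c.text for c in chars if center_inside(c.box, highlight)]
    return "".join(hits) if hits else "(XXX: missing text!)"


# Two characters sitting side by side on a page.
page_chars = [
    Char("H", (10.0, 10.0, 15.0, 20.0)),
    Char("i", (15.0, 10.0, 20.0, 20.0)),
]

print(text_in_highlight(page_chars, (9.0, 9.0, 21.0, 21.0)))
print(text_in_highlight(page_chars, (100.0, 100.0, 120.0, 110.0)))
print(text_in_highlight([], (9.0, 9.0, 21.0, 21.0)))
```

Under this sketch, the placeholder appears either when the extractor returns no characters at all for the page or when the characters it returns lie outside the highlight rectangle, which is why checking the raw text extraction first helps narrow down the cause.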

kiran90429 avatar kiran90429 commented on August 17, 2024

Sure, thanks for the update. I will try that and report back here.

0xabu avatar 0xabu commented on August 17, 2024

Not sure why I wrote pdfjam above; I meant pdfminer!

Did you repro this with pdf2txt? Can I close this?

kiran90429 avatar kiran90429 commented on August 17, 2024
