Comments (3)
This turned out to be quite an easy fix. Please try the latest code or attached NuGet package and let me know if it resolves your problem.
from pdfpig.
I can confirm I now get text content. Thank you for the quick fix.
from pdfpig.
This is because the content of this document is using PostScript form XObjects. This won't actually make a difference to how the content should be consumed through the API it's just something I haven't implemented yet because this is only the 2nd ever time I've seen content encoded in this way: https://github.com/UglyToad/PdfPig/blob/master/src/UglyToad.PdfPig.Tests/Integration/SinglePageFormContentIText1Tests.cs#L37
I'll try to get round to adding support for this before the end of the week.
from pdfpig.
Related Issues (20)
- Unable to parse pdf due to font issue
- UnsupervisedReadingOrder orders 2 blocks on the same row out of order HOT 2
- PDF linearization
- When a get textblock from a PDF vary depending on the operating system HOT 6
- New Nuget package release for PDF Pig HOT 4
- XYLeaf.GetLines collect lines not robust enough HOT 18
- Extracting lines HOT 3
- Copy existing page to PdfDocumentBuilder without it's text HOT 1
- TryGetForm does not support field partial names with a "." HOT 5
- Support p7m signed PDFs
- Why GlyphRectangle bounding box not correct for letter g?
- Errors in examples on "readme.md" ? HOT 1
- Allow reading orders dectors to support any class that has a bounding box/PdfRectangle HOT 1
- File exception: UglyToad.PdfPig.Core.PdfDocumentFormatException' was thrown. HOT 4
- ArgumentOutOfRangeException when reading a document HOT 7
- Add image to PDF with different coordinate origin
- Using DuplicateOverlappingTextProcessor in HOcrTextExporter
- Populate data catalog info without reading the rest of the pdf
- Read document structure and apply PDF accessibility tag? HOT 1
- "Object reference not set to an instance of an object." HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pdfpig.