lbellonda / confrontapdf Goto Github PK
View Code? Open in Web Editor NEWConfrontaPDF compares PDF files, GUI or command line
ConfrontaPDF compares PDF files, GUI or command line
The comparision does cover words or characters but not layout or linebreaking
Linebreaking even without hyphens should be considered a difference. The line in boldface are different, the hyphen is spotted correctly.
Tried to use the visual difference but that resulted in massive differences on almost every page in a large PDF with more than 1000 pages and very few relevant differences in text and layout.
I guess reason is the use of TextBox from poppler which does not cover layout. The tool pdftotext -layout does provide the differences in layout when I do a diff on the text that can be found.
I would propose a forth mode of comparison "line-by-line" applying the methods from pdftotext.
Pinning the program in the task bar of Windows 7 should show the program icon and not a generic one.
Version: 1.0.0
Top and bottom margin were set using ini file, few the pages are getting excluded from comparison. We had 2 pdfs with 24 pages in one and 25 in other. If compared with top and bottom marked as 0, there we get 24 pages in compare pdf where as if top margin is set to 300 and bottom to 200 with exclude true then only 18 pages are there in compared result pdf.
Hi.
I really admire your work and i want to add some features.
But when i follow your readme steps i cant.
Please can you provide me with more detailed steps (with links of qt , poppler) and espacially how to install poppler and get the header files for qt.
This is my private email : [email protected]
Check optionally fonts and font embedding conditions:
In the XML report, in the info section of the files, write info about used fonts, and insert a new option to exclude them for performance reasons.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.