Giter Site home page Giter Site logo

gucasbrg / pdf2htmlex Goto Github PK

View Code? Open in Web Editor NEW

This project forked from coolwanglu/pdf2htmlex

0.0 2.0 0.0 134.07 MB

Convert PDF to HTML without losing text or format.

Home Page: http://coolwanglu.github.com/pdf2htmlEX/

License: Other

CMake 1.69% Python 4.80% Shell 0.40% CSS 1.86% JavaScript 5.66% C++ 50.66% C 2.23% TeX 0.14% HTML 32.57%

pdf2htmlex's Introduction

# pdf2htmlEX

一图胜千言
A beautiful demo is worth a thousand words

  • Bible de Genève, 1564 (fonts and typography): HTML / PDF
  • Cheat Sheet (math formulas): HTML / PDF
  • Scientific Paper (text and figures): HTML / PDF
  • Full Circle Magazine (read while downloading): HTML / PDF
  • Git Manual (CJK support): HTML / PDF

pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies. Academic papers with lots of formulas and figures? Magazines with complicated layouts? No problem!

pdf2htmlEX is also an online publishing tool which is flexible for many different use cases.

Learn more about who and why should use pdf2htmlEX.

Features

  • Native HTML text with precise font and location.
  • Flexible output: all-in-one HTML or on demand page loading (needs JavaScript).
  • Moderate file size, sometimes even smaller than PDF.
  • Supporting links, outlines (bookmarks), printing, SVG background, Type 3 fonts and more...

Compare to others

Wiki Portals

Get in Touch

Get quick answers for common questions:

Don't miss the latest development news:

Discuss with the developers and the users of pdf2htmlEX

Chat with the main author: 王璐 (Lu Wang)

  • [email protected]
  • @coolwanglu
  • ‼️Questions about pdf2htmlEX? Use the mailling list instead.‼️
  • Accepting messages in 🇨🇳🇺🇸🇬🇧🇯🇵 (languages).

Want to help without coding? Thank you!

LICENSE

pdf2htmlEX, as a whole package, is licensed under GPLv3+. Some resource files are released with relaxed licenses, read LICENSE for more details.

Acknowledgements

pdf2htmlEX is made possible thanks to the following projects:

pdf2htmlEX is inspired by the following projects:

  • pdftohtml from poppler
  • MuPDF
  • PDF.js
  • Crocodoc
  • Google Doc

Special Thanks

  • Hongliang Tian
  • Wanmin Liu

pdf2htmlex's People

Contributors

coolwanglu avatar duanyao avatar jahewson avatar marcsanfacon avatar afrosimon avatar tatetian avatar rmorlok avatar wanminliu avatar iapain avatar chriscinelli avatar dsablic avatar rafiot avatar jwilk avatar mortonfox avatar srijs avatar witwall avatar teddytrombone avatar joshweinstein avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.