pdf2htmlEX

Convert PDF to HTML without losing text or format.

  • Owner: coolwanglu/pdf2htmlEX
  • Platform:
  • License:: Other
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

pdf2htmlEX is no longer under active development. New maintainers are wanted.

# pdf2htmlEX

一图胜千言A beautiful demo is worth a thousand words

  • Bible de Genève, 1564 (fonts and typography): HTML / PDF
  • Cheat Sheet (math formulas): HTML / PDF
  • Scientific Paper (text and figures): HTML / PDF
  • Full Circle Magazine (read while downloading): HTML / PDF
  • Git Manual (CJK support): HTML / PDF

pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies.
Academic papers with lots of formulas and figures? Magazines with complicated layouts? No problem!

pdf2htmlEX is also an online publishing tool which is flexible for many different use cases.

Learn more about who and why should use pdf2htmlEX.

Features

  • Native HTML text with precise font and location.
  • Flexible output: all-in-one HTML or on demand page loading (needs JavaScript).
  • Moderate file size, sometimes even smaller than PDF.
  • Supporting links, outlines (bookmarks), printing, SVG background, Type 3 fonts and more...

Compare to others

Portals

LICENSE

pdf2htmlEX, as a whole package, is licensed under GPLv3+.
Some resource files are released with relaxed licenses, read LICENSE for more details.

Acknowledgements

pdf2htmlEX is made possible thanks to the following projects:

pdf2htmlEX is inspired by the following projects:

  • pdftohtml from poppler
  • MuPDF
  • PDF.js
  • Crocodoc
  • Google Doc

Special Thanks

  • Hongliang Tian
  • Wanmin Liu

Main metrics

Overview
Name With Ownercoolwanglu/pdf2htmlEX
Primary LanguageHTML
Program languageCMake (Language Count: 10)
Platform
License:Other
所有者活动
Created At2012-08-04 17:59:25
Pushed At2023-06-02 21:11:14
Last Commit At2022-08-05 12:00:41
Release Count16
Last Release Namev0.14.6 (Posted on )
First Release Namev0.1 (Posted on )
用户参与
Stargazers Count10.5k
Watchers Count508
Fork Count1.9k
Commits Count1.7k
Has Issues Enabled
Issues Count686
Issue Open Count231
Pull Requests Count50
Pull Requests Open Count14
Pull Requests Close Count32
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private