ScientaNL/pdf-extractor
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
This tool helps developers transform PDF documents into web-friendly assets: images (PNG, JPG), SVGs, HTML, and plain text files. You provide a PDF, and it generates separate files for each page, making it easy to display PDF content on websites or extract text for indexing. It is useful for web developers and anyone building web applications that need to display or process PDF content without a browser.
107 stars. No commits in the last 6 months.
Use this if you need an open-source solution to convert PDF documents into web-ready images, SVGs, or extract text and HTML for display or further processing within a Node.js environment.
Not ideal if you are an end-user looking for a graphical PDF editor or viewer, as this is a developer tool requiring programming knowledge to implement.
Stars: 107
Forks: 22
Language: JavaScript
License: MIT
Category: (not specified)
Last pushed: May 16, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/ScientaNL/pdf-extractor"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
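The same request can be made from Node.js, which fits the environment this module targets. The following is a minimal sketch, not a documented client: the helper names `buildQualityUrl` and `fetchQuality` are hypothetical, the `ai-coding` path segment is assumed to be a category identifier, and the response is assumed to be JSON (its fields are not described in this listing). It relies on the global `fetch` available in Node.js 18 and later.

```javascript
// Base path taken from the curl example above; everything else is illustrative.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Build the endpoint URL from a category and an "owner/repo" slug.
// Assumption: the URL layout is <base>/<category>/<owner>/<repo>.
function buildQualityUrl(category, repoSlug) {
  const parts = repoSlug.split("/");
  if (parts.length !== 2 || !parts[0] || !parts[1]) {
    throw new Error(`Expected "owner/repo", got "${repoSlug}"`);
  }
  const [owner, repo] = parts.map(encodeURIComponent);
  return `${API_BASE}/${encodeURIComponent(category)}/${owner}/${repo}`;
}

// Fetch the data for one repository.
// Assumption: the body is JSON. fetchImpl can be swapped for a stub in tests.
async function fetchQuality(category, repoSlug, fetchImpl = fetch) {
  const response = await fetchImpl(buildQualityUrl(category, repoSlug));
  if (!response.ok) {
    throw new Error(`Request failed with HTTP ${response.status}`);
  }
  return response.json();
}

// Usage (performs a network request, so it is left commented out):
// fetchQuality("ai-coding", "ScientaNL/pdf-extractor")
//   .then((data) => console.log(data))
//   .catch((err) => console.error(err.message));
```

Keeping URL construction separate from the network call makes the helper easy to check without spending any of the daily request quota.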
Higher-rated alternatives
vercel/satori: Enlightened library to convert HTML and CSS to SVG
elegantapp/pwa-asset-generator: Automates PWA asset generation and image declaration. Automatically generates icon and splash...
KnpLabs/snappy: PHP library allowing thumbnail, snapshot or PDF generation from a url or a html page. Wrapper...
kyranet/canvas-constructor: An ES6 utility for canvas with built-in functions and chained methods.
KnpLabs/KnpSnappyBundle: Easily create PDF and images in Symfony by converting html using webkit