ScientaNL/pdf-extractor

Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata

44
/ 100
Emerging

This tool helps developers transform PDF documents into various web-friendly assets like images (PNG, JPG), SVGs, HTML, and plain text files. You provide a PDF, and it generates separate files for each page, making it easy to display PDF content on websites or extract text for indexing. Web developers or anyone building web applications that need to display or process PDF content without a browser would find this useful.

107 stars. No commits in the last 6 months.

Use this if you need an open-source solution to convert PDF documents into web-ready images, SVGs, or extract text and HTML for display or further processing within a Node.js environment.

Not ideal if you are an end-user looking for a graphical PDF editor or viewer, as this is a developer tool requiring programming knowledge to implement.

web-development document-processing content-extraction digital-publishing api-migration
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

107

Forks

22

Language

JavaScript

License

MIT

Last pushed

May 16, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/ScientaNL/pdf-extractor"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.