Krasjet/pdf.tocgen
A CLI toolset that automatically generates a table of contents for PDF files.
This tool helps you quickly add an interactive table of contents to your software-generated PDF documents. You input a PDF file and a 'recipe' that defines heading styles, and it outputs a new PDF with a fully navigable table of contents. It's ideal for anyone who creates or manages PDF documents like technical manuals, reports, or e-books and needs to improve their navigation.
824 stars. No commits in the last 6 months. Available on PyPI.
Use this if you have a software-generated PDF (not a scanned document) and need to automatically create or update its internal table of contents for easier navigation.
Not ideal if your PDF documents are scanned images, as it relies on embedded font attributes and text positioning to identify headings.
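To make the recipe idea concrete, here is a minimal Python sketch of matching headings by font attributes. It is an illustration under stated assumptions, not pdf.tocgen's actual code or recipe format: the `Span` and `HeadingRule` types, the `build_toc` function, the size tolerance, and the sample data are all hypothetical. It assumes text spans with font name, font size, and page number have already been extracted from a software-generated PDF.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """A run of text with the font attributes embedded in the PDF (hypothetical type)."""
    text: str
    font: str
    size: float
    page: int


@dataclass
class HeadingRule:
    """One recipe entry: spans with this font and size become headings at `level`."""
    level: int
    font: str
    size: float
    tolerance: float = 0.1  # assumed allowance for small font-size differences

    def matches(self, span: Span) -> bool:
        return span.font == self.font and abs(span.size - self.size) <= self.tolerance


def build_toc(spans, recipe):
    """Return (level, title, page) entries for every span matching a recipe rule."""
    toc = []
    for span in spans:
        for rule in recipe:
            if rule.matches(span):
                toc.append((rule.level, span.text.strip(), span.page))
                break  # the first matching rule wins
    return toc


# Hypothetical sample data standing in for text extracted from a PDF.
spans = [
    Span("1 Introduction", "Times-Bold", 20.0, 1),
    Span("This manual describes the device.", "Times-Roman", 10.0, 1),
    Span("1.1 Scope", "Times-Bold", 14.0, 2),
]
recipe = [
    HeadingRule(level=1, font="Times-Bold", size=20.0),
    HeadingRule(level=2, font="Times-Bold", size=14.0),
]

toc = build_toc(spans, recipe)
for level, title, page in toc:
    # Indent each entry by its heading level.
    print(f'{"    " * (level - 1)}{title} (p. {page})')
```

A scanned PDF offers no such spans, since its pages are images without embedded font names or sizes, which is why this approach does not apply to scanned documents.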
Stars: 824
Forks: 27
Language: Python
License: GPL-3.0
Category:
Last pushed: Nov 26, 2023
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/Krasjet/pdf.tocgen"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for the curl-impersonate fork via cffi. An HTTP client that can impersonate browser...
plabayo/rama
Modular service framework to move and transform network packets.
scrapinghub/spidermon
Scrapy extension for monitoring spider execution.