inuwamobarak/nougat
Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.
Nougat helps researchers, students, and scientists convert scientific PDFs into a structured Markdown format. It takes a scientific PDF as input and accurately transcribes its content, including equations and tables, into an editable Markdown file. This makes it easier to extract information, reformat content, and integrate scientific papers into other digital workflows.
No commits in the last 6 months.
Use this if you need to transform complex scientific PDFs into a machine-readable and easily editable text format for analysis, archiving, or reuse.
Not ideal if you primarily work with non-scientific documents or require OCR for handwritten notes, as it's specifically designed for the unique structures of scientific papers.
Stars
27
Forks
3
Language
Jupyter Notebook
License
—
Category
Last pushed
Oct 14, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/inuwamobarak/nougat"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
clusterzx/paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure...
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
alephpi/Texo-web
The web application for Texo, a minimalist SOTA LaTeX OCR model which contains only 20M...
bytefer/ollama-ocr
Implementing OCR with a local visual model run by ollama.
alephpi/Texo
A minimalist SOTA LaTeX OCR model with only 20M parameters, running in browser. Full training...