MDGrey33/pyvisionai
The PyVisionAI Official Repo
PyVisionAI extracts the content of documents such as PDFs, Word files, PowerPoint presentations, and web pages. It identifies both the text and the images in each document, then generates detailed descriptions of those images. It suits researchers, content analysts, and anyone who needs to extract and summarize information from large numbers of documents.
112 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically extract text and get descriptive summaries of images from a collection of documents or web pages.
Not ideal if you only need simple text extraction without any image analysis or if you require advanced document editing capabilities.
Stars: 112
Forks: 14
Language: Python
License: Apache-2.0
Category:
Last pushed: Jul 19, 2025
Commits (30d): 0
Dependencies: 11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/MDGrey33/pyvisionai"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
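The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which the listing does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository, escaping each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for owner/repo.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError (json.JSONDecodeError).
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("MDGrey33", "pyvisionai")` issues the same GET request as the curl command. Without a key, requests count against the 100/day limit.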
Related models
SwanHubX/SwanLab
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports...
mdsrqbl/omnihuman
AI model that understands text & humanoids.
stas00/ml-engineering
Machine Learning Engineering Open Book
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including...
analyticalrohit/AI-ML-Cheatsheets
All Stanford Cheatsheets: Artificial Intelligence, Transformers, LLMs, Deep Learning, Machine...