MDGrey33/pyvisionai

The PyVisionAI Official Repo

/ 100

Established

This tool helps you quickly understand the content of various documents like PDFs, Word files, PowerPoints, and even websites. It takes these documents and identifies both the text and images within them, then generates detailed descriptions of those images. This is perfect for researchers, content analysts, or anyone who needs to extract and summarize information from large numbers of documents.

112 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to automatically extract text and get descriptive summaries of images from a collection of documents or web pages.

Not ideal if you only need simple text extraction without any image analysis or if you require advanced document editing capabilities.

document-analysis content-extraction information-retrieval research-automation multimodal-analysis

Stale 6m

Maintenance 2 / 25

Adoption 9 / 25

Maturity 25 / 25

Community 14 / 25

How are scores calculated?

Stars

112

Forks

Language

Python

License

Apache-2.0

Related models

SwanHubX/SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports...

mdsrqbl/omnihuman

AI model that understands text & humanoids.

stas00/ml-engineering

Machine Learning Engineering Open Book

labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including...

analyticalrohit/AI-ML-Cheatsheets

All Stanford Cheatsheets: Artificial Intelligence, Transformers, LLMs, Deep Learning, Machine...

Explore Transformer Models

All categories Trending Transformer directory Insights