aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

Score: 58 / 100 (Established)

Sycamore helps data professionals and researchers extract and understand information from large collections of unstructured documents such as reports, presentations, and manuals. It ingests diverse document types, including complex PDFs with tables and images, and turns them into structured, enriched data ready for search or analysis. Its primary users are data engineers, data scientists, and anyone building search and analytics applications on large document sets.

592 stars. Actively maintained with 5 commits in the last 30 days.

Use this if you need to reliably process a high volume of diverse unstructured documents into a structured format for AI-powered search, analytics, or other applications.

Not ideal if you only have a few simple text files to process or primarily work with already structured data.

document-processing information-extraction unstructured-data-analytics enterprise-search knowledge-management
No Package · No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25
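
The four category scores above add up to the overall 58 / 100. The sketch below checks that arithmetic. It assumes the overall score is an unweighted sum of the four categories, which matches the numbers shown here but is not confirmed by this page.

```python
# Category scores as listed on this page, each out of 25.
category_scores = {
    "Maintenance": 13,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 19,
}

# Assumption: the overall score is the plain sum of the four categories.
overall = sum(category_scores.values())
maximum = 25 * len(category_scores)
summary = f"{overall} / {maximum}"
print(summary)
```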


Stars: 592
Forks: 68
Language: Python
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/aryn-ai/sycamore"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
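
The same request can be made from Python. This is a minimal sketch using only the standard library. The base URL comes from the curl example above. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which this page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL taken from the curl example; everything else here is illustrative.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the request URL for one repository (hypothetical helper)."""
    return f"{API_BASE}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the endpoint returns JSON."""
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("aryn-ai", "sycamore")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is a sensible default for repeated lookups.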