google/langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.


Established

This tool helps researchers, analysts, and similar professionals pull specific, structured facts from large amounts of unstructured text, such as clinical notes, reports, or literary works. You provide raw text and define what information you're looking for (e.g., characters, medications, relationships), and it outputs an organized list of the extracted details, each with its exact location in the original document, plus an interactive visualization. It suits anyone who needs to systematically find and verify specific data points across many documents without reviewing each one by hand.
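To make "exact location in the original document" concrete, here is a minimal sketch of the source-grounding idea. It is not langextract's implementation or API: the `GroundedExtraction` class and `ground` function are hypothetical names invented for illustration, and the matching is a plain exact-substring search using only the standard library. It assumes some model has already returned (class, text) pairs and shows how each pair can be tied to character offsets, or flagged when it cannot be found in the source.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GroundedExtraction:
    """Hypothetical record: one extracted item plus where it sits in the source."""
    extraction_class: str        # e.g. "medication"
    extraction_text: str         # text the model claims to have extracted
    start: Optional[int] = None  # start character offset, None if not found
    end: Optional[int] = None    # exclusive end offset, None if not found

    @property
    def grounded(self) -> bool:
        return self.start is not None


def ground(source: str, candidates: list[tuple[str, str]]) -> list[GroundedExtraction]:
    """Attach character offsets to (class, text) pairs by exact substring match."""
    results = []
    cursor = 0  # prefer matches after the previous one, to keep document order
    for cls, text in candidates:
        pos = source.find(text, cursor) if text else -1
        if pos == -1 and text:
            pos = source.find(text)  # fall back to searching the whole document
        if pos == -1:
            # Not present verbatim: keep the item but mark it as ungrounded.
            results.append(GroundedExtraction(cls, text))
        else:
            end = pos + len(text)
            results.append(GroundedExtraction(cls, text, pos, end))
            cursor = end
    return results


note = "Patient was given 250 mg of amoxicillin twice daily for 7 days."
candidates = [
    ("dosage", "250 mg"),
    ("medication", "amoxicillin"),
    ("medication", "ibuprofen"),  # not in the note, so it stays ungrounded
]
for item in ground(note, candidates):
    span = note[item.start:item.end] if item.grounded else None
    print(item.extraction_class, repr(item.extraction_text), item.start, item.end, span)
```

Slicing the source with the stored offsets returns the extracted text verbatim, which is what lets a reviewer check each item in context. An item with no offsets is a signal that the model may have paraphrased or invented it.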

34,668 stars. Actively maintained with 11 commits in the last 30 days. Available on PyPI.

Use this if you need to extract specific types of information from large volumes of text documents and want to ensure the extracted data is directly traceable back to its source.

Not ideal if your task requires summarizing or generating new text rather than strictly extracting existing facts, or if you don't need to verify extractions against their original context.

information-extraction clinical-data-analysis document-processing research-analysis qualitative-data

Maintenance 17 / 25

Adoption 10 / 25

Maturity 24 / 25

Community 18 / 25


Stars 34,668

Forks 2,330

Language Python

License Apache-2.0

Recent Releases

v1.2.1 08 Apr 2026

v1.2.0 22 Mar 2026

v1.1.1 27 Nov 2025

v1.1.0 14 Nov 2025

v1.0.9 31 Aug 2025

Related tools

Extralit/extralit

Fast and accurate systemic data extraction with LLM assistance

Keyvanhardani/german-ocr

German-OCR is specifically trained to extract text from German documents including invoices,...

oidlabs-com/Lexoid

Multimodal document parser for high quality data understanding and extraction

xingbow/SciDaEx

Structured data extraction from research literature

parsee-ai/parsee-core

Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and...
