kennethleungty/LangExtract-Gemma-Structured-Extraction
Using LangExtract and Gemma 3 for structured information extraction from unstructured text in insurance polices
This project helps you quickly extract critical information from lengthy and complex documents such as insurance policies or medical records. It takes your unstructured text documents and automatically pulls out key details, organizing them into a clear, structured format. Anyone who regularly needs to find specific facts buried in dense legal or technical documents, like an insurance agent, paralegal, or compliance officer, would find this tool useful.
No commits in the last 6 months.
Use this if you spend a lot of time manually sifting through long documents to find specific details or clauses.
Not ideal if your documents are already highly structured (like forms or tables) or if you only need to process a few short documents occasionally.
Stars
32
Forks
5
Language
Python
License
MIT
Category
Last pushed
Aug 29, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/kennethleungty/LangExtract-Gemma-Structured-Extraction"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NanoNets/docstrange
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple...
th1nhhdk/local_ai_ocr
An local, offline (after initial setup), portable OCR software that can process images and PDF...
Dicklesworthstone/llm_aided_ocr
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking,...
emcf/thepipe
Get clean data from tricky documents, powered by vision-language models ⚡
langstruct-ai/langstruct
Extract structured data from any content using LLMs.