louisbrulenaudet/docutron
Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.
This tool helps legal professionals efficiently extract specific information from legal documents like contracts, regulations, and case law. It takes unstructured legal text and outputs structured data, which can then be used to train specialized language models for tasks like contract analysis or legal summarization. This is ideal for legal researchers, paralegals, or attorneys who need to analyze large volumes of legal text.
No commits in the last 6 months.
Use this if you need to automate the process of extracting key information from many legal documents to create structured datasets.
Not ideal if you are looking for a ready-to-use application with a graphical interface rather than a toolkit that requires some technical setup.
Stars
26
Forks
1
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Oct 23, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/louisbrulenaudet/docutron"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bhimrazy/receipt-ocr
An efficient OCR engine for receipt image processing.
Juliofal4822/deepseek-ocr-multigpu-infer
🚀 Run efficient DeepSeek-OCR inference with Python scripts, supporting both single and multi-GPU...
Cross2pro/DeepSeek-OCR-Dashboard
An out-of-the-box local Web UI for DeepSeek-OCR. Built with FastAPI + Vue.js, it supports...
UnbrokenCocoon/OCR-evaluation
This project is a practical, beginner-friendly guide for users with datasets of scanned images,...