mit-nlp/MITIE
MITIE: library and tools for information extraction
This tool helps data analysts and researchers automatically extract specific pieces of information from text, such as names of people, organizations, or relationships between entities. You input raw text documents, and it outputs the same text with key information identified and categorized. This is ideal for anyone working with large volumes of text data who needs to quickly find and categorize specific facts or connections.
2,963 stars. No commits in the last 6 months.
Use this if you need to programmatically identify and extract named entities or relationships from English, Spanish, or German text for analysis or further processing.
Not ideal if you're looking for a user-friendly, out-of-the-box application with a graphical interface for general text analysis without programming.
Stars: 2,963
Forks: 534
Language: C++
License: not listed
Category:
Last pushed: Sep 28, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mit-nlp/MITIE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
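The curl call above can also be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single curl example, and the response is assumed (not confirmed by this page) to be JSON. It sends an unauthenticated request, so it counts against the 100 requests/day allowance.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path is <category>/<owner>/<repo>, as in the curl example.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send an unauthenticated GET request and decode the body.

    Assumption: the endpoint returns JSON; the response schema is not
    documented on this page, so the result is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a real network request):
#   data = fetch_quality("nlp", "mit-nlp", "MITIE")
#   print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes it possible to check the request target without spending any of the daily quota.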
Higher-rated alternatives
facebookresearch/stopes
A library for preparing data for machine translation research (monolingual preprocessing,...
Droidtown/ArticutAPI
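API of Articut Chinese word segmentation (with semantic part-of-speech tagging): word segmentation is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it can achieve...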
rkcosmos/deepcut
A Thai word tokenization library using a deep neural network
fukuball/jieba-php
"Jieba" Chinese word segmentation: aiming to be the best PHP component for Chinese word segmentation. / "Jieba" (Chinese for "to stutter") Chinese text segmentation:...
pytorch/text
Models, data loaders and abstractions for language processing, powered by PyTorch