nikitakit/self-attentive-parser
High-accuracy NLP parser with models for 11 languages.
This tool helps researchers, linguists, or data analysts automatically break down sentences into their grammatical structure. You feed it a sentence, and it outputs a parse tree showing how words group into phrases and clauses, making it easier to analyze sentence complexity and relationships between words. It's designed for anyone working with textual data who needs to understand the underlying grammatical composition of text in 11 different languages.
907 stars. No commits in the last 6 months.
Use this if you need to perform high-accuracy grammatical analysis on text to understand its constituent structure for research or advanced text processing.
Not ideal if you only need basic text processing like tokenization or sentiment analysis without requiring deep syntactic breakdown.
Stars
907
Forks
158
Language
Python
License
MIT
Category
Last pushed
Jan 10, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/nikitakit/self-attentive-parser"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
facebookresearch/stopes
A library for preparing data for machine translation research (monolingual preprocessing,...
Droidtown/ArticutAPI
API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,即能達到...
rkcosmos/deepcut
A Thai word tokenization library using Deep Neural Network
fukuball/jieba-php
"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation:...
pytorch/text
Models, data loaders and abstractions for language processing, powered by PyTorch