VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
PhoNLP annotates sentences with part-of-speech tags, named entities (such as people or organizations), and dependency relations between words. It expects word-segmented text as input, so raw Vietnamese text has to be run through a word segmenter first. It is aimed at NLP researchers and practitioners working with Vietnamese, and it can also be used to train and evaluate custom models for other languages that have annotated corpora.
149 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to perform advanced linguistic analysis on text, such as identifying grammatical structures and named entities, especially for languages like Vietnamese.
Not ideal if you're looking for a simple keyword extractor or a tool that doesn't require prior knowledge of linguistic annotation tasks.
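A minimal usage sketch, assuming the package is installed from PyPI (`pip3 install phonlp`) and that the `download`, `load`, `annotate`, and `print_out` calls behave as described in the project's README. The helper names `load_model` and `demo`, the example sentence constant, and the `./pretrained_phonlp` directory are illustrative choices, not part of the library. The sentence is already word-segmented (note the underscore in `làm_việc`).

```python
# Hypothetical example sentence: word-segmented Vietnamese, tokens separated by spaces.
EXAMPLE_SENTENCE = "Tôi đang làm_việc tại VinAI ."


def load_model(model_dir="./pretrained_phonlp"):
    """Download the pre-trained PhoNLP model into model_dir and load it."""
    # Imported lazily so this file can be loaded where phonlp is not installed.
    import phonlp

    # The download step can be skipped once the files are already in model_dir.
    phonlp.download(save_dir=model_dir)
    return phonlp.load(save_dir=model_dir)


def demo(sentence=EXAMPLE_SENTENCE):
    """Annotate one sentence and print the result in PhoNLP's tabular layout."""
    model = load_model()
    # annotate() jointly predicts POS tags, NER labels, and dependency relations.
    annotations = model.annotate(text=sentence)
    model.print_out(annotations)
    return annotations
```

Calling `demo()` downloads the model on first use, so it needs network access and some disk space. For a whole corpus, the README also describes `model.annotate(input_file=..., output_file=...)`, with one word-segmented sentence per input line.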
Stars: 149
Forks: 20
Language: Python
License: BSD-3-Clause
Last pushed: Dec 31, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/VinAIResearch/PhoNLP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
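The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the URL pattern is `quality/<category>/<owner>/<repo>` as inferred from the single example above. The function names are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the general pattern is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, repo):
    """Build the quality endpoint URL for a category and an 'owner/name' repo."""
    # quote() leaves '/' unescaped by default, so 'owner/name' stays a path.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category, repo, timeout=10):
    """GET the endpoint and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

For example, `fetch_quality("nlp", "VinAIResearch/PhoNLP")` requests the same URL as the curl command. Without a key, the stated limit of 100 requests per day applies, so it is worth caching the result rather than calling it in a loop.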
Related tools
vunb/vntk
Vietnamese NLP Toolkit for Node
vncorenlp/VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018)
IBM/transition-amr-parser
SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in PyTorch....
duyvuleo/VNTC
A Large-scale Vietnamese News Text Classification Corpus
nert-nlp/AMR-Bibliography
Organized inventory of research using the Abstract Meaning Representation