pbcquoc/vietnamese_word_seperate
Seperate vietnamese using lstm
This tool helps people who work with Vietnamese text by automatically adding spaces between words that have been incorrectly merged. You provide raw Vietnamese text where words are stuck together, and it returns the same text with proper word segmentation. This is ideal for linguists, data entry specialists, or anyone needing to process unsegmented Vietnamese for analysis or readability.
No commits in the last 6 months.
Use this if you have Vietnamese text that lacks proper word separation and needs to be cleaned up for better readability or further linguistic processing.
Not ideal if your Vietnamese text is already correctly segmented or if you are working with languages other than Vietnamese.
Stars
18
Forks
7
Language
Jupyter Notebook
License
—
Category
Last pushed
Aug 17, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pbcquoc/vietnamese_word_seperate"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
vunb/vntk
Vietnamese NLP Toolkit for Node
vncorenlp/VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018)
VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity...
IBM/transition-amr-parser
SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch....
duyvuleo/VNTC
A Large-scale Vietnamese News Text Classification Corpus