Hyuto/indo-nlp
Library python sederhana tanpa dependency tambahan yang bertujuan untuk memudahkan proyek NLP anda.
This tool helps data analysts and researchers working with Indonesian text by providing easy access to pre-existing Indonesian text datasets. It takes raw Indonesian text, potentially containing emojis and slang, and processes it into a cleaner, more standardized format. Anyone analyzing social media, customer feedback, or other text in Indonesian would find this useful.
No commits in the last 6 months.
Use this if you need to quickly load and clean Indonesian text data for analysis or further processing.
Not ideal if your primary need is advanced natural language understanding or generation capabilities beyond basic preprocessing.
Stars
11
Forks
2
Language
Python
License
MIT
Category
Last pushed
Oct 17, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Hyuto/indo-nlp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
malaysia-ai/malaya
Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
louisowen6/NLP_bahasa_resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
IndoNLP/indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide...
kirralabs/indonesian-NLP-resources
data resource untuk NLP bahasa indonesia
wongnai/wongnai-corpus
Collection of Wongnai's datasets