IndoNLP/indonlu

The first-ever vast natural language processing benchmark for the Indonesian language. It provides multiple downstream tasks, pre-trained IndoBERT models, and starter code. (AACL-IJCNLP 2020)

51 / 100 (Established)

This project offers a comprehensive set of tools and data for understanding and processing the Indonesian language. It helps researchers and AI practitioners build models that take raw Indonesian text as input and produce outputs for tasks such as sentiment analysis, named entity recognition, or question answering. It is aimed at anyone creating or evaluating AI applications focused on the Indonesian language.

638 stars. No commits in the last 6 months.

Use this if you are developing or benchmarking natural language processing models specifically for the Indonesian language.

Not ideal if your focus is on languages other than Indonesian, or if you require pre-built, ready-to-deploy solutions without customization.

Indonesian-language-processing, text-analysis, AI-research, language-model-development, computational-linguistics
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25
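
The four category scores above add up to the overall 51 / 100 shown at the top. Whether the site computes the headline score as a plain sum is an assumption; the short check below only confirms that the listed numbers are consistent with that reading.

```python
# Sub-scores copied from the listing above, each out of 25.
subscores = {
    "Maintenance": 0,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 25,
}

# Assumption: the overall score is the plain sum of the four categories.
total = sum(subscores.values())
maximum = 25 * len(subscores)

print(f"{total} / {maximum}")  # prints "51 / 100"
```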


Stars: 638
Forks: 211
Language: Jupyter Notebook
License: Apache-2.0
Last pushed: Nov 16, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/IndoNLP/indonlu"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
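
The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single example URL above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL (hypothetical helper).

    The category/owner/repo layout is inferred from the one example URL.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository (hypothetical helper).

    Assumption: the response body is JSON. No API key is sent, so the
    request counts against the keyless daily limit.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "IndoNLP", "indonlu")` issues the same request as the curl command. The response fields are not documented here, so inspect the returned object before relying on any particular key.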