indonesian-NLP-resources and id-nlp-resource
indonesian-NLP-resources is a collection of Indonesian NLP datasets and tools, while id-nlp-resource is a curated list of such resources. The two are complements: the list serves as an index to resources like the data collection.
About indonesian-NLP-resources
kirralabs/indonesian-NLP-resources
Data resources for Indonesian-language NLP
This is a collection of Indonesian language text and word data. It provides various datasets, including sentences from news articles and web content, as well as detailed word lists categorized by type (like root words, verbs, nouns, slang, and positive/negative sentiment words). Language researchers, computational linguists, and data scientists working with Indonesian text will find this useful for training language models or analyzing text data.
About id-nlp-resource
kmkurn/id-nlp-resource
A list of Indonesian NLP resources.
This is a curated list of publicly available Indonesian language data, including large collections of news articles, social media posts, and transcribed speech. It serves as a central index for anyone who needs Indonesian text or audio to train or evaluate language models, analyze sentiment, or build translation systems. It is aimed at researchers, data scientists, and language technology developers who work with Indonesian.
Scores updated daily from GitHub, PyPI, and npm data.