wongnai/wongnai-corpus

Collection of Wongnai's datasets

45
/ 100
Emerging

This collection offers Thai language datasets primarily for natural language processing research. It includes search query words, some algorithmically and some human-labeled, along with a food dictionary, as well as restaurant reviews with star ratings. The datasets help researchers and data scientists build and evaluate models for tasks like word segmentation and review rating prediction.

No commits in the last 6 months.

Use this if you are developing or researching natural language processing models specifically for the Thai language, especially for tasks related to search query understanding or sentiment analysis of reviews.

Not ideal if your project does not involve the Thai language or if you need general-purpose text data unrelated to food, restaurants, or search queries.

Thai-language-NLP word-segmentation restaurant-reviews sentiment-analysis search-query-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

79

Forks

23

Language

License

LGPL-3.0

Last pushed

Aug 26, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/wongnai/wongnai-corpus"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.