vistec-AI/thai2transformers

Pretraining transformer based Thai language models

54
/ 100
Established

This project helps natural language processing engineers and researchers working with the Thai language. It provides tools to pretrain transformer-based language models on Thai texts, taking raw Thai text datasets and producing a pretrained model. It also offers scripts to fine-tune these models for specific tasks like text classification, named entity recognition (NER), or part-of-speech (POS) tagging.

125 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to build or adapt state-of-the-art language models specifically for Thai language processing tasks.

Not ideal if you are looking for a pre-trained, ready-to-use model without any customization or further training.

Thai-language-processing NLP-research text-classification named-entity-recognition part-of-speech-tagging
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

125

Forks

23

Language

Jupyter Notebook

License

Apache-2.0

Last pushed

Nov 06, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/vistec-AI/thai2transformers"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.