trannguyenhan/preprocessing-data

Vietnamese data preprocessing in 4 steps

Score: 29 / 100 (Experimental)

This tool helps content creators, marketers, and researchers who work with Vietnamese prepare raw text for analysis. It takes messy, unstandardized Vietnamese text as input and outputs clean, consistently formatted text ready for downstream tasks such as text mining or classification. It suits anyone handling large volumes of user-generated content or articles in Vietnamese.

No commits in the last 6 months.

Use this if you need to standardize and clean Vietnamese text data that might contain inconsistent formatting, Unicode errors, or incorrect capitalization.

Not ideal if your data is not in Vietnamese or if you require advanced natural language processing tasks beyond basic text cleaning.
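
The kinds of cleanup mentioned above (Unicode errors, inconsistent formatting, incorrect capitalization) can be illustrated with a minimal Python sketch. This is not the repository's code, and its four steps are not listed on this page; the function name `clean_vietnamese_text` is hypothetical, and lowercasing is only one assumed way to make capitalization consistent.

```python
import re
import unicodedata


def clean_vietnamese_text(text: str) -> str:
    """Generic cleanup sketch (assumption: NOT the repository's actual pipeline)."""
    # Compose base letters and combining tone marks into single code points (NFC),
    # so visually identical Vietnamese strings compare equal.
    text = unicodedata.normalize("NFC", text)
    # Collapse runs of whitespace (spaces, tabs, newlines) into one space.
    text = re.sub(r"\s+", " ", text).strip()
    # Assumed capitalization policy: lowercase everything.
    return text.lower()


if __name__ == "__main__":
    print(clean_vietnamese_text("  Tiền   XỬ  LÝ  dữ liệu "))
```

NFC normalization matters for Vietnamese because the same accented letter can be encoded either as one precomposed character or as a base letter followed by combining marks; normalizing first keeps later steps such as tokenization or deduplication consistent.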

Vietnamese-language-processing content-preparation text-analysis data-cleaning market-research
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 16 / 25


Stars: 14
Forks: 6
Language: Jupyter Notebook
License: None
Last pushed: Aug 24, 2021
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/trannguyenhan/preprocessing-data"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
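
For scripted access, the same request can be sketched in Python using only the standard library. The URL comes from the curl command above; treating the path segments as category, owner, and repository, and the response as JSON, are assumptions, since this page does not document the response format. The names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    # Assumption: the path is /<category>/<owner>/<repo>, inferred from the curl example.
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the endpoint returns a JSON body."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_quality("ml-frameworks", "trannguyenhan", "preprocessing-data")
    print(json.dumps(data, indent=2, ensure_ascii=False))
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling the endpoint on every run.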