trannguyenhan/preprocessing-data

Tiền xử lý dữ liệu tiếng Việt với 4 bước

/ 100

Experimental

This tool helps Vietnamese content creators, marketers, or researchers prepare raw Vietnamese text for analysis. It takes messy, unstandardized Vietnamese text as input and outputs clean, consistently formatted text ready for further processing like text mining or classification. This is ideal for anyone working with large volumes of user-generated content or articles in Vietnamese.

No commits in the last 6 months.

Use this if you need to standardize and clean Vietnamese text data that might contain inconsistent formatting, Unicode errors, or incorrect capitalization.

Not ideal if your data is not in Vietnamese or if you require advanced natural language processing tasks beyond basic text cleaning.

Vietnamese-language-processing content-preparation text-analysis data-cleaning market-research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

castorini/hedwig

PyTorch deep learning models for document classification

kk7nc/Text_Classification

Text Classification Algorithms: A Survey

AnubhavGupta3377/Text-Classification-Models-Pytorch

Implementation of State-of-the-art Text Classification Models in Pytorch

inspirehep/magpie

Deep neural network framework for multi-label text classification

InseeFrLab/torchTextClassifiers

A unified framework for text classification in PyTorch.

Explore ML Frameworks

All categories Trending ML Framework directory Insights