samitha9125/SinhalaTextClassification
Sinhala Text Classification based on n-grams
This tool helps you automatically sort Sinhala news articles into predefined categories, such as 'Local' or 'International'. You provide news text files, and it tells you which category each piece belongs to. This is ideal for anyone who needs to quickly organize or filter large volumes of Sinhala news content.
No commits in the last 6 months.
Use this if you need to classify Sinhala news articles into distinct categories.
Not ideal if you require very high accuracy for nuanced Sinhala text classification, or if your news articles contain complex vocabulary not present in the training data.
Stars
18
Forks
6
Language
Python
License
GPL-3.0
Category
Last pushed
Oct 14, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/samitha9125/SinhalaTextClassification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
derhuerst/nbayes
A Naive Bayes classifier written in JavaScript.
hankcs/text-classification-svm
The missing SVM-based text classification module implementing HanLP's interface
ningchaoar/UnsupervisedTextClassification
基于关键词的无监督文本分类;Implementation for paper "Text Classification by Bootstrapping with Keywords, EM...
qyfang/TextClassification
基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
fullstackyang/article-classifier
基于朴素贝叶斯实现的一款微信公众号文章分类器