MuhammadYaseenKhan/Urdu-Sentiment-Corpus

Labelled Dataset for Urdu Sentiment Analysis

Emerging

This project provides a collection of Urdu texts, primarily tweets, that have been manually labeled as positive, negative, or neutral. It is intended for researchers and linguists who build or evaluate systems that automatically detect the emotional tone of Urdu-language content. Each entry pairs a raw Urdu text with its sentiment label.
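As a rough illustration of working with text/label pairs like these, the sketch below reads a tab-separated table and counts the labels. The two-column layout, the column names `text` and `label`, the label spellings, and the helper `load_labelled_rows` are all assumptions made for this example; the listing does not state the corpus file format, and the sample rows are invented placeholders rather than entries from the dataset.

```python
import csv
import io
from collections import Counter

# Hypothetical sample in an assumed two-column TSV layout (text, label).
# These rows are made up for illustration and are NOT taken from the corpus.
SAMPLE_TSV = (
    "text\tlabel\n"
    "یہ بہت اچھا ہے\tpositive\n"
    "یہ بہت برا ہے\tnegative\n"
    "آج پیر ہے\tneutral\n"
)

# Assumed label vocabulary; the real files may use different spellings.
VALID_LABELS = {"positive", "negative", "neutral"}


def load_labelled_rows(handle):
    """Return (text, label) pairs, skipping rows with a missing or unknown label."""
    reader = csv.DictReader(handle, delimiter="\t")
    rows = []
    for record in reader:
        text = (record.get("text") or "").strip()
        label = (record.get("label") or "").strip().lower()
        if text and label in VALID_LABELS:
            rows.append((text, label))
    return rows


rows = load_labelled_rows(io.StringIO(SAMPLE_TSV))
label_counts = Counter(label for _, label in rows)
print(label_counts)
```

To try this on a real file, the `io.StringIO` object could be swapped for a handle opened with `encoding="utf-8"`, with the column names and label set adjusted to whatever the corpus actually uses.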

No commits in the last 6 months.

Use this if you are developing or testing algorithms that detect sentiment in written Urdu content, especially social media posts.

Not ideal if you need a dataset for tasks other than sentiment analysis, such as topic modeling or language translation.

Urdu language, sentiment analysis, social media analysis, computational linguistics, natural language processing

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 19 / 25

Higher-rated alternatives

acl-org/acl-anthology

Data and software for building the ACL Anthology.

anoopkunchukuttan/indic_nlp_library

Resources and tools for Indian language Natural Language Processing

CLUEbenchmark/CLUECorpus2020

Large-scale pre-training corpus for Chinese (100 GB)

KennethEnevoldsen/scandinavian-embedding-benchmark

A Scandinavian Benchmark for sentence embeddings

Separius/awesome-sentence-embedding

A curated list of pretrained sentence and word embedding models
