salsowelim/tawseem

NLP crowdsourcing platform for word-level annotations

28
/ 100
Experimental

This tool helps linguistics researchers and NLP practitioners create high-quality training datasets for language models. You input raw text documents, and it provides a web platform where a team of annotators can manually segment words or tag parts of speech. The output is structured data with detailed word-level annotations, ready for building or improving NLP systems.

No commits in the last 6 months.

Use this if you need to manually annotate large amounts of text at the word level, especially for languages like Arabic where pre-existing NLP tools might be limited, and you want to coordinate multiple human annotators.

Not ideal if you require a highly secure, production-ready system for public use or if your annotation tasks are at a document or sentence level rather than individual words.

linguistics NLP dataset creation text annotation crowdsourcing Arabic language processing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 7 / 25

How are scores calculated?

Stars

10

Forks

1

Language

Go

License

MIT

Last pushed

Jun 05, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/salsowelim/tawseem"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.