ajdavidl/Portuguese-NLP

List of resources and tools developed with focus on Portuguese.

35
/ 100
Emerging

This project is a curated catalog of datasets specifically designed for working with the Portuguese language. It brings together various types of text and speech data, from news articles and social media posts to medical texts and court decisions, as well as tools and resources. If you're a linguist, researcher, or data scientist focusing on Portuguese natural language processing, this is your go-to resource to find suitable data for your projects.

311 stars. No commits in the last 6 months.

Use this if you need to find specialized Portuguese language datasets for research, sentiment analysis, essay scoring, speech recognition, or other text-based applications.

Not ideal if you are looking for general-purpose language models or tools that are not specifically focused on the Portuguese language.

Portuguese language processing linguistic research text analysis speech recognition data science
No License Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

311

Forks

32

Language

License

Last pushed

Jun 26, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ajdavidl/Portuguese-NLP"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.