scribe-org/Scribe-Data

Wikidata and Wiktionary language data extraction

67
/ 100
Established

This tool helps language content creators and developers easily extract and manage language-related data from Wikidata and Wiktionary. You specify the language and data type you need, and it provides structured linguistic information such as verbs or emoji keywords. It's ideal for anyone building language learning apps, linguistic analysis tools, or content localized for different languages.

Available on PyPI.

Use this if you need to quickly get specific linguistic data like words, definitions, or emojis for a particular language from Wikidata and Wiktionary.

Not ideal if you need highly customized subsets of lexeme forms or are working with massive datasets that require processing full Wikidata lexeme dumps.

language-resource linguistic-data content-localization language-learning knowledge-graph
Maintenance 10 / 25
Adoption 8 / 25
Maturity 25 / 25
Community 24 / 25

How are scores calculated?

Stars

55

Forks

94

Language

Python

License

GPL-3.0

Last pushed

Mar 14, 2026

Commits (30d)

0

Dependencies

55

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/scribe-org/Scribe-Data"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.