scribe-org/Scribe-Data
Wikidata and Wiktionary language data extraction
This tool helps language content creators and developers extract and manage language data from Wikidata and Wiktionary. You specify the language and data type you need, and it returns structured linguistic information such as verbs or emoji keywords. It suits anyone building language learning apps, linguistic analysis tools, or localized content.
Available on PyPI.
Use this if you need to quickly get specific linguistic data like words, definitions, or emojis for a particular language from Wikidata and Wiktionary.
Not ideal if you need highly customized subsets of lexeme forms or are working with massive datasets that require processing full Wikidata lexeme dumps.
Stars: 55
Forks: 94
Language: Python
License: GPL-3.0
Category:
Last pushed: Mar 14, 2026
Commits (30d): 0
Dependencies: 55
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/scribe-org/Scribe-Data"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
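The same request can be made from Python with the standard library. This is a minimal sketch, not a documented client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the single URL shown above, and the response format is not stated on this page, so the code tries JSON and falls back to raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumption: the path is <category>/<owner>/<repo>, as in the one
    example URL on this page.
    """
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(part, safe="") for part in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data (hypothetical helper).

    Assumption: the body is UTF-8 text. It is parsed as JSON when
    possible and returned as a plain string otherwise.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Example call (performs a network request, counts against the daily limit):
# data = fetch_quality("data-engineering", "scribe-org", "Scribe-Data")
```

Keeping URL construction separate from the network call makes it easy to check the request target without spending any of the 100 keyless requests per day.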
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.