etalab-ia/mediatech
Collection of public datasets from the French administration, vectorized and ready to use in AI projects.
This project helps public sector data scientists and AI developers access French public administration datasets. It takes raw public data, processes it, creates 'vectorized' versions (numerical representations for AI), and stores it in a database. The output is a ready-to-use dataset that can power various AI applications.
Use this if you need to quickly access and integrate French public administration data, pre-processed and vectorized, into your AI projects or applications within the public sector.
Not ideal if you need raw, unprocessed public data, or if you are working with non-French public sector datasets.
Stars
9
Forks
3
Language
Python
License
MIT
Category
Last pushed
Jan 26, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/etalab-ia/mediatech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and...
yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package
embeddings-benchmark/results
Data for the MTEB leaderboard
Hironsan/awesome-embedding-models
A curated list of awesome embedding models tutorials, projects and communities.