MilkMp/CIA-World-Factbooks-Archive-1990-2025
Complete structured archive of every CIA World Factbook edition from 1990-2025. 281 countries, 36 years, 1M+ parsed data fields in SQL Server. Python ETL scripts included. Website created to allow users to explore the data without downloading. www.worldfactbookarchive.org
This project offers a comprehensive archive of the CIA World Factbooks from 1990-2025, transforming raw text and HTML into a structured, queryable database. It takes various formats of historical Factbook data and outputs a clean, unified dataset that can be easily analyzed. Anyone needing to research country-specific information over time, such as intelligence analysts, geopolitical researchers, or economists, would find this invaluable.
Use this if you need to access historical country data and geopolitical facts compiled by the CIA, structured for easy analysis and trend identification.
Not ideal if you need real-time, constantly updated geopolitical data, as this archive is historical and the Factbook itself has been discontinued.
Stars
66
Forks
12
Language
TSQL
License
—
Category
Last pushed
Mar 17, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/MilkMp/CIA-World-Factbooks-Archive-1990-2025"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.