spartypkp/open-source-legislation
Open-source global legislation data in an SQL knowledge-graph format ideal for use with LLMs. Download legislation data in bulk and immediately start building with our Python/TypeScript SDKs. Democratize legal knowledge for all.
This project was an effort to collect and standardize global legislative data, transforming raw government website information into a unified SQL database ready for use in AI applications. The goal was to provide a structured dataset of laws and regulations, complete with machine-readable embeddings, to make it easier for legal professionals, researchers, or anyone building legal technology to access and analyze legislative text without needing to manually collect and clean the data themselves. It aimed to take disorganized legislative documents and turn them into an organized, queryable knowledge graph.
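To make the "queryable knowledge graph" idea concrete, here is a minimal sketch of one way hierarchical legislation could be stored in SQL and traversed. It is not the project's actual schema: the `node` table, its columns, the IDs, and the `build_db` and `breadcrumb` helpers are all hypothetical, and the rows are made-up placeholders. It assumes each unit of law (jurisdiction, title, chapter, section) is a row that points at its parent, and uses a recursive query to walk from a section back up to the root.

```python
import sqlite3

# Hypothetical schema: one row per node in the legislative hierarchy.
# Table and column names are illustrative, not taken from the project.
SCHEMA = """
CREATE TABLE node (
    id        TEXT PRIMARY KEY,           -- path-like identifier
    parent_id TEXT REFERENCES node(id),   -- NULL for the root node
    level     TEXT NOT NULL,              -- e.g. 'title', 'chapter', 'section'
    heading   TEXT NOT NULL,
    body      TEXT                        -- statute text, leaf nodes only
);
"""

# Placeholder rows, invented purely for illustration.
ROWS = [
    ("ex", None, "jurisdiction", "Example Jurisdiction", None),
    ("ex/title-1", "ex", "title", "Title 1", None),
    ("ex/title-1/ch-2", "ex/title-1", "chapter", "Chapter 2", None),
    ("ex/title-1/ch-2/s-10", "ex/title-1/ch-2", "section", "Section 10",
     "Placeholder section text."),
]

# Recursive CTE: start at the requested node and follow parent_id upward.
ANCESTORS = """
WITH RECURSIVE chain(id, parent_id, heading, depth) AS (
    SELECT id, parent_id, heading, 0 FROM node WHERE id = ?
    UNION ALL
    SELECT n.id, n.parent_id, n.heading, chain.depth + 1
    FROM node AS n JOIN chain ON n.id = chain.parent_id
)
SELECT heading FROM chain ORDER BY depth DESC;
"""


def build_db() -> sqlite3.Connection:
    """Create an in-memory database holding the placeholder hierarchy."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO node VALUES (?, ?, ?, ?, ?)", ROWS)
    return conn


def breadcrumb(conn: sqlite3.Connection, node_id: str) -> list[str]:
    """Return the headings from the root down to the given node."""
    return [row[0] for row in conn.execute(ANCESTORS, (node_id,))]


if __name__ == "__main__":
    conn = build_db()
    print(" > ".join(breadcrumb(conn, "ex/title-1/ch-2/s-10")))
```

A parent-pointer table keeps the scraper's output simple to insert, since each page only needs to know its own parent, while the recursive query recovers the full path when it is needed. Embeddings, which the description also mentions, could live in an additional column or a side table keyed by `id`; that part is left out of this sketch.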
Use this if you are a developer or researcher looking for a reference example of how to build a web scraping and data modeling pipeline for complex, hierarchical text data like legal statutes, particularly if you're interested in structuring it for AI applications.
Not ideal if you need current, actively maintained legislative data or working tools, as the project is archived, its data downloads are broken, and its scrapers are outdated.
Stars: 14
Forks: 2
Language: Python
License: —
Category:
Last pushed: Nov 15, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/spartypkp/open-source-legislation"
Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser...
plabayo/rama
Modular service framework to move and transform network packets.
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.