Global-Witness/augmenta
AI agent for enhancing datasets with information from the internet
This tool helps researchers and analysts enrich their datasets by using AI to find specific information from the internet. You provide a spreadsheet of entities (like companies or individuals) and a prompt describing what details you need, and it outputs an updated spreadsheet with the new information and source citations. This is ideal for investigators, journalists, or anyone needing to add verifiable context to existing data.
Available on PyPI.
Use this if you need to systematically add publicly available information to a large dataset, like classifying industries for a list of donors or finding background details on entities.
Not ideal if you require absolutely no risk of AI hallucination, as results still need human verification, or if you only need to research a few individual entities manually.
Stars
21
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 06, 2025
Commits (30d)
0
Dependencies
10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/Global-Witness/augmenta"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Kain-90/RAG-Play
An interactive visualization tool for understanding Retrieval-Augmented Generation (RAG) pipelines.
rryam/LumoKit
Swift package for on-device Retrieval-Augmented Generation (RAG)
harvard-lil/warc-gpt
WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.
CoIR-team/coir
(ACL 2025 Main) A Comprehensive Benchmark for Code Information Retrieval.
constacts/ragtacts
RAG(Retrieval-Augmented Generation) for Evolving Data