sevapru/terrorblade
A unified data extraction and parsing platform for messaging services. It currently covers Telegram message processing, data standardization, and preparation for analytics.
It supports both API-driven and archive-based Telegram message extraction: asynchronous fetching via Telethon and processing of JSON exports, with the data stored in DuckDB under standardized schemas. It uses GPU acceleration for preprocessing and generates embeddings for semantic vector search with HNSW indexing; it also supports conversation clustering and incremental updates that avoid redundant processing. The project is designed as a modular platform, with future modules planned for WhatsApp, VK, Instagram, and Facebook, alongside ethics-focused anonymization and visualization components.
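To make the archive-based path and the incremental-update idea concrete, here is a minimal sketch in plain Python. It is not terrorblade's code: the function names, the row fields, and the in-memory `seen` set are all hypothetical, and the export layout (a top-level `id` and a `messages` list whose `text` is either a string or a list of strings and entity dicts) is an assumption about Telegram's JSON export format. A real pipeline would write the rows to DuckDB rather than return them.

```python
from datetime import datetime


def flatten_text(text):
    """Join a text field that may be a plain string or a list of parts."""
    if isinstance(text, str):
        return text
    # Assumed layout: list items are strings or dicts carrying a "text" key.
    return "".join(p if isinstance(p, str) else p.get("text", "") for p in text)


def standardize(export, seen):
    """Return standardized rows for messages not processed before.

    `seen` is a set of (chat_id, message_id) pairs; it is updated in place,
    so a second call on the same export yields no rows (incremental update).
    """
    chat_id = export.get("id")
    rows = []
    for msg in export.get("messages", []):
        key = (chat_id, msg["id"])
        # Skip service entries (joins, pins, ...) and already-seen messages.
        if msg.get("type") != "message" or key in seen:
            continue
        rows.append({
            "chat_id": chat_id,
            "message_id": msg["id"],
            "sent_at": datetime.fromisoformat(msg["date"]),
            "sender": msg.get("from"),
            "text": flatten_text(msg.get("text", "")),
        })
        seen.add(key)
    return rows


# Hypothetical sample shaped like a single-chat export.
sample_export = {
    "id": 42,
    "messages": [
        {"id": 1, "type": "message", "date": "2024-01-05T10:00:00",
         "from": "Alice", "text": "hello"},
        {"id": 2, "type": "service", "date": "2024-01-05T10:01:00"},
        {"id": 3, "type": "message", "date": "2024-01-05T10:02:00",
         "from": "Bob",
         "text": ["see ", {"type": "link", "text": "https://example.org"}, " now"]},
    ],
}

seen = set()
first_pass = standardize(sample_export, seen)
second_pass = standardize(sample_export, seen)  # nothing new to process
```

Keying the `seen` set on the chat and message IDs together keeps the check valid when several chats share message numbers; a persistent store would replace the set in practice.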
Stars
6
Forks
1
Language
Python
License
—
Category
Last pushed
Jan 23, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/sevapru/terrorblade"
Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.
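The same request can be made from Python with the standard library. This is a sketch under two assumptions that the listing does not confirm: that the path follows a category/owner/repository pattern (inferred from the single URL above) and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the category/owner/repo layout is assumed."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return "/".join([BASE_URL] + parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record and decode it, assuming a JSON response body."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


url = quality_url("data-engineering", "sevapru", "terrorblade")
```

Calling `fetch_quality("data-engineering", "sevapru", "terrorblade")` performs the network request; each call counts against the daily request limit.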