sudoskys/TeleDataParser

Batch parsing of Telegram exported Json data files and extraction of the entire/specified corpus of a user/group for AI learning

29
/ 100
Experimental

This tool helps you process your exported Telegram chat history (JSON files) into plain text for training AI models or analyzing conversations. It takes your Telegram data and extracts all messages, specific replies, or statements from chosen users or groups, outputting them as a clean text corpus. It's designed for anyone who needs to convert raw chat logs into structured text for further analysis or machine learning tasks.

No commits in the last 6 months.

Use this if you need to extract and filter specific conversations or all messages from your Telegram chat exports to create a text dataset.

Not ideal if you need to process chat data from platforms other than Telegram or require more than just plain text extraction.

AI-training-data chat-analysis conversation-extraction text-corpus-generation social-media-data
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

7

Forks

1

Language

Python

License

Apache-2.0

Last pushed

Nov 13, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sudoskys/TeleDataParser"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.