The-Gupta/TED-Scraper

Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming

37
/ 100
Emerging

This tool helps researchers and data scientists gather comprehensive information from TED.com quickly. It takes the entire TED website as input and outputs a complete dataset including metadata, transcripts, audio, video, and speaker images. It's ideal for anyone who needs an up-to-date and extensive collection of TED Talk data for analysis or other projects.

No commits in the last 6 months.

Use this if you need a fast and complete way to collect all available data from TED.com, including media files, for your research or data analysis.

Not ideal if you only need a small subset of TED Talk information or prefer to manually download data.

data-collection academic-research content-analysis media-studies public-speaking-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 16 / 25

How are scores calculated?

Stars

11

Forks

6

Language

Jupyter Notebook

License

MIT

Category

scraper

Last pushed

Jun 25, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/The-Gupta/TED-Scraper"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.