Wazzabeee/pyspark-etl-twitter
Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
This project helps you monitor public opinion about topics or brands as it happens, by analyzing tweets in real time. It takes live Twitter data, processes it through a pre-built sentiment analysis model, and outputs whether the sentiment is positive, negative, or neutral. Social media strategists, brand managers, or marketing analysts can use this to quickly gauge public mood.
No commits in the last 6 months.
Use this if you need to track how people are feeling about a specific subject or your brand on Twitter right now, to inform rapid response or strategy adjustments.
Not ideal if you're looking for deep historical analysis of sentiment or want to analyze data from platforms other than Twitter.
Stars
19
Forks
5
Language
Python
License
—
Category
Last pushed
May 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/Wazzabeee/pyspark-etl-twitter"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
melvynator/ELK_twitter
This is a data pipeline for Twitter (ETL) using the elastic stack Elasticsearch, Logstash and...
sergio11/covid_tweets_etl_architecture
๐๐งช This is a learning-focused POC that explores a microservices ETL architecture for real-time...
alireza-heidarii/Real-Time-Data-Cleaning-Pipeline-for-Medical-and-Healthcare-Data
A real-time data cleaning pipeline for medical and healthcare data using Apache Spark, SparkNLP,...
msloan10/Bitcoin-Dashboard
A Bitcoin dashboard that incorpoartes sentiment analysis using Twitter data.
adilsaid64/sentiment-stream
An end-to-end real-time data streaming pipeline that leverages Kafka and Spark Streaming to...