VladimirZelenokor1/Big-Data-Project---Predicting-Trip-Fares-with-Spark-Hive
A CRISP-DM–based big data pipeline for predicting NYC ride-sharing trip fares: ingesting 2024 TLC data via Sqoop into HDFS/Hive, performing ETL and feature engineering with Spark & PySpark, training and tuning Linear Regression & Gradient Boosted Tree models, and outlining end-to-end deployment.
No commits in the last 6 months.
Stars: 1
Forks: -
Language: Java
License: -
Category:
Last pushed: May 29, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/VladimirZelenokor1/Big-Data-Project---Predicting-Trip-Fares-with-Spark-Hive"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
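The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function names `build_quality_url` and `fetch_quality` are hypothetical, the URL shape (`/api/v1/quality/<category>/<owner>/<repo>`) is inferred from the curl command above, and the response format is not documented here, so the code assumes JSON and falls back to raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is
# assumed to be <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build a request URL in the same shape as the curl example (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumption: the endpoint returns JSON. If the body is not valid JSON,
    the raw text is returned instead.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("ml-frameworks", "VladimirZelenokor1", "Big-Data-Project---Predicting-Trip-Fares-with-Spark-Hive")` issues the same GET request as the curl command. Keyless access is limited to 100 requests/day, so callers may want to cache results.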
Higher-rated alternatives
nasa/ML-airport-taxi-out
The ML-airport-taxi-out software is developed to provide a reference implementation to serve as...
MaxHalford/taxi-demo-rp-mz-rv-rd-st
🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi...
codelibra/Time-series-analysis-nyc-taxi
⏰ 📓 Time series analysis of New York taxi data
bartk97/NYC-Taxi-Anomaly-Detection
Final Project for the 'Machine Learning and Deep Learning' Course at AGH Doctoral School
RozhanMK/Taxi-demand-prediction
This repository contains the main project of the Rahnema College machine learning bootcamp.