AlanBinu007/AI_Big-Data_Data-Engineering_and_Distributions
Here we created some projects using Kafka, AI , Data virtualization and Machine Learning
This collection of projects helps developers explore fundamental tasks in data engineering, big data processing, and machine learning. It provides examples of setting up and interacting with Kafka for real-time data streaming, performing basic data cleaning, and implementing simple AI/Machine Learning models. The projects demonstrate how to move data through different stages, from ingestion to processing and initial analysis.
No commits in the last 6 months.
Use this if you are a developer looking for hands-on examples to understand the basics of Kafka, data virtualization, data cleaning, and simple machine learning concepts within a big data context.
Not ideal if you are a non-technical end-user looking for a ready-to-use application to solve a specific business problem without writing code.
Stars
32
Forks
18
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 18, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/AlanBinu007/AI_Big-Data_Data-Engineering_and_Distributions"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
lensacom/sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
flink-extended/dl-on-flink
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow,...
tirthajyoti/Spark-with-Python
Fundamentals of Spark with Python (using PySpark), code examples
jadianes/spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython...