All Data Engineering Tools
1,297 tools ranked by quality score · Page 7 of 13
| # | Tool | Score | Tier |
|---|---|---|---|
| 601 |
vishnuvardhanaan/equity-fundamental-engine
Production-style financial data engineering pipeline that standardizes NSE... |
|
Experimental |
| 602 |
jack-tol/usda-food-data-pipeline
Code for the USDA Branded Food Dataset pipeline and the USDA Food Assistant.... |
|
Experimental |
| 603 |
JulieGibbs/greenback-java
Java library to build modern applications with high-def itemized financial... |
|
Experimental |
| 604 |
lezwon/CatalystOps
Semantic cost-linting and performance warnings extension for Databricks in VS Code |
|
Experimental |
| 605 |
Hyperwindmill/morphql
Transform data with queries |
|
Experimental |
| 606 |
Wazzabeee/pyspark-etl-twitter
Implementation of an ETL process for real-time sentiment analysis of tweets... |
|
Experimental |
| 607 |
BirdiD/BirdiDQ
BirdiDQ leverages the power of the Python Great Expectations open-source... |
|
Experimental |
| 608 |
dhchenx/Catla-HS
Catla for Hadoop and Spark (Catla-HS): An open-source system to support... |
|
Experimental |
| 609 |
galafis/distributed-data-processing-pipeline
Enterprise-grade distributed data processing pipeline with Apache Spark... |
|
Experimental |
| 610 |
Xenios91/Byte-Chomp
A Golang tool for obtaining data on Golang binaries in csv format |
|
Experimental |
| 611 |
aduranil/personal-finance-frontend
personal finance mint.com-like site |
|
Experimental |
| 612 |
victorlopes2000/retail-intelligence-platform
๐๏ธ Analyze retail data with our platform, scraping insights from major... |
|
Experimental |
| 613 |
dannyb2018/CPTADataProviderAPI
This is core library for financial data access components in Apache Nifi |
|
Experimental |
| 614 |
andrejanesic/Spark-News-Stock-Market-Prediction
Data science and Spark applied to 7 hypotheses regarding the DJIA stock... |
|
Experimental |
| 615 |
18F/bpa-disaster-data-portal-pilot
The scope of this task is to build a working pilot of a portal that collects... |
|
Experimental |
| 616 |
betoalien/PardoX
PardoX: The Hyper-Fast Data Engine |
|
Experimental |
| 617 |
FurkAlb/Global-Power-Plant-Analysis
Global Power Plant Database Analysis is a Streamlit-based interactive web... |
|
Experimental |
| 618 |
calbergs/spotify-api
Pipeline that extracts data from the Spotify API to build a more detailed... |
|
Experimental |
| 619 |
mzafram2001/football-database-fver
โฝ Football database. Ideal for machine learning, betting and analytics. ๐... |
|
Experimental |
| 620 |
adhamhaithameid/Classroom-Quick-Downloader
A sophisticated cross-browser extension for bulk Google Classroom downloads,... |
|
Experimental |
| 621 |
nodef/extra-pg-english
Converts English query to Informal/Format SQL SELECT. |
|
Experimental |
| 622 |
theBlackfish01/FiberWatchCLI
CLI based interface for Optical Fiber Fault Detection, Diagnosis, and... |
|
Experimental |
| 623 |
ProjectXero/dbds
DBDataSource (dbds) is primarily a lightweight PostgreSQL-backed dataSource... |
|
Experimental |
| 624 |
maengsanha/bigdata
KMU CS Hot Topics in Big Data |
|
Experimental |
| 625 |
contriboss/no_fly_list
A flexible, high-performance tagging system for Rails applications with... |
|
Experimental |
| 626 |
Rakshan-kulkarni/Rakshan-Finance-Tracker
Rakshan/Finance Tracker |
|
Experimental |
| 627 |
NileDB/com.niledb.core
Open-source Data Backend written in Java and based on PostgreSQL & GraphQL. |
|
Experimental |
| 628 |
pmutua/drf_csv_xlsx_file_upload
Demo Django (Django Rest Framework) API uploads .csv/.xlsx for bulk data,... |
|
Experimental |
| 629 |
Surya-Hariharan/ESG-Sustainability-Analysis
Full-stack ESG analytics dashboard for S&P 500 companies with FastAPI,... |
|
Experimental |
| 630 |
abhiram-ar/humane-backend
Event-driven microservices backend for Humane, a behavior-rewarding social... |
|
Experimental |
| 631 |
AlvaroCavalcante/airflow-calendar-plugin
A Google Calendar-style plugin to improve your DAG management with a visual schedule |
|
Experimental |
| 632 |
elevata-labs/elevata
elevata is an Architecture Runtime for modern data platforms โ... |
|
Experimental |
| 633 |
faltz009/Closure-SDK
A hash you can do algebra on โ composable verification for ordered data over... |
|
Experimental |
| 634 |
nvisycom/runtime
Enterprise-grade multimodal redaction runtime that detects and removes... |
|
Experimental |
| 635 |
1712n/dedup-service
A high-performance service designed to eliminate duplicate and... |
|
Experimental |
| 636 |
nicopon/dtpipe
A simple, self-contained CLI for performance-focused data streaming & anonymization. |
|
Experimental |
| 637 |
Aniket-16-S/Product-Scraper
Scrapping products from well known e-com. sites like Amazon, Flipkart and... |
|
Experimental |
| 638 |
MaxHalford/tuna
:fish: A streaming ETL for fish |
|
Experimental |
| 639 |
Galaticos-API/API-3
Projeto da API do primeiro semestre de 2026 |
|
Experimental |
| 640 |
RealAlexandreAI/io-sankey
๐งถ Framework for IO mapping and validation across heterogeneous data. |
|
Experimental |
| 641 |
ThinkThinkAI/ThinkDB
ThinkDB is an easy-to-use SQL client that makes working with your databases... |
|
Experimental |
| 642 |
hummer-team/vault77
LLM, DuckDB, Excel, CSV , Data Analysis |
|
Experimental |
| 643 |
aymane-maghouti/Big-Data-Project
This project aims to predict smartphone prices using a combination of batch... |
|
Experimental |
| 644 |
hariketsheth/BlockChain_FinTech
Cash flow is one of the most critical aspects of the supply chain, and it... |
|
Experimental |
| 645 |
vishnuvardhanaan/equity-fundamental-analytics
Macro-aware, explainable equity analytics system using BronzeโSilverโGold... |
|
Experimental |
| 646 |
tbrus/smartjoin
Deterministic key and join discovery for structured datasets |
|
Experimental |
| 647 |
KasperOmsK/pipefn
pipefn is a Go library for building lazy, functional, and composable... |
|
Experimental |
| 648 |
joaopn/social-data-pipeline
Pipeline for processing, classifying, and ingesting large-scale social data |
|
Experimental |
| 649 |
liuweizhenhaoa/summer
Java |
|
Experimental |
| 650 |
TheCocoTeam/source-watcher-core
PHP ETL engine for building extractโtransformโload pipelines with pluggable... |
|
Experimental |
| 651 |
ReinerCPrecillas/Peek
๐ Monitor your macOS network in real-time with Peekโget instant insights on... |
|
Experimental |
| 652 |
nitish9413/open_auto_loader
OpenAutoLoader: A lightweight, open-source alternative to Databricks Auto... |
|
Experimental |
| 653 |
ankman007/cricket-statsguru
Streamlit-based Nepali cricket visualization dashboard that utilizes python... |
|
Experimental |
| 654 |
feitasIoT/CRose
CRose๏ผChina... |
|
Experimental |
| 655 |
edwinweber/dbt_duckdb_demo_public
Data engineering demo project for Danish Parliament (Folketing) open data โ... |
|
Experimental |
| 656 |
raphaelberly/journal
A movie journal coupled with open IMDb data, and a Flask web-app for easy... |
|
Experimental |
| 657 |
RaySatish/Market-Surveillance-System
Big-data pipeline detecting wash trading, pump & dump, and spoofing in trade... |
|
Experimental |
| 658 |
AhmedMaghawry/SPOFI
Spotfire is a crowd-sourcing tool that can support real-time detection and... |
|
Experimental |
| 659 |
cobluestars/dataherd-raika
"Dataherd-Raika is a library designed to simulate large-scale user behavior... |
|
Experimental |
| 660 |
COS301-SE-2021/Integrated-Data-Intelligence-Suite
The Integrated Data Intelligence Suite is a data-collection and data-mining... |
|
Experimental |
| 661 |
SermetPekin/evdschat
evdschat is an open-source Python package designed to enhance the evdspy... |
|
Experimental |
| 662 |
0xjgv/inconnu
Data privacy tool, for fast & thorough anonymization/pseudonymization, easy... |
|
Experimental |
| 663 |
zatarain/crm-dupkiller
CRM DupKiller - Hack Night @ Cloudflare 2025 ft. Fiberplane, Claude, Elevenlabs |
|
Experimental |
| 664 |
galafis/data-mesh-implementation-framework
Data Mesh concepts in Python - Data Products with schema validation, CRUD,... |
|
Experimental |
| 665 |
yamtimor/BirdLane
Kotlin DSL for expressive, code-first data pipelines, inspired by jazz. |
|
Experimental |
| 666 |
DonkeyKing01/EV-PM-DSS
Prototype decision-support dashboard built on the SCSI-SLM EV design insight... |
|
Experimental |
| 667 |
ps982182/AI-Business-Insights-Dashboard
AI-powered sales analytics dashboard built with Streamlit that generates... |
|
Experimental |
| 668 |
kholdrex/code_to_query
Ask for data in plain English; get validated, parameterized SQL with guardrails. |
|
Experimental |
| 669 |
AmirhosseinHonardoust/Market-IQ
MarketIQ is a full-stack Streamlit + SQL + Prophet dashboard for real-time... |
|
Experimental |
| 670 |
anjanicoder/Lok-Sabha-Election-Analysis
This project focuses on analyzing the Lok Sabha Election data of India. The... |
|
Experimental |
| 671 |
zsoltmester/anomaly-detector
Detect anomaly in call detail records. |
|
Experimental |
| 672 |
Codex56799/dataengineering
๐ Build a containerized data engineering workflow for NYC Yellow Taxi Trip... |
|
Experimental |
| 673 |
salimt/Transfermarkt-ETL-and-LIVE-Scores
asyncIO, Github Actions, GCP, dbt, Terraform, Docker |
|
Experimental |
| 674 |
AmirhosseinHonardoust/Beyond-Charts-Interactive-Storytelling
A comprehensive guide and codebase for building interactive storytelling... |
|
Experimental |
| 675 |
HatiOS-AI/HatiData-SDKs
Local-first data warehouse for AI agents. Write Snowflake-compatible SQL,... |
|
Experimental |
| 676 |
MostafaSensei106/FP-Growth
A high-performance Dart library for FP-Growth algorithm and association rule... |
|
Experimental |
| 677 |
TheoV823/cannabis-price-index
Open-source methodology, SQL, and sample data for a Cannabis Price Index.... |
|
Experimental |
| 678 |
pandabear-neil/microsoft_fabric_mods
Code Snippets, Designs, and other things about building a Data Analytics... |
|
Experimental |
| 679 |
abdullahqaisar/sehatchain
SehatChain, an AI and Blockchain powered tool for researchers and healthcare... |
|
Experimental |
| 680 |
supaglue-labs/typescript-syncer
Quickly sync your customers' CRM data to various destinations |
|
Experimental |
| 681 |
tanmaytanmay47/brazilian-ecommerce-data-warehouse
๐ Analyze Brazilian e-commerce data with this complete Business Intelligence... |
|
Experimental |
| 682 |
developmentseed/skynet-scrub-server
Backing store for developmentseed/skynet-scrub |
|
Experimental |
| 683 |
eduardocornelsen/full-funnel-ai-analytics
Full-Funnel AI Marketing Analytics. A modern data stack powered by dbt... |
|
Experimental |
| 684 |
BlackRoad-Forge/RoadHailoVision
BlackRoad Forge โ hailo vision โ BlackRoad Forge. Enhanced developer tools... |
|
Experimental |
| 685 |
erangi/podcasts
The list of podcasts I listen to |
|
Experimental |
| 686 |
Skeyelab/Zendesk-Data-Collector
Rails ETL for Zendesk โ collects and syncs Zendesk ticket data into PostgreSQL |
|
Experimental |
| 687 |
cypherpunk-symposium/blockchain-data-engineering-toolkit
๐พ blockchain infrastructure projects and resources (e.g., ethereum event... |
|
Experimental |
| 688 |
cyclonite69/shadowcheck-web
ShadowCheck SIGINT Forensics Platform - Real-time wireless network analysis |
|
Experimental |
| 689 |
formeo/igaming-platform
iGaming Platform Core โ Wallet Service, Bonus Engine & ML-powered Fraud... |
|
Experimental |
| 690 |
xxxsleepygamerxxx/directly
๐ Accelerate your browsing with Directly, a Chromium extension for quick... |
|
Experimental |
| 691 |
benzsevern/goldenflow
Data transformation toolkit โ 43+ transforms, 5 domain packs. 10 MCP tools... |
|
Experimental |
| 692 |
RafiQamar/IMDb-Movie-Analysis
This project involves web scraping, data preprocessing, database storage and... |
|
Experimental |
| 693 |
SoftwareTree/gilhari_ecommerce_example
A RESTful Gilhari microservice demonstrating ORM for JSON objects with an... |
|
Experimental |
| 694 |
GSA/coe-hud-acq-advanced-analytics
A repository for information related to the Data Analytics team's Advanced... |
|
Experimental |
| 695 |
GSA/coe-hud-acq-data-visualization
A repository for information related to the Data Analytics team's Data... |
|
Experimental |
| 696 |
shrutikar/DisasterRecord
DisasterRecord- Disaster Response and Relief Coordination pipeline. |
|
Experimental |
| 697 |
tosh2230/stairlight
A data lineage tool detects table dependencies from rendered SQL statements. |
|
Experimental |
| 698 |
wapplewhite4/fastdedup
Fast, memory-efficient dataset deduplication for ML workloads |
|
Experimental |
| 699 |
FranusCode/credit-risk-scoring-sas
Klasyfikacja ryzyka kredytowego klientรณw banku. Projekt obejmuje inลผynieriฤ... |
|
Experimental |
| 700 |
redzeptech/ASENA-ANALYSIS
ASENA-ANALYSIS: A hybrid Intrusion Detection System (IDS) that combines... |
|
Experimental |