All Data Engineering Tools
1,297 tools ranked by quality score · Page 6 of 13
| # | Tool | Score | Tier |
|---|---|---|---|
| 501 |
Hardork/DataLoom
DataLoom旨在提供复杂的数据转换以及分析服务,用户上传数据源(支持MySQL、API、Excel等),用户可以从多种数据源中创建数据集而不必在意数据... |
|
Emerging |
| 502 |
moj-analytical-services/iam_builder
Little helper to write IAM policies |
|
Emerging |
| 503 |
youssefsoli/IPFE
InterPlanetary File Explorer [UofTHacks X Protocol Labs - Best Use of Estuary] |
|
Emerging |
| 504 |
SourceWatcher/source-watcher-core
PHP ETL engine with pluggable steps: extractors, transformers, loaders |
|
Emerging |
| 505 |
rush-db/rushdb-python
RushDB Python SDK |
|
Emerging |
| 506 |
vnvo/deltaforge
A versatile, high-performance Change Data Capture (CDC) engine built in... |
|
Emerging |
| 507 |
lennox55555/Savvy-CSV
Savvy CSV is an web application designed to effortlessly create the ideal... |
|
Emerging |
| 508 |
tvs-sde/oxford-omop-data-mapper
A documentation-centric DuckDB based ETL tool, implementing transformations... |
|
Emerging |
| 509 |
GSA/coe-hud-acq-hudcentral
A repository for information related to the Contact Center team's HUDCentral need. |
|
Emerging |
| 510 |
turbot/steampipe-plugin-linode
Use SQL to instantly query instances, domains and more from Linode. Open... |
|
Emerging |
| 511 |
joryeugene/dadbod-grip.nvim
Edit database tables like Vim buffers. Staged mutations + live SQL preview,... |
|
Emerging |
| 512 |
IgorNatann/project_e_commerce_dw
DW de e-commerce (Kimball/Star Schema) em SQL Server, com scripts, dados... |
|
Emerging |
| 513 |
turbot/steampipe-plugin-docker
Use SQL to instantly query Dockerfile commands and more from Docker. Open... |
|
Emerging |
| 514 |
AmirhosseinHonardoust/Data-Storytelling-Dashboard
A fully interactive data storytelling dashboard for e-commerce analytics.... |
|
Emerging |
| 515 |
guotong1988/Automatic-Label-Error-Correction
Automatic Label Error Correction www.techrxiv.org/users/679328/articles/731085 |
|
Emerging |
| 516 |
MTSWebServices/etl-entities
Basic ETL Entity classes for onETL |
|
Emerging |
| 517 |
turbot/steampipe-plugin-namecheap
Use SQL to instantly query Namecheap for domains, DNS host records & more.... |
|
Emerging |
| 518 |
illuin-tech/data-pipeline
Library for describing data transformation pipelines by compositing simple... |
|
Emerging |
| 519 |
turbot/steampipe-plugin-onepassword
Use SQL to instantly query 1Password vaults, items, files & more. Open... |
|
Emerging |
| 520 |
tracebloc/data-ingestors
tracebloc data pipeline for training/test dataset setup |
|
Emerging |
| 521 |
sopho-tech/sopho
Open Source Business Intelligence |
|
Emerging |
| 522 |
TA3/bearicade
🐻 Bearicade is an open-souce security framework for High Performance Computing |
|
Emerging |
| 523 |
the-pew-inc/the-pew
ThePew is an advanced system of records that enables enterprises to detect... |
|
Emerging |
| 524 |
Gaurav0807/DBT_Athena_with_Kafka
Kafka Integration with Dbt |
|
Emerging |
| 525 |
eventvisor/eventvisor
Fine-grained control over analytics events and logs via remote configuration |
|
Emerging |
| 526 |
turbot/steampipe-plugin-supabase
Use SQL to instantly query Supabase resources. Open source CLI. No DB required. |
|
Emerging |
| 527 |
rhinempi/sparkhit
sparkhit - analyzing large scale genomic data on the cloud |
|
Emerging |
| 528 |
frectonz/pg-when
Just say when. |
|
Emerging |
| 529 |
neo-technology-field/python-etl-lib
simple lib of ETL building blocks |
|
Emerging |
| 530 |
pr1m8/haive-dataflow
Data processing pipelines and ETL workflows for Haive agents |
|
Emerging |
| 531 |
sul-dlss/libsys-airflow
Airflow DAGS for migrating and managing ILS data into FOLIO along with other... |
|
Emerging |
| 532 |
turbot/steampipe-plugin-ibm
Use SQL to instantly query instances, networks, users and more from IBM... |
|
Emerging |
| 533 |
turbot/steampipe-plugin-jumpcloud
Use SQL to instantly query resources from JumpCloud. Open source CLI. No DB required. |
|
Emerging |
| 534 |
caiopizzol/fipe-data-pipeline
Coleta e processa dados históricos de preços da Tabela FIPE para PostgreSQL. |
|
Emerging |
| 535 |
MTSWebServices/spark-dialect-extension
Extend JDBC types support for Apache Spark. |
|
Emerging |
| 536 |
turbot/steampipe-plugin-heroku
Use SQL to instantly query apps, dynos and more from Heroku. Open source... |
|
Emerging |
| 537 |
MTSWebServices/horizon-hwm-store
Horizon HWM Store for onETL |
|
Emerging |
| 538 |
turbot/steampipe-plugin-trello
Use SQL to instantly query Trello organizations, boards, members,... |
|
Emerging |
| 539 |
turbot/steampipe-plugin-aiven
Use SQL to instantly query Aiven accounts, projects, teams, users & more.... |
|
Emerging |
| 540 |
turbot/steampipe-plugin-fly
Use SQL to instantly query fly.io resources. Open source CLI. No DB required. |
|
Emerging |
| 541 |
turbot/steampipe-plugin-updown
Use SQL to instantly query status (e.g. checks, downtimes) from updown.io.... |
|
Emerging |
| 542 |
qweliant/ankaa
POC for real-time monitoring and alert system for home hemodialysis,... |
|
Emerging |
| 543 |
thiskevinwang/terraform-provider-pinecone
Terraform Provider for managing resources on Pinecone.io |
|
Emerging |
| 544 |
samvardhan03/statforge
An open-source Python library and command-line interface designed to... |
|
Emerging |
| 545 |
turbot/steampipe-plugin-fastly
Use SQL to instantly query services, ACLs and more from Fastly. Open source... |
|
Emerging |
| 546 |
turbot/steampipe-plugin-env0
Use SQL to instantly query env0 resources. Open source CLI. No DB required. |
|
Emerging |
| 547 |
turbot/steampipe-plugin-awscfn
Use SQL to instantly query resources, data sources and more from AWS... |
|
Emerging |
| 548 |
turbot/steampipe-plugin-urlscan
Use SQL to instantly query urlscan.io. Open source CLI. No DB required. |
|
Emerging |
| 549 |
turbot/steampipe-plugin-pipes
Use SQL to instantly query Turbot Pipes resources across workspaces. Open... |
|
Emerging |
| 550 |
Ryanditko/Roadmap-Projects
A comprehensive collection of 180 curated project ideas across 6 technology... |
|
Emerging |
| 551 |
chayansraj/Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written... |
|
Emerging |
| 552 |
worldbank/OvertureLink-Data-Pipeline
This ETL pipeline allows you to query and extract Overture Maps data (such... |
|
Emerging |
| 553 |
masthead-data/terraform-google-masthead-agent
Google Cloud resources for Masthead Data agent integration. |
|
Emerging |
| 554 |
Joerndm/stock_portefolio_builder
Using Machine Learning to predict future stock prices and creating a stock... |
|
Emerging |
| 555 |
bytehub-ai/bytehub
ByteHub: making feature stores simple |
|
Emerging |
| 556 |
Valentine-Nkatha/app-owns-data
📊 Embed Power BI reports with ease using the App Owns Data pattern on... |
|
Emerging |
| 557 |
turbot/steampipe-plugin-splunk
Use SQL to instantly query logs, indexes, apps and more Splunk. Open source... |
|
Emerging |
| 558 |
turbot/steampipe-plugin-mailchimp
Use SQL to instantly query Mailchimp marketing data. Open source CLI. No DB required. |
|
Emerging |
| 559 |
turbot/steampipe-plugin-vercel
Use SQL to instantly query projects, teams, domains and more from Vercel.... |
|
Emerging |
| 560 |
turbot/steampipe-plugin-panos
Use SQL to instantly query PAN-OS firewalls, security policies & more. Open... |
|
Emerging |
| 561 |
turbot/steampipe-plugin-planetscale
Use SQL to instantly query PlanetScale databases, branches and more. Open... |
|
Emerging |
| 562 |
Infostrux-Solutions/snowflake-zero-to-chat-with-your-data
Snowflake Hands-on Lab - Zero to Chat with Your Data in 120 Minutes |
|
Emerging |
| 563 |
alireza-heidarii/Real-Time-Data-Cleaning-Pipeline-for-Medical-and-Healthcare-Data
A real-time data cleaning pipeline for medical and healthcare data using... |
|
Emerging |
| 564 |
TelescopeAI/doorbeen
Connect LLMs to SQL Databases |
|
Emerging |
| 565 |
turbot/steampipe-plugin-newrelic
Use SQL to instantly query alerts, events, and more from New Relic. Open... |
|
Emerging |
| 566 |
ankitaaidev/VarunNetra
VarunNetra is an AI-powered software platform that lets researchers,... |
|
Emerging |
| 567 |
giagiannis/data-profiler
Data profiler is an attempt to model the behavior of a given operator for a... |
|
Emerging |
| 568 |
idlab-discover/RustiFlow
Flow feature extraction tool built in Rust using eBPF |
|
Emerging |
| 569 |
pybrave/brave
BRAVE:Bioinformatics Reactive Analysis and Visualization Engine |
|
Emerging |
| 570 |
luciferchase/chase_hospitals
This is a GUI based Python connectivity project on Hospital Management. The... |
|
Emerging |
| 571 |
Paulescu/bytewax-hopsworks-example
Compute and store real-time features for crypto trading using Bytwax (stream... |
|
Emerging |
| 572 |
BBVA/pacarana
A standalone ETL tool to generate advanced features for your Machine... |
|
Emerging |
| 573 |
tushar2704/SQL-Portfolio
Collection of personal SQL projects and queries I've worked on, showcasing... |
|
Emerging |
| 574 |
perguard/pg-streaming-performance-data
Data collection, feature engineering and machine learning of performance traces |
|
Emerging |
| 575 |
zovchik0v/task-management
🛠️ Streamline task management with this full-stack solution featuring... |
|
Emerging |
| 576 |
docglow/docglow
Modern documentation site generator for dbt Core — lineage explorer, health... |
|
Emerging |
| 577 |
cnstlungu/portable-data-stack-bruin
A portable Datamart and Business Intelligence suite built with Docker,... |
|
Emerging |
| 578 |
granthjoshi01/AQI-Analysis-Project
End-to-end AQI data pipeline with automated collection, historical storage,... |
|
Emerging |
| 579 |
hekmon/DiskTools
A set of sysadmin tools to better handle disk management |
|
Emerging |
| 580 |
Mindbaz/python-gpostmaster-domains-datas
Downloads and flattends datas from Google Postmaster Tools (GPT) |
|
Emerging |
| 581 |
Trojan3877/diabetes-prediction-ml-pipeline
The Diabetes Prediction ML Pipeline is a production-ready end-to-end... |
|
Emerging |
| 582 |
elqvixote/metalurgica-data
Open database for metallurgical & mining processes. Synthetic and official... |
|
Emerging |
| 583 |
forshaws/homebrew-oboi
A universal oboi binary installer. Install oboi email fraud scoring into... |
|
Emerging |
| 584 |
Toparvion/heap-talk
PoC for leveraging LLMs to JVM heap dump analysis |
|
Emerging |
| 585 |
eddmann/strava-cli
Strava from your terminal. Pipe it, script it, automate it. |
|
Emerging |
| 586 |
AvaAvarai/Java-Parallel-Coordinates-Vis
Java Parallel Coordinates Visualization Tool, to visualize... |
|
Emerging |
| 587 |
lyrasis/kiba-extend
Extensions to Kiba ETL |
|
Emerging |
| 588 |
seanpm2001/AI2001_Category-Source_Code-SC-SQLPL
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️💾️📜️ The sourceCode:SQLPL category for AI2001,... |
|
Emerging |
| 589 |
everycure-org/kedro-argo
argo-kedro is a kedro-plugin for executing Kedro pipelines on Argo Workflows. |
|
Emerging |
| 590 |
nots1dd/Hydraware
A cross-platform solution toward a more hydro-knolwedgable world. Made for... |
|
Emerging |
| 591 |
Matrix030/SteamLensAI
Game analytics platform that converts Steam review data into actionable... |
|
Emerging |
| 592 |
gumberss/PurchaseListinator
Create your own purchase lists with its items and when you need to buy, use... |
|
Emerging |
| 593 |
wirthal1990-tech/USDA-Phytochemical-Database-JSON
76,907 phytochemical records enriched with PubMed, ClinicalTrials.gov,... |
|
Emerging |
| 594 |
seanpm2001/AI2001_Category-Source_Code-SC-SQL
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️💾️📜️ The sourceCode:SQL category for AI2001, containing... |
|
Emerging |
| 595 |
tarek-clarke/resilient-rap-framework
A resilient, fault‑tolerant telemetry analytics pipeline designed to... |
|
Emerging |
| 596 |
fixedit-ai/fixedit-data-agent-examples
This repository contains resources for the FixedIT Data Agent ACAP including... |
|
Experimental |
| 597 |
ggongg/Power-BI-Fabric-Workshop-
This repository will have datasets which can be used for the workshop as... |
|
Experimental |
| 598 |
runprism/prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines... |
|
Experimental |
| 599 |
refactoring-ai/Data-Collection
Collect refactorings with metrics from java source code. |
|
Experimental |
| 600 |
fishstormX/fishmaple
个人网站 https://www.fishmaple.cn |
|
Experimental |