All Data Engineering Tools

1,297 tools ranked by quality score · Page 6 of 13

Showing 501–600 of 1,297
# Tool Score Tier
501 Hardork/DataLoom

DataLoom旨在提供复杂的数据转换以及分析服务,用户上传数据源(支持MySQL、API、Excel等),用户可以从多种数据源中创建数据集而不必在意数据...

34
Emerging
502 moj-analytical-services/iam_builder

Little helper to write IAM policies

34
Emerging
503 youssefsoli/IPFE

InterPlanetary File Explorer [UofTHacks X Protocol Labs - Best Use of Estuary]

34
Emerging
504 SourceWatcher/source-watcher-core

PHP ETL engine with pluggable steps: extractors, transformers, loaders

34
Emerging
505 rush-db/rushdb-python

RushDB Python SDK

34
Emerging
506 vnvo/deltaforge

A versatile, high-performance Change Data Capture (CDC) engine built in...

34
Emerging
507 lennox55555/Savvy-CSV

Savvy CSV is an web application designed to effortlessly create the ideal...

34
Emerging
508 tvs-sde/oxford-omop-data-mapper

A documentation-centric DuckDB based ETL tool, implementing transformations...

34
Emerging
509 GSA/coe-hud-acq-hudcentral

A repository for information related to the Contact Center team's HUDCentral need.

34
Emerging
510 turbot/steampipe-plugin-linode

Use SQL to instantly query instances, domains and more from Linode. Open...

33
Emerging
511 joryeugene/dadbod-grip.nvim

Edit database tables like Vim buffers. Staged mutations + live SQL preview,...

33
Emerging
512 IgorNatann/project_e_commerce_dw

DW de e-commerce (Kimball/Star Schema) em SQL Server, com scripts, dados...

33
Emerging
513 turbot/steampipe-plugin-docker

Use SQL to instantly query Dockerfile commands and more from Docker. Open...

33
Emerging
514 AmirhosseinHonardoust/Data-Storytelling-Dashboard

A fully interactive data storytelling dashboard for e-commerce analytics....

33
Emerging
515 guotong1988/Automatic-Label-Error-Correction

Automatic Label Error Correction www.techrxiv.org/users/679328/articles/731085

33
Emerging
516 MTSWebServices/etl-entities

Basic ETL Entity classes for onETL

33
Emerging
517 turbot/steampipe-plugin-namecheap

Use SQL to instantly query Namecheap for domains, DNS host records & more....

33
Emerging
518 illuin-tech/data-pipeline

Library for describing data transformation pipelines by compositing simple...

33
Emerging
519 turbot/steampipe-plugin-onepassword

Use SQL to instantly query 1Password vaults, items, files & more. Open...

33
Emerging
520 tracebloc/data-ingestors

tracebloc data pipeline for training/test dataset setup

33
Emerging
521 sopho-tech/sopho

Open Source Business Intelligence

33
Emerging
522 TA3/bearicade

🐻 Bearicade is an open-souce security framework for High Performance Computing

33
Emerging
523 the-pew-inc/the-pew

ThePew is an advanced system of records that enables enterprises to detect...

33
Emerging
524 Gaurav0807/DBT_Athena_with_Kafka

Kafka Integration with Dbt

33
Emerging
525 eventvisor/eventvisor

Fine-grained control over analytics events and logs via remote configuration

33
Emerging
526 turbot/steampipe-plugin-supabase

Use SQL to instantly query Supabase resources. Open source CLI. No DB required.

33
Emerging
527 rhinempi/sparkhit

sparkhit - analyzing large scale genomic data on the cloud

33
Emerging
528 frectonz/pg-when

Just say when.

33
Emerging
529 neo-technology-field/python-etl-lib

simple lib of ETL building blocks

33
Emerging
530 pr1m8/haive-dataflow

Data processing pipelines and ETL workflows for Haive agents

33
Emerging
531 sul-dlss/libsys-airflow

Airflow DAGS for migrating and managing ILS data into FOLIO along with other...

33
Emerging
532 turbot/steampipe-plugin-ibm

Use SQL to instantly query instances, networks, users and more from IBM...

33
Emerging
533 turbot/steampipe-plugin-jumpcloud

Use SQL to instantly query resources from JumpCloud. Open source CLI. No DB required.

33
Emerging
534 caiopizzol/fipe-data-pipeline

Coleta e processa dados históricos de preços da Tabela FIPE para PostgreSQL.

33
Emerging
535 MTSWebServices/spark-dialect-extension

Extend JDBC types support for Apache Spark.

33
Emerging
536 turbot/steampipe-plugin-heroku

Use SQL to instantly query apps, dynos and more from Heroku. Open source...

32
Emerging
537 MTSWebServices/horizon-hwm-store

Horizon HWM Store for onETL

32
Emerging
538 turbot/steampipe-plugin-trello

Use SQL to instantly query Trello organizations, boards, members,...

32
Emerging
539 turbot/steampipe-plugin-aiven

Use SQL to instantly query Aiven accounts, projects, teams, users & more....

32
Emerging
540 turbot/steampipe-plugin-fly

Use SQL to instantly query fly.io resources. Open source CLI. No DB required.

32
Emerging
541 turbot/steampipe-plugin-updown

Use SQL to instantly query status (e.g. checks, downtimes) from updown.io....

32
Emerging
542 qweliant/ankaa

POC for real-time monitoring and alert system for home hemodialysis,...

32
Emerging
543 thiskevinwang/terraform-provider-pinecone

Terraform Provider for managing resources on Pinecone.io

32
Emerging
544 samvardhan03/statforge

An open-source Python library and command-line interface designed to...

32
Emerging
545 turbot/steampipe-plugin-fastly

Use SQL to instantly query services, ACLs and more from Fastly. Open source...

32
Emerging
546 turbot/steampipe-plugin-env0

Use SQL to instantly query env0 resources. Open source CLI. No DB required.

32
Emerging
547 turbot/steampipe-plugin-awscfn

Use SQL to instantly query resources, data sources and more from AWS...

32
Emerging
548 turbot/steampipe-plugin-urlscan

Use SQL to instantly query urlscan.io. Open source CLI. No DB required.

32
Emerging
549 turbot/steampipe-plugin-pipes

Use SQL to instantly query Turbot Pipes resources across workspaces. Open...

32
Emerging
550 Ryanditko/Roadmap-Projects

A comprehensive collection of 180 curated project ideas across 6 technology...

32
Emerging
551 chayansraj/Python-ETL-pipeline-using-Airflow-on-AWS

This project demonstrates how to build and automate an ETL pipeline written...

32
Emerging
552 worldbank/OvertureLink-Data-Pipeline

This ETL pipeline allows you to query and extract Overture Maps data (such...

32
Emerging
553 masthead-data/terraform-google-masthead-agent

Google Cloud resources for Masthead Data agent integration.

32
Emerging
554 Joerndm/stock_portefolio_builder

Using Machine Learning to predict future stock prices and creating a stock...

32
Emerging
555 bytehub-ai/bytehub

ByteHub: making feature stores simple

32
Emerging
556 Valentine-Nkatha/app-owns-data

📊 Embed Power BI reports with ease using the App Owns Data pattern on...

32
Emerging
557 turbot/steampipe-plugin-splunk

Use SQL to instantly query logs, indexes, apps and more Splunk. Open source...

32
Emerging
558 turbot/steampipe-plugin-mailchimp

Use SQL to instantly query Mailchimp marketing data. Open source CLI. No DB required.

32
Emerging
559 turbot/steampipe-plugin-vercel

Use SQL to instantly query projects, teams, domains and more from Vercel....

32
Emerging
560 turbot/steampipe-plugin-panos

Use SQL to instantly query PAN-OS firewalls, security policies & more. Open...

32
Emerging
561 turbot/steampipe-plugin-planetscale

Use SQL to instantly query PlanetScale databases, branches and more. Open...

32
Emerging
562 Infostrux-Solutions/snowflake-zero-to-chat-with-your-data

Snowflake Hands-on Lab - Zero to Chat with Your Data in 120 Minutes

32
Emerging
563 alireza-heidarii/Real-Time-Data-Cleaning-Pipeline-for-Medical-and-Healthcare-Data

A real-time data cleaning pipeline for medical and healthcare data using...

32
Emerging
564 TelescopeAI/doorbeen

Connect LLMs to SQL Databases

32
Emerging
565 turbot/steampipe-plugin-newrelic

Use SQL to instantly query alerts, events, and more from New Relic. Open...

32
Emerging
566 ankitaaidev/VarunNetra

VarunNetra is an AI-powered software platform that lets researchers,...

32
Emerging
567 giagiannis/data-profiler

Data profiler is an attempt to model the behavior of a given operator for a...

31
Emerging
568 idlab-discover/RustiFlow

Flow feature extraction tool built in Rust using eBPF

31
Emerging
569 pybrave/brave

BRAVE:Bioinformatics Reactive Analysis and Visualization Engine

31
Emerging
570 luciferchase/chase_hospitals

This is a GUI based Python connectivity project on Hospital Management. The...

31
Emerging
571 Paulescu/bytewax-hopsworks-example

Compute and store real-time features for crypto trading using Bytwax (stream...

31
Emerging
572 BBVA/pacarana

A standalone ETL tool to generate advanced features for your Machine...

31
Emerging
573 tushar2704/SQL-Portfolio

Collection of personal SQL projects and queries I've worked on, showcasing...

31
Emerging
574 perguard/pg-streaming-performance-data

Data collection, feature engineering and machine learning of performance traces

31
Emerging
575 zovchik0v/task-management

🛠️ Streamline task management with this full-stack solution featuring...

31
Emerging
576 docglow/docglow

Modern documentation site generator for dbt Core — lineage explorer, health...

31
Emerging
577 cnstlungu/portable-data-stack-bruin

A portable Datamart and Business Intelligence suite built with Docker,...

31
Emerging
578 granthjoshi01/AQI-Analysis-Project

End-to-end AQI data pipeline with automated collection, historical storage,...

31
Emerging
579 hekmon/DiskTools

A set of sysadmin tools to better handle disk management

31
Emerging
580 Mindbaz/python-gpostmaster-domains-datas

Downloads and flattends datas from Google Postmaster Tools (GPT)

31
Emerging
581 Trojan3877/diabetes-prediction-ml-pipeline

The Diabetes Prediction ML Pipeline is a production-ready end-to-end...

31
Emerging
582 elqvixote/metalurgica-data

Open database for metallurgical & mining processes. Synthetic and official...

31
Emerging
583 forshaws/homebrew-oboi

A universal oboi binary installer. Install oboi email fraud scoring into...

31
Emerging
584 Toparvion/heap-talk

PoC for leveraging LLMs to JVM heap dump analysis

31
Emerging
585 eddmann/strava-cli

Strava from your terminal. Pipe it, script it, automate it.

30
Emerging
586 AvaAvarai/Java-Parallel-Coordinates-Vis

Java Parallel Coordinates Visualization Tool, to visualize...

30
Emerging
587 lyrasis/kiba-extend

Extensions to Kiba ETL

30
Emerging
588 seanpm2001/AI2001_Category-Source_Code-SC-SQLPL

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️💾️📜️ The sourceCode:SQLPL category for AI2001,...

30
Emerging
589 everycure-org/kedro-argo

argo-kedro is a kedro-plugin for executing Kedro pipelines on Argo Workflows.

30
Emerging
590 nots1dd/Hydraware

A cross-platform solution toward a more hydro-knolwedgable world. Made for...

30
Emerging
591 Matrix030/SteamLensAI

Game analytics platform that converts Steam review data into actionable...

30
Emerging
592 gumberss/PurchaseListinator

Create your own purchase lists with its items and when you need to buy, use...

30
Emerging
593 wirthal1990-tech/USDA-Phytochemical-Database-JSON

76,907 phytochemical records enriched with PubMed, ClinicalTrials.gov,...

30
Emerging
594 seanpm2001/AI2001_Category-Source_Code-SC-SQL

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️💾️📜️ The sourceCode:SQL category for AI2001, containing...

30
Emerging
595 tarek-clarke/resilient-rap-framework

A resilient, fault‑tolerant telemetry analytics pipeline designed to...

30
Emerging
596 fixedit-ai/fixedit-data-agent-examples

This repository contains resources for the FixedIT Data Agent ACAP including...

29
Experimental
597 ggongg/Power-BI-Fabric-Workshop-

This repository will have datasets which can be used for the workshop as...

29
Experimental
598 runprism/prism

Prism is the easiest way to develop, orchestrate, and execute data pipelines...

29
Experimental
599 refactoring-ai/Data-Collection

Collect refactorings with metrics from java source code.

29
Experimental
600 fishstormX/fishmaple

个人网站 https://www.fishmaple.cn

29
Experimental
« Prev 1 2 3 4 5 6 7 8 11 12 13 Next »