The Data Engineering Directory

A quality-scored directory of 1,297 data engineering tools, updated daily. Every tool is scored on maintenance, adoption, maturity, and community signals.

Data engineering tools for building data pipelines, ETL workflows, data quality, and data infrastructure.

Verified: 46 tools (score 70–100)
Established: 182 tools (score 50–69)
Emerging: 367 tools (score 30–49)
Experimental: 702 tools (score 10–29)
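
The score bands listed above can be read as a simple lookup from score to tier. The following is a minimal sketch, assuming integer scores and bands that are inclusive at both ends. The names `TIERS`, `TIER_COUNTS`, and `tier_for_score` are hypothetical and are not part of the directory. How the score itself is computed from the maintenance, adoption, maturity, and community signals is not stated here, so the sketch does not attempt it.

```python
from typing import Optional

# Score bands as listed in the directory: (tier name, lowest score, highest score).
TIERS = [
    ("Verified", 70, 100),
    ("Established", 50, 69),
    ("Emerging", 30, 49),
    ("Experimental", 10, 29),
]

# Tool counts per tier as listed in the directory.
TIER_COUNTS = {
    "Verified": 46,
    "Established": 182,
    "Emerging": 367,
    "Experimental": 702,
}


def tier_for_score(score: int) -> Optional[str]:
    """Return the tier whose band contains `score`, or None if no band does."""
    for name, low, high in TIERS:
        if low <= score <= high:
            return name
    # The listing defines no tier below 10 or above 100.
    return None


if __name__ == "__main__":
    print(tier_for_score(95))          # band 70-100
    print(tier_for_score(50))          # band 50-69
    print(sum(TIER_COUNTS.values()))   # total tools across all four tiers
```

Summing the four tier counts gives 1,297, which matches the directory total stated at the top of the page.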

Top tools by quality score

1. PrefectHQ/prefect (score 95): Prefect is a workflow orchestration framework for building resilient data...
2. growthbook/growthbook (score 90): Open Source Feature Flags, Experimentation, and Product Analytics
3. koopjs/koop (score 89): Transform, query, and download geospatial data on the web.
4. pathwaycom/pathway (score 85): Python ETL framework for stream processing, real-time analytics, LLM...
5. dagster-io/dagster (score 84): An orchestration platform for the development, production, and observation...
6. supabase/supabase-py (score 81): Python Client for Supabase. Query Postgres from Flask, Django, FastAPI....
7. dlt-hub/dlt (score 80): data load tool (dlt) is an open source Python library that makes data...
8. meltano/meltano (score 79): Meltano: the declarative code-first data integration engine that powers your...
9. capitalone/locopy (score 79): locopy: Loading/Unloading to Redshift and Snowflake using Python.
10. Unstructured-IO/unstructured (score 79): Convert documents to structured data effortlessly. Unstructured is...
11. apache/hop (score 76): Hop Orchestration Platform
12. apache/superset (score 76): Apache Superset is a Data Visualization and Data Exploration Platform
13. airbytehq/airbyte (score 76): The leading data integration platform for ETL / ELT data pipelines from...
14. pyjanitor-devs/pyjanitor (score 76): Clean APIs for data cleaning. Python implementation of R package Janitor
15. apache/shardingsphere (score 76): Empowering Data Intelligence with Distributed SQL for Sharding, Scalability,...
16. catalyst-cooperative/pudl (score 76): The Public Utility Data Liberation Project provides analysis-ready energy...
17. debezium/debezium (score 76): Change data capture for a variety of databases. Please log issues at...
18. quiltdata/quilt (score 75): Quilt is a Scientific Data Management Platform on AWS that helps teams and...
19. bruin-data/ingestr (score 74): ingestr is a CLI tool to copy data between any databases with a single...
20. apache/incubator-devlake (score 74): Apache DevLake is an open-source dev data platform to ingest, analyze, and...

Browse by category

Data Pipeline Frameworks: 261 tools
SQL Query Adapters: 106 tools
Uncategorized: 75 tools
Data Analytics Platforms: 22 tools
Spark Hadoop ML Pipelines: 20 tools
Business Intelligence Dashboards: 20 tools
Generic Workflow Tools: 17 tools
Subscription Management Demos: 17 tools
Natural Language SQL Builders: 16 tools
CSV Data Chat: 15 tools
AI Business Analytics: 15 tools
Generic Project Templates: 13 tools
ML Experiment Tracking: 11 tools
MLOps Workflow Orchestration: 11 tools
AI Stock Analysis: 11 tools
ML API Deployment: 9 tools
Anomaly Detection Systems: 9 tools
Natural Language SQL Querying: 9 tools
Stock Analysis Dashboards: 9 tools
Data Quality Preprocessing: 8 tools
Rust Tensor Frameworks: 8 tools
Financial News Sentiment: 8 tools
Energy Sector Forecasting: 8 tools
Real Time Threat Detection: 7 tools
Fullstack AI Monorepos: 7 tools
Financial Intelligence RAG: 7 tools
Water Quality Prediction: 7 tools
Finance Dashboard Frameworks: 7 tools
Inventory Management Systems: 7 tools
Ecommerce Customer Analytics: 7 tools
Twitter Sentiment Pipelines: 6 tools
AI Vulnerability Scanning: 6 tools
Semantic Search Applications: 5 tools
Distributed Training Frameworks: 5 tools
Natural Language Database Agents: 5 tools
Scikit-learn Pipelines: 5 tools
AI Workflow Automation: 5 tools
Go ML Bindings: 5 tools
Government Procurement Docs: 5 tools
Document Data Extraction: 4 tools
SQL Database MCP: 4 tools
LLM Data Labeling: 4 tools
Model Inference Serving: 4 tools
Platform SDK Libraries: 4 tools
AI Workflow Orchestration: 4 tools
EV Fleet Optimization: 4 tools
Streamlit ML Dashboards: 4 tools
Wiki Knowledge Retrieval: 4 tools
Data Analytics Agents: 4 tools
RFM Customer Segmentation: 4 tools
ML Model Serving: 4 tools
LangChain Agent Frameworks: 4 tools
Text To SQL Generation: 4 tools
NBA Game Prediction: 4 tools
Customer Churn Prediction: 4 tools
Interactive LLM Applications: 4 tools

AI Test Automation: 3 tools
Open Source Contribution Guides: 3 tools
Text Visualization Graphs: 3 tools
Document Intelligence Extraction: 3 tools
Open Dataset Collections: 3 tools
AWS Bedrock Applications: 3 tools
Julia ML Frameworks: 3 tools
Disk Imaging Tools: 3 tools
Personal Portfolio Showcases: 3 tools
LLM JSON Streaming: 3 tools
Agentic Workflow Orchestration: 3 tools
n8n Workflow Automation: 3 tools
Review Sentiment Classification: 3 tools
AI Investment Platforms: 3 tools
Web Scraping NLP Pipelines: 3 tools
PII Detection Redaction: 3 tools
JavaScript ML Libraries: 3 tools
Fraud Detection ML: 3 tools
Network Intrusion Detection: 3 tools
Stock Market Prediction: 3 tools
Secure E-Voting Systems: 3 tools
Personal Portfolio Websites: 3 tools
Personal Finance Tracking: 3 tools
Data Science Bootcamp Portfolios: 3 tools
Travel Planning AI: 3 tools
Nutrition AI Apps: 3 tools
AI Professional Portfolios: 3 tools
Real Estate Valuation Apps: 3 tools
Financial Sentiment Analysis: 3 tools
Commodity Price Forecasting: 3 tools
Healthcare AI Diagnostics: 3 tools
Rust ML Libraries: 3 tools
Demand Forecasting Systems: 3 tools
Traffic Accident Prediction: 3 tools
Recipe Recommendation Systems: 3 tools

SaaS AI Platforms: 2 tools
Local Semantic Search: 2 tools
Scala ML Frameworks: 2 tools
Data Warehouse MCP: 2 tools
Code Context Packaging: 2 tools
MLOps Framework Directories: 2 tools
Go Agent Frameworks: 2 tools
Mojo ML Frameworks: 2 tools
ICU Patient Risk Prediction: 2 tools
OpenAPI MCP Generation: 2 tools
Resume Job Matching: 2 tools
Multi Agent Orchestration: 2 tools
Geospatial ML Tools: 2 tools
Audio Transcription Apps: 2 tools
Code Quality Analysis: 2 tools
Academic Capstone Projects: 2 tools
Disaster Prediction ML: 2 tools
Claude Skill Orchestration: 2 tools
Copilot Educational Curriculum: 2 tools
Personal Blogs Portfolios: 2 tools
Power Transformer Design: 2 tools
Local LLM Deployment: 2 tools
Telemedicine Consultation Platforms: 2 tools
Blackroad OS Ecosystem: 2 tools
Web3 Contract Security: 2 tools
Air Quality Forecasting: 2 tools
LLM Web Scraping: 2 tools
Agentic RAG Systems: 2 tools
Taxi Fare Prediction: 2 tools
LLM Cost Tracking: 2 tools
AI Text Humanization: 2 tools
GitHub Repository Analysis: 2 tools
Event Discovery Platforms: 2 tools
Vector DB From Scratch: 2 tools
Sentiment Analysis Applications: 2 tools
Document Intelligence RAG: 2 tools
Agentic Workflow Builders: 2 tools
Task Management MCP: 2 tools
Financial News RAG: 2 tools
Job Market Analytics: 2 tools
LLM Recommendation Systems: 2 tools
Question Answering Systems: 2 tools
Space Hazard Detection: 2 tools
Kubernetes ML Deployment: 2 tools
AI Investment Analysis: 2 tools
Web Framework Templates: 2 tools
Career Guidance AI: 2 tools
Semantic Search Engines: 2 tools
Spring AI Applications: 2 tools
Retail Sales Forecasting: 2 tools
Crypto Price Prediction: 2 tools
Production Copilot Systems: 2 tools
RAG Techniques Frameworks: 2 tools
Web Scraping Tools: 2 tools
Web To Markdown RAG: 2 tools
Email AI Automation: 2 tools
Domain Specific AI Assistants: 2 tools
Codebase Chat RAG: 2 tools
Crop Yield Prediction: 2 tools

Agentic AI Frameworks: 1 tool
Rust Native Vector DBs: 1 tool
Slack MCP Servers: 1 tool
Obsidian AI Plugins: 1 tool
Production RAG Pipelines: 1 tool
LLM Data Visualization: 1 tool
LangChain Tool Integrations: 1 tool
AutoML Frameworks: 1 tool
Chatbot Frameworks: 1 tool
Self Hosted RAG Platforms: 1 tool
Regional Fiscal Data: 1 tool
Postgres Vector RAG: 1 tool
Edge Device ML Frameworks: 1 tool
LangChain Starter Projects: 1 tool
React Speech Recognition: 1 tool
AWS Cloud Services: 1 tool
LLM Evaluation Platforms: 1 tool
MCP Client Configuration: 1 tool
Social Media Trends: 1 tool
Network Traffic Classification: 1 tool
LangChain Application Tutorials: 1 tool
Vector DB Benchmarking: 1 tool
File Content Extraction: 1 tool
ML Platform Builder: 1 tool
Bayesian Inference Frameworks: 1 tool
NLP Dataset Collections: 1 tool
Telegram Bot Integration: 1 tool
mlr3 Ecosystem: 1 tool
OpenAPI Client Generators: 1 tool
Generative AI Workshops: 1 tool
Election Sentiment Forecasting: 1 tool
Portuguese NLP Tools: 1 tool
Financial Trading ML: 1 tool
Agent Orchestration Platforms: 1 tool
Rust NLP Bindings: 1 tool
Generative AI Education: 1 tool
Accounting Software MCP: 1 tool
AI Search Optimization: 1 tool
Production RAG Systems: 1 tool
Legal Document Analysis: 1 tool
Graph Database RAG: 1 tool
Temporal Expression Parsing: 1 tool
Pinecone Vector Search: 1 tool
Voice AI Learning Collections: 1 tool
ChatGPT Web Automation: 1 tool
LLM Pricing Comparison: 1 tool
Rare Historical Language Datasets: 1 tool
Multi Crop Disease Detection: 1 tool
Ecommerce AI Assistants: 1 tool
MLOps End To End: 1 tool
C Family Language Datasets: 1 tool
Java ML Implementations: 1 tool
Food Nutrition RAG: 1 tool
Esports Match Prediction: 1 tool
GraphQL Code Generators: 1 tool
Text Authenticity Detection: 1 tool
fastText Serving Wrappers: 1 tool
Go LLM Frameworks: 1 tool
Reddit Sentiment Analysis: 1 tool
Wildfire Prediction ML: 1 tool
ML Algorithm Visualizations: 1 tool
PDF Chat Applications: 1 tool
LLM Provider SDKs: 1 tool
AI Engineering Fundamentals: 1 tool
LangChain Chatbot Templates: 1 tool
Regulatory Intelligence MCP: 1 tool
AI Powered SaaS Startups: 1 tool
Movie Revenue Prediction: 1 tool
LLM Evaluation Benchmarking: 1 tool

Credit Risk Modeling: 1 tool
Rust Neural Networks: 1 tool
Zero Knowledge ML: 1 tool
Prompt Crafting Assistants: 1 tool
Domain Specific Workflows: 1 tool
Agentic Team Frameworks: 1 tool
Academic Research RAG: 1 tool
Spotify Music Recommendation: 1 tool
SageMaker ML Platforms: 1 tool
Phishing URL Detection: 1 tool
Text Analysis Visualization: 1 tool
Kubernetes AI Dashboards: 1 tool
Bioacoustic Species Classification: 1 tool
Student Performance Prediction: 1 tool
AI Image Generation Platforms: 1 tool
IBM Professional Certificates: 1 tool
Hackathon Submission Projects: 1 tool
Go LLM SDKs: 1 tool
Genomic Variant Analysis: 1 tool
Financial AI Agents: 1 tool
Claude API Proxies: 1 tool
Neural Architecture Search: 1 tool
Fullstack AI Assistants: 1 tool
Gemini AI Applications: 1 tool
Agent Threat Detection: 1 tool
Phishing Browser Extensions: 1 tool
Ecommerce AI Agents: 1 tool
AI Recipe Generation: 1 tool
Chat Export Tools: 1 tool
Chronic Disease Prediction: 1 tool
Autonomous Trading Agents: 1 tool
OCR Document Extraction: 1 tool
AI Interview Prep: 1 tool
PDF Document Chatbots: 1 tool
Vector Database Libraries: 1 tool
Algorithmic Trading Bots: 1 tool
DNA Sequence ML: 1 tool
Clustering Algorithm Implementations: 1 tool
MCP Server Management: 1 tool
Complaint Classification: 1 tool
Weather Forecasting ML: 1 tool
Workflow Automation Platforms: 1 tool
ML Reference Guides: 1 tool
UiPath RPA Training: 1 tool
Vector Embedding Storage: 1 tool
Go NLP Libraries: 1 tool
Text Preprocessing Pipelines: 1 tool
Web Scraping MCP: 1 tool
Learning Path Roadmaps: 1 tool
n8n Workflow Templates: 1 tool
Developer Portfolio Projects: 1 tool
AI Fitness Coaching: 1 tool
AI Image Generation: 1 tool
Resume Matching Screening: 1 tool
Crime Prediction Analytics: 1 tool
Cloud Infrastructure Agents: 1 tool
Enterprise Agentic RAG: 1 tool
Restaurant Rating Prediction: 1 tool
Laptop Price Prediction: 1 tool
Conversational Chatbot Applications: 1 tool
LinkedIn MCP Servers: 1 tool
AI Red Teaming: 1 tool
ML Benchmarking Frameworks: 1 tool
Football Match Prediction: 1 tool
Marine Ecosystem AI: 1 tool
Multi Disease Risk Assessment: 1 tool
Quantum Machine Learning: 1 tool
R Language ML Education: 1 tool
Agent Crypto Marketplaces: 1 tool
Content To Podcast Converters: 1 tool
Formula 1 Race Prediction: 1 tool
Internship Portfolio Projects: 1 tool

LangGraph Production Agents: 1 tool
Market Research Agents: 1 tool
ML Project Portfolios: 1 tool
Prompt Injection Security: 1 tool
Restaurant Ordering Chatbots: 1 tool
SMS Spam Detection: 1 tool
Spring AI Backends: 1 tool
Traffic Signal Optimization: 1 tool
AI Agent Governance: 1 tool
AI Coding Assistants: 1 tool
AI Marketing Automation: 1 tool
Bank Deposit Prediction: 1 tool
Gemini Python Integrations: 1 tool
Text Summarization Transformers: 1 tool
Personal Portfolio Profiles: 1 tool
PSA RMM Ticketing: 1 tool
GitHub Repository Agents: 1 tool
Agent Memory Systems: 1 tool
Portfolio Showcase Builders: 1 tool
Waste Detection Classification: 1 tool
Production RAG Chatbots: 1 tool
Tokenizer Libraries: 1 tool
Product Review Sentiment: 1 tool
Diabetes Glucose Prediction: 1 tool
Agent Framework Patterns: 1 tool
AI Legal Tech: 1 tool
AI Interview Coaching: 1 tool
Government Open Data: 1 tool
RAG Curated Resources: 1 tool
AI Recruitment Platforms: 1 tool
Tauri ChatGPT Clients: 1 tool
Bike Sharing Demand Prediction: 1 tool
Blockchain AI Integration: 1 tool
Security Testing MCP: 1 tool
AI SaaS Builders: 1 tool
Minecraft AI Agents: 1 tool
Streamlit Agent Dashboards: 1 tool
Product Search Systems: 1 tool
Traffic Flow Prediction: 1 tool
AWS ML Certification: 1 tool
Forest Tree Detection: 1 tool
AI Service SDKs: 1 tool
Semantic Textual Similarity: 1 tool
MNIST Digit Classification: 1 tool
AI Nutrition Analysis: 1 tool
Academic Thesis NLP: 1 tool
Agentic AI Orchestration: 1 tool
Medium Article Collections: 1 tool
AI Project Management: 1 tool
ML Learning Resources: 1 tool
Diffusion Web Interfaces: 1 tool
Personal GitHub Profiles: 1 tool
Data Science Bootcamps: 1 tool
Airbnb Price Prediction: 1 tool
ML Fundamentals Education: 1 tool
Healthcare AI Applications: 1 tool
Hackathon Project Submissions: 1 tool
Titanic Kaggle Competition: 1 tool
YouTube Comment Analysis: 1 tool
Car Price Prediction: 1 tool
Rust LLM Chatbots: 1 tool
COVID-19 Prediction ML: 1 tool
GPT RAG Foundations: 1 tool
Portfolio Optimization ML: 1 tool
Flight Delay Prediction: 1 tool
Pathfinding Algorithm Implementations: 1 tool
Health App Development: 1 tool
Internship Project Portfolios: 1 tool
Document OCR Extraction: 1 tool
Static Site Generators: 1 tool
Gaming Aim Assistance: 1 tool