Scikit-learn Pipelines ML Frameworks

End-to-end ML pipeline implementations using scikit-learn, focusing on workflow orchestration, preprocessing integration, and production-ready pipeline patterns. Does NOT include general ML tutorials, dataset collections, or frameworks that don't emphasize pipeline construction.

There are 29 scikit-learn pipeline frameworks tracked. One scores above 70 (Verified tier). The highest-rated is scverse/anndata at 82/100 with 720 stars. One of the top 10 is actively maintained.

Get all 29 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=scikit-learn-pipelines&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
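For readers who would rather call the endpoint from Python than from curl, the request above can be sketched with the standard library alone. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not described on this page (so the result is returned as decoded JSON without further parsing), and whether the API accepts a `limit` larger than the 20 used in the curl example is an assumption left to the caller.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks", subcategory="scikit-learn-pipelines", limit=20):
    """Build the request URL with the same query parameters as the curl example.

    Hypothetical helper; the defaults mirror the values shown on this page.
    """
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=10):
    """Fetch the listing and decode the JSON body.

    The response structure is not documented here, so the decoded
    object is returned as-is for the caller to inspect.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_projects()` performs one request against the daily quota mentioned above. Inspecting the returned object first (for example with `type(...)` or `json.dumps(..., indent=2)`) is a safer starting point than assuming particular field names.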

| # | Framework | Description | Score | Tier |
|---|-----------|-------------|-------|------|
| 1 | scverse/anndata | Annotated data. | 82 | Verified |
| 2 | koaning/scikit-lego | Extra blocks for scikit-learn pipelines. | 64 | Established |
| 3 | googleapis/python-bigquery-dataframes | BigQuery DataFrames (also known as BigFrames) | 59 | Established |
| 4 | bigmlcom/python | Python bindings for BigML.io | 53 | Established |
| 5 | posit-dev/orbital | Turn SciKitLearn pipelines into SQL | 48 | Emerging |
| 6 | getyourguide/DDataFlow | A tool to help you to test and develop pyspark code with sampled and local data | 46 | Emerging |
| 7 | mindsdb/type_infer | Type inference for Machine Learning pipelines | 46 | Emerging |
| 8 | ibis-project/ibis-ml | IbisML is a library for building scalable ML pipelines using Ibis. | 42 | Emerging |
| 9 | m-nanda/End-to-End-ML | An "End-to-End Machine Learning" project focuses on building a machine... | 39 | Emerging |
| 10 | maximtrp/tmplot | Visualization of Topic Modeling Results | 37 | Emerging |
| 11 | data-science-lab-amsterdam/skippa | SciKIt-learn Pipeline in PAndas | 36 | Emerging |
| 12 | iMaatin/AutoStats | A library for automatically cleaning, imputing and analyzing datasets with... | 31 | Emerging |
| 13 | layerai-archive/dbt-layer | Layer DBT Adapters | 30 | Emerging |
| 14 | zluvsand/ml_pipeline | ⚡ Sample code for machine Learning Pipeline with Scikit-learn ⚡ | 29 | Experimental |
| 15 | galafis/feature-store-engineering | Feature Store Engineering - Professional Python project | 29 | Experimental |
| 16 | elisim/hydra-sklearn-pipelines | Code accompanying the blogpost: "Creating Configurable Data Pre-Processing... | 27 | Experimental |
| 17 | galafis/Machine-Learning-Pipeline | Professional project by Gabriel Demetrios Lafis | 26 | Experimental |
| 18 | adamduval/ml_snowflake_end_to_end | ❄️ End to End ML workflow in Snowflake. | 23 | Experimental |
| 19 | SathyaPrakashD/ml-pipeline-fundamentals | End-to-end scikit-learn ML pipelines across 6 datasets — classification,... | 22 | Experimental |
| 20 | miheo-al2/sklearn-selector-pipeline | 🔧 Combine feature selectors with classifiers and regressors in a seamless... | 22 | Experimental |
| 21 | chrislemke/sk-transformers | A collection of pandas & scikit-learn compatible transformers for... | 21 | Experimental |
| 22 | Montasir00/Ml_final_project | End-to-End process of building machine learning models | 21 | Experimental |
| 23 | Vidhi1290/Machine-learning-Pipeline | Explore a collection of Jupyter notebooks that guide you through various... | 19 | Experimental |
| 24 | spen-c/ml-portfolio | Machine learning projects built on a modular, config-driven framework... | 19 | Experimental |
| 25 | Fugant1/ml-model-factory | Automated ML pipeline | 19 | Experimental |
| 26 | oumaimabnz/python-data-processing-pipeline | End-to-end Python data processing pipeline for cleaning, analyzing, and... | 14 | Experimental |
| 27 | Shashank911/-End-to-End-Machine-Learning-Pipeline | The objective of this task is to build an end-to-end machine learning... | 13 | Experimental |
| 28 | Hasnat-Aarif-Aslam/Pipelining-How-to-Create-Pipelines-in-Machine-Learning | Pipeline creation using Pipeline and make_pipeline along with... | 11 | Experimental |
| 29 | muhammadhussain-2009/Machine-Learning-Pipeline- | Pipeline Designed to Simplify Complexities of Building ML Models | 11 | Experimental |