scikit-learn-contrib/hdbscan

A high performance implementation of HDBSCAN clustering.

77
/ 100
Verified

This tool helps data analysts and researchers find natural groupings (clusters) within their complex datasets. You input your raw data, and it outputs labels indicating which cluster each data point belongs to, even if clusters are of different densities or have noise. It's designed for anyone working with data who needs to identify underlying structures or patterns without extensive trial-and-error.

3,080 stars. Used by 16 other packages. Actively maintained with 4 commits in the last 30 days. Available on PyPI.

Use this if you need to quickly and reliably find clusters in your data, especially when you suspect clusters might have varying densities or your data contains significant noise, without extensive parameter tuning.

Not ideal if you require every single data point to be assigned to a cluster, as this algorithm is designed to identify and leave out noisy data points.

data-analysis pattern-recognition customer-segmentation anomaly-detection exploratory-data-science
Maintenance 13 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 24 / 25

How are scores calculated?

Stars

3,080

Forks

531

Language

Jupyter Notebook

License

BSD-3-Clause

Last pushed

Jan 26, 2026

Commits (30d)

4

Dependencies

4

Reverse dependents

16

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/scikit-learn-contrib/hdbscan"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.