scikit-learn-contrib/hdbscan

A high-performance implementation of HDBSCAN clustering.

This tool helps data analysts and researchers find natural groupings (clusters) in complex datasets. You pass in raw data and it returns a label for each data point indicating which cluster it belongs to, even when clusters differ in density or the data contains noise. It suits anyone working with data who needs to identify underlying structure or patterns without extensive trial and error.

3,080 stars. Used by 16 other packages. Actively maintained with 4 commits in the last 30 days. Available on PyPI.

Use this if you need to find clusters quickly and reliably without extensive parameter tuning, especially when you suspect the clusters vary in density or the data contains significant noise.
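
As a rough illustration of the data-in, labels-out workflow, here is a minimal Python sketch. It assumes the `hdbscan` package from PyPI is installed and uses its `HDBSCAN` class with `fit_predict`. The helper names `cluster_points` and `summarize_labels`, and the `min_cluster_size=5` value, are illustrative choices, not anything prescribed by the project.

```python
from collections import Counter

NOISE_LABEL = -1  # the hdbscan package labels unclustered (noise) points as -1


def cluster_points(data, min_cluster_size=5):
    """Return one integer label per row of `data` (array-like, n_samples x n_features)."""
    import hdbscan  # third-party package: pip install hdbscan

    clusterer = hdbscan.HDBSCAN(min_cluster_size=min_cluster_size)
    return clusterer.fit_predict(data)


def summarize_labels(labels):
    """Count the points in each cluster and the points flagged as noise."""
    counts = Counter(int(label) for label in labels)
    n_noise = counts.pop(NOISE_LABEL, 0)
    return {
        "n_clusters": len(counts),
        "cluster_sizes": dict(counts),
        "n_noise": n_noise,
    }
```

Calling `summarize_labels(cluster_points(data))` on an array-like of shape (n_samples, n_features) gives a quick view of how many clusters were found and how many points were left as noise.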

Not ideal if you require every single data point to be assigned to a cluster, as this algorithm is designed to identify and leave out noisy data points.
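
If every point does need a label, one possible workaround is to post-process the output yourself. The sketch below is not part of HDBSCAN: it is a simple nearest-centroid fallback in plain Python that assumes noise points carry the label -1 (as in the `hdbscan` package). The helper name `assign_noise_to_nearest_centroid` is hypothetical, and whether forcing noise into clusters is appropriate depends on the data.

```python
import math

NOISE_LABEL = -1  # assumed noise marker, matching the hdbscan package's convention


def assign_noise_to_nearest_centroid(points, labels):
    """Return a copy of `labels` in which each noise point takes the label
    of the cluster whose centroid is closest (Euclidean distance)."""
    # Group the clustered (non-noise) points by their label.
    members = {}
    for point, label in zip(points, labels):
        if label != NOISE_LABEL:
            members.setdefault(label, []).append(point)

    # With no clusters at all there is nothing to assign noise to.
    if not members:
        return list(labels)

    # Centroid = per-dimension mean of each cluster's points.
    centroids = {
        label: [sum(coords) / len(pts) for coords in zip(*pts)]
        for label, pts in members.items()
    }

    filled = []
    for point, label in zip(points, labels):
        if label == NOISE_LABEL:
            label = min(centroids, key=lambda c: math.dist(point, centroids[c]))
        filled.append(label)
    return filled
```

Keeping the original labels alongside the filled ones preserves the information about which points the algorithm considered noise.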

data-analysis pattern-recognition customer-segmentation anomaly-detection exploratory-data-science

Maintenance 13 / 25

Adoption 15 / 25

Maturity 25 / 25

Community 24 / 25

Stars 3,080

Forks 531

Language Jupyter Notebook

License BSD-3-Clause

Related frameworks

annoviko/pyclustering

pyclustering is a Python, C++ data mining library.

panagiotisanagnostou/HiPart

Hierarchical divisive clustering: algorithm execution, visualization, and interactive visualization.

erdogant/clusteval

Clusteval provides methods for unsupervised cluster validation.

mqcomplab/MDANCE

MDANCE: O(N) clustering for molecular dynamics. Process 1.5M frames in 40min. 8 specialized algorithms.

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's...
