ML-Bioinfo-CEITEC/genomic_benchmarks

Benchmarks for classification of genomic sequences

54
/ 100
Established

This project helps bioinformaticians and genomics researchers evaluate how well different machine learning models can classify genomic sequences. It provides standardized collections of genomic data, like human promoter or enhancer sequences, as input. The output helps you understand which models are best for tasks such as identifying functional regions in DNA based on sequence data.

173 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are a bioinformatician or computational biologist developing or testing machine learning models for genomic sequence classification and need standardized datasets and evaluation metrics.

Not ideal if you are looking for a pre-trained model for direct application, rather than a resource for model development and benchmarking.

genomics bioinformatics DNA-sequence-analysis machine-learning-in-genomics genomic-data-classification
Stale 6m
Maintenance 2 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 17 / 25

How are scores calculated?

Stars

173

Forks

24

Language

Jupyter Notebook

License

Apache-2.0

Category

dna-sequence-ml

Last pushed

Aug 14, 2025

Commits (30d)

0

Dependencies

9

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ML-Bioinfo-CEITEC/genomic_benchmarks"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.