ML-Bioinfo-CEITEC/genomic_benchmarks

Benchmarks for classification of genomic sequences

/ 100

Established

This project helps bioinformaticians and genomics researchers evaluate how well different machine learning models can classify genomic sequences. It provides standardized collections of genomic data, like human promoter or enhancer sequences, as input. The output helps you understand which models are best for tasks such as identifying functional regions in DNA based on sequence data.

173 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are a bioinformatician or computational biologist developing or testing machine learning models for genomic sequence classification and need standardized datasets and evaluation metrics.

Not ideal if you are looking for a pre-trained model for direct application, rather than a resource for model development and benchmarking.

genomics bioinformatics DNA-sequence-analysis machine-learning-in-genomics genomic-data-classification

Stale 6m

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 17 / 25

How are scores calculated?

Stars

173

Forks

Language

Jupyter Notebook

License

Apache-2.0

Related frameworks

helicalAI/helical

A framework for state-of-the-art pre-trained bio foundation models on genomics and...

instadeepai/nucleotide-transformer

Foundation Models for Genomics & Transcriptomics

FunctionLab/selene

a framework for training sequence-level deep learning networks

modernatx/seqlike

Unified biological sequence manipulation in Python

Future-House/BixBench

Benchmark for LLM-based Agents in Computational Biology

Explore ML Frameworks

All categories Trending ML Framework directory Insights