ML-Bioinfo-CEITEC/genomic_benchmarks
Benchmarks for classification of genomic sequences
This project helps bioinformaticians and genomics researchers evaluate how well different machine learning models can classify genomic sequences. It provides standardized collections of genomic data, like human promoter or enhancer sequences, as input. The output helps you understand which models are best for tasks such as identifying functional regions in DNA based on sequence data.
173 stars. No commits in the last 6 months. Available on PyPI.
Use this if you are a bioinformatician or computational biologist developing or testing machine learning models for genomic sequence classification and need standardized datasets and evaluation metrics.
Not ideal if you are looking for a pre-trained model for direct application, rather than a resource for model development and benchmarking.
Stars
173
Forks
24
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Aug 14, 2025
Commits (30d)
0
Dependencies
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ML-Bioinfo-CEITEC/genomic_benchmarks"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
helicalAI/helical
A framework for state-of-the-art pre-trained bio foundation models on genomics and...
instadeepai/nucleotide-transformer
Foundation Models for Genomics & Transcriptomics
FunctionLab/selene
a framework for training sequence-level deep learning networks
modernatx/seqlike
Unified biological sequence manipulation in Python
Future-House/BixBench
Benchmark for LLM-based Agents in Computational Biology