jerryji1993/DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

61 / 100 (Established)

This project offers pre-trained models that understand the "language" of DNA sequences. It helps biologists and geneticists analyze genomic data by taking raw DNA sequences as input and outputting classifications or insights about their function. Researchers can use these models to perform various tasks like identifying regulatory regions or classifying genetic variations.

746 stars.

Use this if you need to analyze large volumes of DNA sequences to find patterns, classify functions, or predict characteristics, especially if you're working with genomic data from multiple species.

Not ideal if you are looking for a simple, out-of-the-box software solution that doesn't require any command-line interaction or model fine-tuning.

genomics bioinformatics DNA-sequencing gene-expression genetic-variation
No Package, No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

Stars: 746
Forks: 177
Language: Python
License: Apache-2.0
Category: dna-sequence-ml
Last pushed: Jan 22, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jerryji1993/DNABERT"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
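
The same request can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the path pattern `quality/<category>/<owner>/<repo>` is inferred from the single curl example above, the response is assumed to be JSON (the page does not show its shape), and the helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record (hypothetical helper).

    Assumption: the endpoint returns a JSON body; its fields are not
    documented here, so the parsed object is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("nlp", "jerryji1993", "DNABERT"))
```

Calling `fetch_quality("nlp", "jerryji1993", "DNABERT")` issues the network request, and each call counts against the daily request limit stated above.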