jerryji1993/DNABERT
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
This project offers pre-trained models that understand the "language" of DNA sequences. It helps biologists and geneticists analyze genomic data by taking raw DNA sequences as input and outputting classifications or insights about their function. Researchers can use these models to perform various tasks like identifying regulatory regions or classifying genetic variations.
746 stars.
Use this if you need to analyze large volumes of DNA sequences to find patterns, classify functions, or predict characteristics, especially if you're working with genomic data from multiple species.
Not ideal if you are looking for a simple, out-of-the-box software solution that doesn't require any command-line interaction or model fine-tuning.
Stars
746
Forks
177
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jerryji1993/DNABERT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.