Mvila035/dna_parser
A Python module written in rust to encode DNA sequences for machine learning
This tool helps computational biologists and bioinformaticians prepare DNA or RNA sequences for machine learning models. You input raw genetic sequences, and it transforms them into numerical representations like one-hot encodings, which are essential for algorithms to process biological data. This is ideal for researchers developing predictive models in genetics.
No commits in the last 6 months.
Use this if you need to quickly and efficiently convert large sets of raw DNA or RNA sequences into numerical features for machine learning applications.
Not ideal if you are looking for tools to perform sequence alignment, phylogenetic analysis, or other non-encoding related bioinformatics tasks.
Stars
7
Forks
—
Language
Rust
License
MIT
Category
Last pushed
Jun 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Mvila035/dna_parser"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
helicalAI/helical
A framework for state-of-the-art pre-trained bio foundation models on genomics and...
instadeepai/nucleotide-transformer
Foundation Models for Genomics & Transcriptomics
ML-Bioinfo-CEITEC/genomic_benchmarks
Benchmarks for classification of genomic sequences
FunctionLab/selene
a framework for training sequence-level deep learning networks
modernatx/seqlike
Unified biological sequence manipulation in Python