pnpnpn/dna2vec
dna2vec: Consistent vector representations of variable-length k-mers
This project helps biological researchers and computational biologists analyze DNA sequences by converting short DNA segments (k-mers) into numerical vectors. You input DNA sequence data, and it outputs these vector representations, which can then be used to compare different sequences and identify relationships. This is useful for anyone working with genomic data who needs to quantify similarities between DNA fragments.
192 stars. No commits in the last 6 months.
Use this if you need to represent DNA k-mers as consistent numerical vectors to find similarities or patterns in genomic data.
Not ideal if you are looking for tools to directly analyze gene expression, protein structure, or perform full genome assembly.
Stars
192
Forks
62
Language
Python
License
MIT
Category
Last pushed
Jun 21, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/pnpnpn/dna2vec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/text2vec
text2vec, text to vector....
predict-idlab/pyRDF2Vec
đ Python Implementation and Extension of RDF2Vec
IntuitionEngineeringTeam/chars2vec
Character-based word embeddings model based on RNN for handling real world texts
IITH-Compilers/IR2Vec
Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.