ML4GLand/SeqData

Annotated sequence data

Score: 21 / 100 (Experimental)

This project helps bioinformaticians and computational biologists prepare genomic sequence data for machine learning. It reads raw data from common formats such as FASTA, BigWig, and BAM files and organizes it into a single structured object ready for model training. It is aimed at researchers who build predictive models on genetic sequences.

No commits in the last 6 months.

Use this if you need to efficiently load and manage large genomic datasets, including sequences, coverage, and metadata, to train machine learning models.

Not ideal if your data is not genomic sequence data, or if your analysis does not involve machine learning.

genomic-sequencing bioinformatics computational-biology genomics epigenetics
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25

Stars: 11
Forks:
Language: Jupyter Notebook
License: MIT
Category: dna-sequence-ml
Last pushed: Feb 02, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ML4GLand/SeqData"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
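The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, the category/owner/repository path layout is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: other repositories follow the same
    category/owner/repo layout as the SeqData example.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository without an API key.

    Assumption: the response body is JSON; adjust the decoding
    step if the service returns something else.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("ml-frameworks", "ML4GLand", "SeqData"))
```

Keyless access is limited to 100 requests/day, so a script that polls many repositories would need to cache results or pace its calls.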