projectglow/glow

An open-source toolkit for large-scale genomic analysis

/ 100

Established

This toolkit helps bioinformaticians and genetic researchers process extremely large genomic datasets, like those found in biobanks. It takes raw genomic data files (VCF, BGEN, Plink) and allows users to perform quality control, normalize variants, conduct genome-wide association studies, and integrate with other health data. The output is analyzed genomic insights, scaled to handle massive volumes of data.

296 stars.

Use this if you are a bioinformatician or geneticist working with very large genomic datasets and need to perform complex analyses and integrate different data types at scale.

Not ideal if you are working with small genomic datasets or prefer not to use the Apache Spark ecosystem for your analyses.

genomic-analysis bioinformatics genetic-research biobank-data population-genetics

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

How are scores calculated?

Stars

296

Forks

118

Language

Scala

License

Apache-2.0

Related frameworks

tensorflow/tfx

TFX is an end-to-end platform for deploying production ML pipelines

VowpalWabbit/vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with...

yahoo/TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

Wei-1/Scala-Machine-Learning

No Dependency Scala Machine Learning Algorithm Gallery

yoshoku/rumale

Rumale is a machine learning library in Ruby

Explore ML Frameworks

All categories Trending ML Framework directory Insights