Future-House/BixBench

Benchmark for LLM-based Agents in Computational Biology

/ 100

Emerging

This project provides a standardized way to test how well AI agents can handle real-world computational biology tasks. It takes in an AI agent and a set of bioinformatics problems, then outputs a performance evaluation, showing how accurately the agent can explore datasets, perform multi-step analyses, and interpret results. This is useful for researchers and developers working on AI agents for biological research.

No commits in the last 6 months.

Use this if you are developing or evaluating AI agents designed to solve complex problems in bioinformatics and want to rigorously benchmark their capabilities against a diverse set of real-world scenarios.

Not ideal if you are looking for a tool to directly perform bioinformatics analyses or develop new biological models, as this is a benchmarking framework, not an analysis tool.

computational-biology bioinformatics AI-agent-evaluation biological-data-analysis scientific-AI-benchmarking

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

helicalAI/helical

A framework for state-of-the-art pre-trained bio foundation models on genomics and...

instadeepai/nucleotide-transformer

Foundation Models for Genomics & Transcriptomics

ML-Bioinfo-CEITEC/genomic_benchmarks

Benchmarks for classification of genomic sequences

FunctionLab/selene

a framework for training sequence-level deep learning networks

modernatx/seqlike

Unified biological sequence manipulation in Python

Explore ML Frameworks

All categories Trending ML Framework directory Insights