braingpt-lovelab/BrainBench
Source code for the paper "Large language models surpass human experts in predicting neuroscience results".
This project helps neuroscience researchers replicate and extend findings from a study that used Large Language Models (LLMs) to predict neuroscience experimental results. It takes raw experimental data and generates analyses and plots, allowing researchers to reproduce the original paper's figures and findings. It is aimed at neuroscientists and cognitive scientists interested in using or evaluating AI for scientific discovery.
No commits in the last 6 months.
Use this if you are a neuroscience researcher looking to reproduce or build upon the results of the "Large language models surpass human experts in predicting neuroscience results" paper.
Not ideal if you are a general AI developer looking for a framework unrelated to neuroscience research replication.
Stars
85
Forks
12
Language
—
License
Apache-2.0
Category
Last pushed
Nov 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/braingpt-lovelab/BrainBench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
sierra-research/tau2-bench
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
xlang-ai/OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
bigcode-project/bigcodebench
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
scicode-bench/SciCode
A benchmark that challenges language models to code solutions for scientific problems