lamalab-org/macbench
Probing the limitations of multimodal language models for chemistry and materials research
This tool helps chemistry and materials science researchers evaluate how well advanced AI models (multimodal language models) understand and respond to questions using both text and images in your field. You input a multimodal language model and a set of chemistry/materials research tasks, and it provides a report on the model's performance across various stages of scientific work. This is designed for scientists, engineers, and researchers who want to assess or compare AI models for scientific discovery workflows.
Use this if you need to objectively benchmark and understand the strengths and weaknesses of different multimodal AI models for tasks in chemistry and materials research.
Not ideal if you are looking for an AI model to directly perform scientific experiments or data analysis, rather than evaluating the models themselves.
Stars
23
Forks
1
Language
Python
License
MIT
Category
Last pushed
Feb 01, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/lamalab-org/macbench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
theochem/ModelHamiltonian
Generate 1- and 2-electron integrals so that molecular quantum chemistry software can be used...
lamalab-org/chembench
How good are LLMs at chemistry?
pnnl/cactus
LLM Agent that leverages cheminformatics tools to provide informed responses.
jan-janssen/LangSim
Application of Large Language Models (LLM) for computational materials science - visit...
MasterAI-EAM/Darwin
An open-source project dedicated to build foundational large language model for natural science,...