yecchen/MIRAI
Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"
This project helps international relations analysts and geopolitical strategists predict future international events. It takes historical data on global events and news articles as input and outputs forecasts about how relations between countries might change. The end-user is typically someone who needs to understand and anticipate geopolitical shifts.
No commits in the last 6 months.
Use this if you need to evaluate how well large language model agents can forecast international events by analyzing historical data and news.
Not ideal if you are looking for a pre-built, production-ready event forecasting application rather than a benchmark for evaluating AI models.
Stars
90
Forks
18
Language
Python
License
—
Category
Last pushed
Jul 02, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/yecchen/MIRAI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MMMU-Benchmark/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal...
pat-jj/DeepRetrieval
[COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome
lupantech/MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
x66ccff/liveideabench
[𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea...
ise-uiuc/magicoder
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct