gqgs/llm100kbench

LLM 100k portfolio management benchmark

/ 100

Experimental

This tool helps financial analysts and portfolio managers track and benchmark the investment decisions made by Large Language Models (LLMs). It takes in LLM-generated investment decisions (like buy/sell orders) and current market data, then outputs updated portfolio holdings and performance metrics. It's designed for professionals exploring how AI can assist with portfolio optimization.

No commits in the last 6 months.

Use this if you are a portfolio manager or financial analyst evaluating the performance and risk-reward profile of investment strategies generated by various LLMs.

Not ideal if you are looking for a tool to manage your personal stock portfolio or if you need to integrate with specific LLMs not supported by the current framework due to API limitations.

portfolio-management investment-analysis quantitative-finance AI-in-finance financial-modeling

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

License

—

Featured in

You're Shipping AI You Can't Measure

Higher-rated alternatives

sierra-research/tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

xlang-ai/OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

bigcode-project/bigcodebench

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

THUDM/AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

scicode-bench/SciCode

A benchmark that challenges language models to code solutions for scientific problems

Explore LLM Tools

All categories Trending LLM Tool directory Insights