nathanielknight/llm-questioncache
An LLM plugin to efficiently pose questions to LLMs, cache the answers, and quickly retrieve answers to questions that you've already posed.
This tool helps developers efficiently manage their interactions with Large Language Models (LLMs). It takes natural language questions as input, routes them to an LLM if new, or retrieves previously stored answers. The output is a concise answer to the question, and it's designed for developers who frequently query LLMs and want to save time and reduce API costs.
No commits in the last 6 months.
Use this if you are a developer who repeatedly asks similar questions to LLMs and wants to quickly retrieve past answers without incurring new API calls.
Not ideal if you rarely interact with LLMs or if every question you ask is unique and requires a fresh response.
Stars
10
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 09, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/nathanielknight/llm-questioncache"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
aiming-lab/SimpleMem
SimpleMem: Efficient Lifelong Memory for LLM Agents
zilliztech/memsearch
A Markdown-first memory system, a standalone library for any AI agent. Inspired by OpenClaw.
microsoft/kernel-memory
Research project. A Memory solution for users, teams, and applications.
zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
TeleAI-UAGI/telemem
TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication,...