Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
This project helps you find specific information within your collection of documents and audio files using advanced search methods. You feed it various file types like PDFs, Word documents, images, and audio, and it processes them to extract text and generate numerical representations. These representations allow you to perform powerful semantic searches and retrieve relevant content, making it useful for researchers, analysts, or anyone needing to quickly sift through large datasets.
1,048 stars. No commits in the last 6 months.
Use this if you need to perform highly accurate and nuanced searches across a diverse collection of documents, images, and audio files, going beyond simple keyword matching.
Not ideal if you only need basic text search, have a very small collection of simple text files, or prefer a ready-to-use application rather than integrating with an API.
Stars
1,048
Forks
65
Language
Python
License
—
Category
Last pushed
Feb 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Dicklesworthstone/swiss_army_llama"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
voyage-ai/voyageai-python
Voyage AI Official Python Library
awinml/voyage-embedders-haystack
Custom components for Haystack for creating embeddings and reranking documents using the VoyageAI Models.
estebanpdl/osintgpt
An open-source intelligence (OSINT) analysis tool leveraging GPT-powered embeddings and vector...
FullStackWithLawrence/openai-embeddings
OpenAI chatGPT hybrid search and retrieval augmented generation
patelvivekdev/voyageai-ai-provider
The Voyage AI Provider is a provider for the Vercel AI SDK. It provides a simple interface to...