IngestAI/embedditor
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
This tool helps anyone working with Large Language Models (LLMs) to refine the text data before it's processed into numerical embeddings. You can upload content, visually edit chunks, metadata, and tokens, then download the pre-processed data as a .veml or .json file, leading to more accurate and cost-efficient vector search results. It's designed for anyone managing data for AI applications.
229 stars. No commits in the last 6 months.
Use this if you need fine-grained control over how your text content is prepared for vector embeddings, ensuring better relevance and reduced costs for your LLM or AI search applications.
Not ideal if you prefer fully automated, hands-off data preparation for your embeddings, without needing to manually inspect and edit content chunks.
Stars
229
Forks
13
Language
PHP
License
AGPL-3.0
Category
Last pushed
Nov 21, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/IngestAI/embedditor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
different-ai/embedbase
A dead-simple API to build LLM-powered apps
guangzhengli/vectorhub
Quickly and easily build AI website or application by using embeddings!
yuniko-software/qwen3-tokenizer-dotnet
Multi-language BPE tokenizer implementation for Qwen3 models. Lightweight byte-pair encoding for C#/.NET
ux-nl/laravel-embeddings
Create embeddings for your Eloquent models to use with OpenAI
mpilhlt/embapi
EmbAPI (/ɛmˈbɑːpeɪ/) ⚽, a RESTful Embeddings API