unmonoqueteclea/voilib

🎧 Podcast Search Engine. Try it now for free or run your own instance.

Archived

/ 100

Emerging

Implements semantic search over podcast transcripts by dividing episodes into ~40-word fragments and storing their embeddings (384-dimensional vectors) in Qdrant. The pipeline chains OpenAI's Whisper for transcription, embedding generation for semantic indexing, and vector similarity search—supporting both RSS-sourced podcasts and custom audio files. Deployable entirely self-hosted via Docker Compose with no external paid dependencies.

No commits in the last 6 months.

Archived Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

GPL-3.0

Higher-rated alternatives

DiceTechJobs/VectorsInSearch

Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the...

IuriiD/pinecone-faiss-pgvector

Comparing vector DBs Pinecone, FAISS & pgvector in combination with OpenAI Embeddings for semantic search

lukovicaleksa/semantic-search-mongodb-fastapi

This project demonstrates how you can enhance standard CRUD operations in your application using...

DrRuin/Personalized-Real-Estate-Agent

In an industry where personalization is key to customer satisfaction, your company wants to...

nmdra/Semantic-Search

A semantic search system built with PostgreSQL and pgvector, powered by Gemini for generating...

Explore Embedding Tools

All categories Trending Embeddings directory Insights