zilliz-bootcamp/audio_search
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
This tool helps you organize and search large collections of audio files by their content, rather than just filenames. You provide a collection of audio clips (like sound effects or recordings), and the system processes them to understand what sounds they contain. Then, you can submit a new audio clip and find all the similar-sounding clips within your collection. This is ideal for sound engineers, game developers, or media librarians managing extensive audio libraries.
No commits in the last 6 months.
Use this if you need to quickly find specific types of sounds or similar audio clips within a massive archive, without manually tagging or listening to every file.
Not ideal if you only need to search audio files by metadata like filename, date, or explicit text tags, or if your audio collection is very small.
Stars
28
Forks
7
Language
Python
License
MIT
Category
Last pushed
Aug 10, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/zilliz-bootcamp/audio_search"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ssrajadh/sentrysearch
Semantic search over videos using Gemini Embedding 2.
hayabhay/frogbase
Transform audio-visual content into navigable knowledge.
kyegomez/Pegasus
PegasusX: The Future of Multimodal Embeddings 🦄 🦄
tomfalainen/word_spotting
Semantic and Verbatim Word Spotting in Torch
ashvardanian/SwiftSemanticSearch
Real-time on-device text-to-image and image-to-image Semantic Search with video stream camera...