egorsmkv/audio-katana
A tool to slice your audio files into chunks using the Voice Activity Detection technique
This tool helps you automatically split long audio recordings into smaller, manageable segments based on when actual speech or sound is detected. You provide a folder of audio files, and it returns a new folder with individual sound clips, each containing a distinct audio event. It's ideal for anyone who needs to process or analyze spoken content without silence or background noise.
No commits in the last 6 months.
Use this if you need to quickly break down recordings like interviews, lectures, or calls into discrete, shorter audio snippets for easier review or transcription.
Not ideal if you need to precisely edit audio based on specific timestamps or musical beats, as it focuses on voice activity rather than general audio editing.
Stars
10
Forks
2
Language
Python
License
—
Category
Last pushed
Feb 13, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/egorsmkv/audio-katana"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT