egorsmkv/audio-katana

A tool to slice your audio files into chunks using the Voice Activity Detection technique

/ 100

Experimental

This tool helps you automatically split long audio recordings into smaller, manageable segments based on when actual speech or sound is detected. You provide a folder of audio files, and it returns a new folder with individual sound clips, each containing a distinct audio event. It's ideal for anyone who needs to process or analyze spoken content without silence or background noise.

No commits in the last 6 months.

Use this if you need to quickly break down recordings like interviews, lectures, or calls into discrete, shorter audio snippets for easier review or transcription.

Not ideal if you need to precisely edit audio based on specific timestamps or musical beats, as it focuses on voice activity rather than general audio editing.

audio-transcription speech-analysis podcast-editing call-center-analytics voice-recording-management

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

speechmatics/speechmatics-python

Python library and CLI for Speechmatics

gooofy/py-nltools

A collection of basic python modules for spoken natural language processing

IBM/MAX-Speech-to-Text-Converter

Converts spoken words into text form.

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...

snakers4/open_stt

Open STT

Explore Voice AI Tools

All categories Trending Voice AI directory Insights