WhisperLive and WhisperSpeech

These are complementary tools that form a bidirectional speech processing pipeline: WhisperLive enables real-time speech-to-text conversion while WhisperSpeech enables text-to-speech synthesis, allowing audio content to be transcribed and regenerated within a single workflow.
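The round trip described above (speech to text, then text back to speech) can be sketched as a two-stage function. This is only an illustration of the data flow: `round_trip`, `fake_transcribe`, and `fake_synthesize` are hypothetical names, and the two stand-in functions do not call WhisperLive or WhisperSpeech. In a real setup they would be replaced by calls into those projects.

```python
from typing import Callable


def round_trip(
    audio: bytes,
    transcribe: Callable[[bytes], str],
    synthesize: Callable[[str], bytes],
) -> tuple[str, bytes]:
    """Transcribe audio to text, then regenerate audio from that text."""
    text = transcribe(audio)       # speech-to-text stage (WhisperLive's role)
    regenerated = synthesize(text)  # text-to-speech stage (WhisperSpeech's role)
    return text, regenerated


# Hypothetical stand-ins so the sketch runs without either project installed.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_synthesize(text: str) -> bytes:
    return text.upper().encode("utf-8")


text, regenerated = round_trip(b"hello world", fake_transcribe, fake_synthesize)
print(text)
print(regenerated)
```

Passing the two stages in as callables keeps the sketch independent of either project's actual API.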

WhisperLive
Score: 68 (Established)
Maintenance: 20/25
Adoption: 10/25
Maturity: 16/25
Community: 22/25
Stars: 3,894
Forks: 536
Downloads: (not listed)
Commits (30d): 6
Language: Python
License: MIT
No package, no dependents

WhisperSpeech
Score: 50 (Established)
Maintenance: 6/25
Adoption: 10/25
Maturity: 16/25
Community: 18/25
Stars: 4,575
Forks: 269
Downloads: (not listed)
Commits (30d): 0
Language: Jupyter Notebook
License: MIT
No package, no dependents
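
The overall scores above match the sum of the four 25-point categories for each project. The sketch below checks that arithmetic. The summing rule is an assumption inferred from the numbers shown here, not a documented formula, and `SUBSCORES` and `total_score` are illustrative names.

```python
# Sub-scores copied from the comparison above; each category is out of 25.
SUBSCORES = {
    "WhisperLive": {"Maintenance": 20, "Adoption": 10, "Maturity": 16, "Community": 22},
    "WhisperSpeech": {"Maintenance": 6, "Adoption": 10, "Maturity": 16, "Community": 18},
}


def total_score(categories: dict[str, int]) -> int:
    """Assumed rule: the overall score is the plain sum of the categories."""
    return sum(categories.values())


for project, categories in SUBSCORES.items():
    print(f"{project}: {total_score(categories)}")
# prints "WhisperLive: 68" and "WhisperSpeech: 50"
```

Under this reading, the 18-point gap comes from Maintenance (20 vs 6) and Community (22 vs 18), since Adoption and Maturity are identical for the two projects.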

About WhisperLive

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

This tool converts spoken language into written text in near real time, from either a live microphone feed or a pre-recorded audio file. It takes speech as input and returns a text transcription as output. It suits anyone who needs fast audio-to-text conversion for meetings, interviews, or content creation.

Tags: live-captioning, meeting-transcription, audio-to-text, content-creation, interview-analysis

About WhisperSpeech

WhisperSpeech/WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

This project helps content creators, educators, and businesses generate high-quality, natural-sounding speech from written text. You provide text, and it produces an audio file of someone speaking that text. It's especially useful for quickly creating audio content or giving a unique voice to your digital applications.

Tags: audio-content-creation, e-learning, voice-over, multilingual-communication, digital-assistant-voices

Scores updated daily from GitHub, PyPI, and npm data.