lissettecarlr/AutomaticSpeechRecognition
Python wrappers for several speech-to-text implementations (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository
This tool helps convert spoken language from audio files into written text. You provide an audio recording, and it outputs the corresponding transcription. It's designed for developers who need to integrate various speech-to-text functionalities into their Python applications.
No commits in the last 6 months.
Use this if you are a Python developer building applications that require converting audio to text and want a flexible way to choose between different underlying speech recognition engines.
Not ideal if you are an end-user looking for a ready-to-use application with a graphical interface for transcribing audio.
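The idea of choosing between underlying engines can be illustrated with a small dispatch table. This is a minimal sketch, not the repository's actual API: the `register` decorator, the `transcribe` function, and the stub backends are all hypothetical names introduced here, and the stubs return placeholder strings instead of running a real model.

```python
from typing import Callable, Dict

# Hypothetical registry mapping an engine name to a transcription function.
# The engine names mirror those listed in the description; the functions
# themselves are illustrative stubs, NOT the repository's real interface.
Transcriber = Callable[[str], str]
_ENGINES: Dict[str, Transcriber] = {}


def register(name: str) -> Callable[[Transcriber], Transcriber]:
    """Decorator that records a transcriber under the given engine name."""
    def decorator(func: Transcriber) -> Transcriber:
        _ENGINES[name] = func
        return func
    return decorator


@register("whisper_offline")
def _whisper_offline_stub(audio_path: str) -> str:
    # Placeholder: a real backend would load a model and decode the audio.
    return f"[whisper_offline transcript of {audio_path}]"


@register("paraformer")
def _paraformer_stub(audio_path: str) -> str:
    # Placeholder for a second backend, to show that dispatch is by name.
    return f"[paraformer transcript of {audio_path}]"


def transcribe(audio_path: str, engine: str = "whisper_offline") -> str:
    """Dispatch to the selected engine, failing clearly on unknown names."""
    try:
        backend = _ENGINES[engine]
    except KeyError:
        raise ValueError(
            f"unknown engine {engine!r}; available: {sorted(_ENGINES)}"
        ) from None
    return backend(audio_path)
```

With this layout, calling code passes only an audio path and an engine name, so swapping engines does not require changes at the call site.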
Stars: 8
Forks: —
Language: Python
License: —
Category:
Last pushed: Feb 13, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lissettecarlr/AutomaticSpeechRecognition"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
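The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the page does not document the response format, so decoding the body as JSON is a guess, and reading the URL path as category, owner, and repository is inferred from the single example above. `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request
from urllib.parse import quote

# Base of the endpoint shown in the curl example.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build a URL with the same shape as the curl example.

    Assumption: the three path segments are category, owner, and repo.
    """
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body as JSON (assumed, not confirmed)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "lissettecarlr", "AutomaticSpeechRecognition")` would issue the same request as the curl command. Given the stated limit of 100 requests/day without a key, callers may want to cache the result.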
Higher-rated alternatives
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
yeyupiaoling/YeAudio
Audio tools for Python
zaigie/FunSpeech
Out-of-the-box speech service for local, private deployment; quickly set up FunASR and CosyVoice2/3 backends