shanghaimoon888/mod_vadasr
A FreeSWITCH module that performs voice activity detection (VAD) and automatic speech recognition (ASR) using the iFLYTEK WebSocket API.
This FreeSWITCH module detects when a caller starts and stops speaking during a call and transcribes the speech. It takes real-time call audio, identifies speech segments, and outputs both the transcribed text and the corresponding audio recordings. It is aimed at call center managers, customer service operations, and anyone who needs to automate call analysis and processing.
No commits in the last 6 months.
Use this if you need to automatically detect speech within your FreeSWITCH calls to trigger actions or to transcribe spoken words into text for analysis or record-keeping.
Not ideal if your primary need is offline audio transcription or if you require an ASR service other than iFLYTEK.
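To make "identifies speech segments" concrete, here is a minimal sketch of an energy-based detector with a hangover counter, written in C like the module itself. It is an illustration under stated assumptions, not the module's actual algorithm: the names `vad_state_t`, `vad_init`, `vad_feed`, and both threshold constants are hypothetical and do not come from the repository. It assumes 16-bit signed PCM frames, for example 160 samples for 20 ms at 8 kHz.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical tuning values, not taken from mod_vadasr. */
#define VAD_ENERGY_THRESHOLD 500 /* mean absolute amplitude that counts as speech */
#define VAD_HANGOVER_FRAMES  3   /* consecutive quiet frames that end a segment */

typedef enum { VAD_NONE, VAD_SPEECH_START, VAD_SPEECH_END } vad_event_t;

typedef struct {
    int in_speech;     /* 1 while inside a speech segment */
    int silent_frames; /* quiet frames seen since the last loud frame */
} vad_state_t;

void vad_init(vad_state_t *s)
{
    s->in_speech = 0;
    s->silent_frames = 0;
}

/* Mean absolute amplitude of one frame of 16-bit PCM samples. */
long frame_mean_abs(const int16_t *pcm, size_t n)
{
    long sum = 0;
    for (size_t i = 0; i < n; i++) {
        long v = pcm[i];
        sum += (v < 0) ? -v : v;
    }
    return sum / (long)n;
}

/* Feed one frame; returns an event when a segment starts or ends. */
vad_event_t vad_feed(vad_state_t *s, const int16_t *pcm, size_t n)
{
    assert(n > 0);
    int loud = frame_mean_abs(pcm, n) > VAD_ENERGY_THRESHOLD;

    if (!s->in_speech) {
        if (loud) {
            s->in_speech = 1;
            s->silent_frames = 0;
            return VAD_SPEECH_START;
        }
        return VAD_NONE;
    }

    if (loud) {
        s->silent_frames = 0; /* still talking; reset the hangover counter */
        return VAD_NONE;
    }

    if (++s->silent_frames >= VAD_HANGOVER_FRAMES) {
        s->in_speech = 0;
        s->silent_frames = 0;
        return VAD_SPEECH_END;
    }
    return VAD_NONE;
}
```

In a setup like the one described, the frames between a `VAD_SPEECH_START` and the matching `VAD_SPEECH_END` would be the audio handed to the ASR service and saved as a recording. The hangover counter keeps short pauses between words from splitting one utterance into several segments.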
Stars: 50
Forks: 27
Language: C
License: not specified
Category:
Last pushed: Jul 01, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shanghaimoon888/mod_vadasr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and...
ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
alphacep/vosk
VOSK Speech Recognition Toolkit
alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
solyarisoftware/voskJs
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.