lukeewin/FunASR_API

这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.

/ 100

Emerging

This project helps you accurately transcribe audio recordings, even when multiple people are speaking. You input an audio file or URL, and it outputs the spoken text, broken down by speaker and time. This is ideal for anyone needing to create text records of meetings, interviews, or any multi-speaker audio.

Use this if you need to convert audio containing multiple speakers into a detailed text transcript with speaker identification and timestamps.

Not ideal if you're looking for a user-friendly application with a graphical interface, as this project requires technical setup and API calls.

meeting-transcription interview-analysis audio-logging speech-to-text conversation-analysis

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 13 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

HTML

License

MIT

Higher-rated alternatives

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Python的音频工具

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

Explore Voice AI Tools

All categories Trending Voice AI directory Insights