yh1008/speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

/ 100

Emerging

This project helps people who mix English and Chinese in the same sentences when speaking, like many bilingual individuals. It takes audio recordings that contain both languages spoken together and accurately transcribes them into text. The output is a written record of the mixed-language speech, useful for anyone who needs to convert their bilingual conversations into text.

No commits in the last 6 months.

Use this if you need to accurately convert spoken audio containing both Chinese and English in the same sentences into a written transcript.

Not ideal if your audio is purely monolingual (only English or only Chinese) or if you require translation between languages rather than transcription of mixed speech.

bilingual-communication speech-to-text mixed-language-transcription audio-processing multilingual-messaging

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 19 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights