salute-developers/GigaAM

Foundational Model for Speech Recognition Tasks

73 / 100

Verified

GigaAM is a collection of speech recognition and emotion recognition models designed primarily for Russian. Given spoken audio, the models can transcribe it to text, with punctuation and word-level timestamps, or identify the emotion expressed in the speech. It suits developers and researchers building Russian voice assistants, call center analytics, or content transcription.

504 stars. Actively maintained with 9 commits in the last 30 days. Available on PyPI.

Use this if you need accurate speech-to-text transcription or emotion recognition for Russian-language audio, especially on challenging or diverse speech data.

Not ideal if you mainly work with languages other than Russian, or if you want a ready-to-use application rather than a foundational model to integrate into a larger system.

Tags: speech-to-text, voice-processing, call-center-analytics, emotion-recognition, Russian-language

Maintenance 17 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 21 / 25

Stars

504

Language

Python

License

MIT

Related tools

SuyashMore/MevonAI-Speech-Emotion-Recognition

Identify the emotion of multiple speakers in an Audio Segment

NotAbhinavGamerz/emotion-aware-automatic-speech-recognition

🎤 Enhance speech recognition by detecting emotions in spoken language, combining OpenAI's...

jsugg/ser

The AI-powered ser Python package is a tool for recognizing and analyzing emotions in speech....

saky-semicolon/Emotion-Aware-AI-Support-System

A smart AI-powered platform that detects emotions from student voice input, classifies their...

AkishinoShiame/Chinese-Speech-Emotion-Datasets

Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.
