ASR-Wav2vec-Finetune and wav2vec2-fa
Both projects are fine-tuning Wav2vec2 for speech recognition, making them direct competitors for users seeking a pre-trained ASR model based on this architecture.
About ASR-Wav2vec-Finetune
khanld/ASR-Wav2vec-Finetune
:zap: Finetune Wa2vec 2.0 For Speech Recognition
This tool helps machine learning engineers and researchers adapt pre-trained speech recognition models to their specific audio datasets. You provide audio files along with their corresponding transcripts, and it produces a fine-tuned model capable of converting new audio into text. This is ideal for those who need highly accurate speech-to-text capabilities for specialized language, accents, or acoustic environments.
About wav2vec2-fa
Hamtech-ai/wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook
This model helps you convert spoken Persian (Farsi) audio into written text. You provide audio files sampled at 16kHz, and it outputs the corresponding transcription. It's designed for anyone needing to accurately transcribe Persian speech, whether for documentation, analysis, or accessibility purposes.
Scores updated daily from GitHub, PyPI, and npm data. How scores work