ASR-Wav2vec-Finetune and wav2vec2-fa

Both projects are fine-tuning Wav2vec2 for speech recognition, making them direct competitors for users seeking a pre-trained ASR model based on this architecture.

ASR-Wav2vec-Finetune
38
Emerging
wav2vec2-fa
35
Emerging
Maintenance 0/25
Adoption 10/25
Maturity 8/25
Community 20/25
Maintenance 0/25
Adoption 7/25
Maturity 16/25
Community 12/25
Stars: 149
Forks: 32
Downloads:
Commits (30d): 0
Language: Python
License:
Stars: 36
Forks: 5
Downloads:
Commits (30d): 0
Language: Jupyter Notebook
License: AGPL-3.0
No License Stale 6m No Package No Dependents
Stale 6m No Package No Dependents

About ASR-Wav2vec-Finetune

khanld/ASR-Wav2vec-Finetune

:zap: Finetune Wa2vec 2.0 For Speech Recognition

This tool helps machine learning engineers and researchers adapt pre-trained speech recognition models to their specific audio datasets. You provide audio files along with their corresponding transcripts, and it produces a fine-tuned model capable of converting new audio into text. This is ideal for those who need highly accurate speech-to-text capabilities for specialized language, accents, or acoustic environments.

speech-to-text audio-transcription natural-language-processing machine-learning-engineering custom-voice-recognition

About wav2vec2-fa

Hamtech-ai/wav2vec2-fa

fine-tune Wav2vec2. an ASR model released by Facebook

This model helps you convert spoken Persian (Farsi) audio into written text. You provide audio files sampled at 16kHz, and it outputs the corresponding transcription. It's designed for anyone needing to accurately transcribe Persian speech, whether for documentation, analysis, or accessibility purposes.

speech-to-text Persian-language audio-transcription language-processing voice-recognition

Scores updated daily from GitHub, PyPI, and npm data. How scores work