Hamtech-ai/wav2vec2-fa

fine-tune Wav2vec2. an ASR model released by Facebook

/ 100

Emerging

This model helps you convert spoken Persian (Farsi) audio into written text. You provide audio files sampled at 16kHz, and it outputs the corresponding transcription. It's designed for anyone needing to accurately transcribe Persian speech, whether for documentation, analysis, or accessibility purposes.

No commits in the last 6 months.

Use this if you need highly accurate automatic transcription for Persian (Farsi) speech, especially if you have custom datasets to further refine its performance.

Not ideal if you need to transcribe speech in languages other than Persian, or if your audio is not sampled at 16kHz.

speech-to-text Persian-language audio-transcription language-processing voice-recognition

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

AGPL-3.0

Compare

wav2vec2-fa and ASR-Wav2vec-Finetune

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

louiskirsch/speechT

An opensource speech-to-text software written in tensorflow

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights