Hamtech-ai/wav2vec2-fa

fine-tune Wav2vec2. an ASR model released by Facebook

35
/ 100
Emerging

This model helps you convert spoken Persian (Farsi) audio into written text. You provide audio files sampled at 16kHz, and it outputs the corresponding transcription. It's designed for anyone needing to accurately transcribe Persian speech, whether for documentation, analysis, or accessibility purposes.

No commits in the last 6 months.

Use this if you need highly accurate automatic transcription for Persian (Farsi) speech, especially if you have custom datasets to further refine its performance.

Not ideal if you need to transcribe speech in languages other than Persian, or if your audio is not sampled at 16kHz.

speech-to-text Persian-language audio-transcription language-processing voice-recognition
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 12 / 25

How are scores calculated?

Stars

36

Forks

5

Language

Jupyter Notebook

License

AGPL-3.0

Last pushed

Dec 11, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Hamtech-ai/wav2vec2-fa"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.