guxm2021/ALT_SpeechBrain

[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription

/ 100

Emerging

This project helps music professionals, researchers, and anyone working with vocal recordings to accurately transcribe sung lyrics into text. It takes audio files of singing as input and outputs the corresponding lyrics, even when the singing is difficult to understand. It is designed for those who need to convert sung vocals into written words for analysis, indexing, or other applications.

No commits in the last 6 months.

Use this if you need to automatically generate accurate text transcripts from vocal performances, especially when dealing with large volumes of audio or challenging singing styles.

Not ideal if you primarily need to transcribe spoken dialogue rather than sung lyrics, as dedicated speech-to-text tools might be more suitable.

music-transcription vocal-analysis music-information-retrieval audio-processing lyric-generation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Related models

subhasis-ai/Hindi-ASR-Wav2Vec2

This repository demonstrates development of Hindi ASR model using transformers.

guxm2021/MM_ALT

[MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top paper award)

jvel07/wav2vec2_patho

Fine-tuning wav2vec2 to for Pathological Speech Processing

hammaad2002/ASRAdversarialAttacks

An ASR (Automatic Speech Recognition) adversarial attack repository.

maximkm/DLA_ASR_HW

ASR pytorch project

Explore Transformer Models

All categories Trending Transformer directory Insights