slp-rl/slamkit

SlamKit is an open-source toolkit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day".

Score: 39 / 100 (Emerging)

SlamKit helps researchers and engineers quickly train and evaluate Speech Language Models using audio-only data. It takes raw audio files or pre-processed audio features and outputs a trained SpeechLM, ready for further use. This toolkit is designed for anyone working on speech recognition, synthesis, or understanding who needs an efficient way to develop and fine-tune large speech models.

229 stars. No commits in the last 6 months.

Use this if you are a machine learning researcher or engineer developing Speech Language Models and want to significantly accelerate the training and evaluation process, even on a single GPU.

Not ideal if you are looking for a simple, off-the-shelf solution to use a pre-trained model for inference without needing to train or fine-tune one yourself.

speech-recognition-research audio-language-modeling machine-learning-engineering speech-synthesis AI-model-training
Stale (6m), No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 11 / 25
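
The listing does not say how the overall score is derived. The four category scores above do add up to it, so the short check below treats the overall score as a plain sum. That reading is an assumption, and the variable names are illustrative only.

```python
# Category scores as shown in the listing (each out of 25).
category_scores = {
    "Maintenance": 2,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 11,
}

# Assumption: the overall score is the unweighted sum of the four categories.
overall = sum(category_scores.values())
max_total = 25 * len(category_scores)

print(f"{overall} / {max_total}")
```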


Stars: 229
Forks: 14
Language: Python
License: MIT
Last pushed: May 18, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/slp-rl/slamkit"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
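
The same request can be made from Python with only the standard library. This is a minimal sketch, not an official client. It assumes the URL path is made of a category, an owner, and a repository name (inferred from the single example URL above) and that the endpoint returns JSON. The response schema is not documented here, so the sketch returns whatever is decoded without inspecting fields. The names `quality_url` and `fetch_quality` are hypothetical helpers.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path layout is <category>/<owner>/<repo>, inferred
    from the one example URL in the listing.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response.

    Assumption: the body is JSON; its fields are not documented in the
    listing, so the decoded object is returned unchanged.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("transformers", "slp-rl", "slamkit")
# print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes the path assumption easy to check offline and keeps unauthenticated usage within the 100 requests/day limit while experimenting.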