slp-rl/slamkit

SlamKit is an open-source toolkit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day".

/ 100

Emerging

SlamKit helps researchers and engineers quickly train and evaluate Speech Language Models using audio-only data. It takes raw audio files or pre-processed audio features and outputs a trained SpeechLM, ready for further use. This toolkit is designed for anyone working on speech recognition, synthesis, or understanding who needs an efficient way to develop and fine-tune large speech models.

229 stars. No commits in the last 6 months.

Use this if you are a machine learning researcher or engineer developing Speech Language Models and want to significantly accelerate the training and evaluation process, even on a single GPU.

Not ideal if you are looking for a simple, off-the-shelf solution to use a pre-trained model for inference without needing to train or fine-tune one yourself.

speech-recognition-research audio-language-modeling machine-learning-engineering speech-synthesis AI-model-training

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 11 / 25

Stars

229

Forks

Language

Python

License

MIT

Higher-rated alternatives

TsinghuaC3I/MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

zjunlp/KnowLM

An Open-sourced Knowledgeable Large Language Model Framework.

cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to...

stanleylsx/llms_tool

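A HuggingFace-based tool for training and testing large language models. It supports a web UI and terminal inference for each model, parameter-efficient and full-parameter training (pre-training, SFT, RM, PPO, DPO), and model merging and quantization.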
