EdVince/whisper-trtllm

Whisper in TensorRT-LLM

/ 100

Experimental

This project converts OpenAI's Whisper speech-to-text models into an optimized TensorRT-LLM format for faster inference on NVIDIA GPUs. It is aimed at developers and MLOps engineers who build or deploy speech recognition applications at scale. The result is a speech recognition engine intended to speed up transcription while maintaining accuracy.

No commits in the last 6 months.

Use this if you are a developer or MLOps engineer who wants to deploy OpenAI's Whisper model for English speech recognition with faster, more efficient inference using NVIDIA TensorRT-LLM.

Not ideal if you are an end user who only wants a speech-to-text application and has no need to optimize model deployment, or if you need support for languages other than English.

speech-to-text model-optimization GPU-acceleration MLOps AI-deployment

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 9 / 25


Language: C++

License: none

Higher-rated alternatives

Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai.

scalable-ml-deep-learning/fine_tune_whisper

Fine-Tune Whisper for Italian ASR with transformers

Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages

This project presents a multilingual pipeline for both speech-to-text and speech-to-speech...

mahiiyh/asr-primer

A complete implementation of an Automatic Speech Recognition (ASR) system using OpenAI's Whisper...
