EdVince/whisper-trtllm
Whisper in TensorRT-LLM
This project helps developers and MLOps engineers optimize the deployment of OpenAI's Whisper speech-to-text models. It takes the original Whisper model and converts it into an optimized format for faster inference on NVIDIA GPUs. The output is a high-performance speech recognition engine that maintains accuracy while significantly speeding up transcription tasks. This is ideal for those building or deploying speech recognition applications at scale.
No commits in the last 6 months.
Use this if you are a developer or MLOps engineer looking to deploy OpenAI's Whisper model for English speech recognition with improved inference speed and efficiency on NVIDIA TensorRT-LLM.
Not ideal if you are an end-user simply looking for a speech-to-text application without needing to optimize model deployment or if you require support for languages other than English.
Stars
17
Forks
2
Language
C++
License
—
Category
Last pushed
Sep 21, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/EdVince/whisper-trtllm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Arkapravo-Ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:
scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers
Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages
This project presents a multilingual pipeline for both speech-to-text and speech-to-speech...
mahiiyh/asr-primer
A complete implementation of an Automatic Speech Recognition (ASR) system using OpenAI's Whisper...