syedsalman137/mamba-inference-in-c
Inference mamba in one file of C
This project helps software developers integrate high-performance Mamba language models directly into their C applications. It takes pre-trained Mamba model files and a tokenizer as input, allowing the application to generate text or engage in chat interactions efficiently. This tool is designed for C/C++ developers who are building applications that require fast, localized AI text generation capabilities.
No commits in the last 6 months.
Use this if you are a C/C++ developer who needs to run Mamba large language models for text generation or chatbot functionality directly within your application, without relying on Python environments or larger frameworks.
Not ideal if you are not a C/C++ developer, or if you need to train Mamba models rather than just performing inference with pre-trained ones.
Stars
8
Forks
1
Language
C
License
MIT
Category
Last pushed
Jan 28, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/syedsalman137/mamba-inference-in-c"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Dobiasd/frugally-deep
A lightweight header-only library for using Keras (TensorFlow) models in C++.
flashlight/flashlight
A C++ standalone library for machine learning
NVlabs/tiny-cuda-nn
Lightning fast C++/CUDA neural network framework
KasperskyLab/knp
Kaspersky Neuromorphic Platform
wichtounet/dll
Fast Deep Learning Library (DLL) for C++ (ANNs, CNNs, RBMs, DBNs...)