rbitr/llm.f90
LLM inference in Fortran
This project lets developers run large language models (LLMs) on their own computers using Fortran. It takes a pre-trained model file (such as a GGUF file) and a text prompt as input, then generates text completions along with performance metrics. It is aimed at developers and researchers who want direct control over LLM inference on CPU without complex frameworks.
No commits in the last 6 months.
Use this if you are a developer who needs to run LLM inference on a CPU with minimal dependencies, wants high performance from a simple, hackable codebase, and wants to integrate or customize the language model at a low level.
Not ideal if you are a non-developer seeking an out-of-the-box application for general LLM use without programming, or if you require extensive multi-platform support or GPU acceleration directly from this tool.
Stars: 64
Forks: 8
Language: Fortran
License: MIT
Category:
Last pushed: May 30, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/rbitr/llm.f90"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
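If you query this endpoint for several repositories, building the URL programmatically is convenient. The sketch below generalizes the path pattern from the single documented example above; the `category`/`owner`/`repo` segmentation is an assumption inferred from that URL, not a documented API contract.

```python
BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_api_url(category: str, owner: str, repo: str) -> str:
    # Assumed pattern, generalized from the one documented example:
    #   https://pt-edge.onrender.com/api/v1/quality/transformers/rbitr/llm.f90
    # Here "transformers" is taken to be the category segment.
    return f"{BASE}/{category}/{owner}/{repo}"

# Reproduces the documented example URL:
print(quality_api_url("transformers", "rbitr", "llm.f90"))
```

You could then fetch it with any HTTP client (e.g. `curl` or Python's `urllib.request`); note the stated rate limit of 100 requests/day without a key.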
Higher-rated alternatives
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
alibaba/MNN
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering...
xorbitsai/inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source,...
tensorzero/tensorzero
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM...