m96-chan/0xBitNet
Run BitNet b1.58 ternary LLMs with WebGPU — in browsers and native apps
This project lets developers embed small, efficient AI language models directly in web applications or desktop software. It takes pre-trained BitNet b1.58 or Falcon-E models as input and outputs generated text, enabling features such as real-time chat or content summarization without a dedicated server. It is aimed at software developers building applications that need on-device AI text generation.
Available on npm.
Use this if you are a developer looking to add fast, locally-run AI language model capabilities to your browser-based or native applications.
Not ideal if you need to run large, complex AI models or require extensive customization beyond what's offered by the supported BitNet architecture.
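Because inference runs on WebGPU, a browser-side integration typically starts by checking that WebGPU is actually available before loading a model. The probe below uses only the standard `navigator.gpu` API; the model-loading step itself is package-specific and is deliberately not shown, since this project's exact API is not documented here.

```typescript
// Minimal WebGPU availability probe. The `GpuProbe` interface mirrors the
// relevant slice of the browser `navigator` object so the check is testable
// outside a browser as well.
interface GpuProbe {
  gpu?: { requestAdapter(): Promise<unknown | null> };
}

async function webgpuAvailable(nav: GpuProbe): Promise<boolean> {
  if (!nav.gpu) return false; // WebGPU API not exposed (older browser)
  const adapter = await nav.gpu.requestAdapter();
  return adapter !== null; // requestAdapter resolves to null when no suitable GPU exists
}

// In a real page you would call: await webgpuAvailable(navigator)
// and only then load a BitNet model through the library.
```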
Stars: 10
Forks: —
Language: TypeScript
License: MIT
Category: —
Last pushed: Mar 08, 2026
Monthly downloads: 206
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/m96-chan/0xBitNet"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
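For programmatic use, a browser or Node client can call the same endpoint with `fetch`. The URL construction below mirrors the `curl` command above; the shape of the JSON response is defined by the API and is not assumed here.

```typescript
// Base path taken from the curl example above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools";

// Build the quality-endpoint URL for a given owner/repo pair.
function qualityUrl(owner: string, repo: string): string {
  // encodeURIComponent guards against unusual characters in names
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality data (requires a fetch-capable runtime: any modern
// browser, or Node 18+). The response shape is whatever the API returns.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) throw new Error(`API error: ${res.status}`);
  return res.json();
}
```

Example call: `fetchQuality("m96-chan", "0xBitNet")` targets the same URL as the `curl` command shown above.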
Higher-rated alternatives
huawei-csl/SINQ
Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method...
SILX-LABS/QUASAR-SUBNET
QUASAR is a long-context foundation model and decentralized evaluation subnet built on Bittensor,
stackblogger/bitnet.js
BitNet.js: a Node.js implementation of Microsoft's bitnet.cpp inference framework.
AnswerDotAI/cold-compress
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking...
FMInference/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.