wassemgtk/llm.scala

Extensible implementation of a Language Model (LLM) training framework in Scala.

/ 100

Emerging

This project offers a framework for developers to build, train, and evaluate custom large language models (LLMs) from scratch using Scala. It takes raw text data as input, processes it, and outputs a trained language model capable of generating new text or performing other language tasks. This tool is for software engineers and machine learning practitioners who need to implement or experiment with LLM architectures.

No commits in the last 6 months.

Use this if you are a developer looking to implement, customize, and train transformer-based language models in Scala, with fine-grained control over the architecture and training process.

Not ideal if you are an end-user seeking a ready-to-use LLM, a low-code solution, or if you prefer existing deep learning frameworks like TensorFlow or PyTorch for your model development.

language-model-development machine-learning-engineering natural-language-processing model-training deep-learning-implementation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Scala

License

MIT

Higher-rated alternatives

NX-AI/xlstm

Official repository of the xLSTM.

sinanuozdemir/oreilly-hands-on-gpt-llm

Mastering the Art of Scalable and Efficient AI Model Deployment

DashyDashOrg/pandas-llm

Pandas-LLM

wxhcore/bumblecore

An LLM training framework built from the ground up, featuring a custom BumbleBee architecture...

MiniMax-AI/MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model &...

Explore Transformer Models

All categories Trending Transformer directory Insights