wassemgtk/llm.scala
Extensible implementation of a large language model (LLM) training framework in Scala.
This project offers a framework for developers to build, train, and evaluate custom large language models (LLMs) from scratch using Scala. It takes raw text data as input, processes it, and outputs a trained language model capable of generating new text or performing other language tasks. This tool is for software engineers and machine learning practitioners who need to implement or experiment with LLM architectures.
No commits in the last 6 months.
Use this if you are a developer looking to implement, customize, and train transformer-based language models in Scala, with fine-grained control over the architecture and training process.
Not ideal if you are an end user seeking a ready-to-use LLM or a low-code solution, or if you prefer established deep learning frameworks such as TensorFlow or PyTorch for model development.
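To make the "raw text in, trained model out" flow described above concrete, here is a minimal sketch in Python. It is not the llm.scala API and not a transformer: it swaps in a character-level bigram model purely to illustrate the three stages (process text, train, generate). The function names `train_bigram_model` and `generate` and the toy corpus are hypothetical.

```python
import random
from collections import Counter, defaultdict


def train_bigram_model(text):
    """Count, for each character, which characters follow it in the text."""
    counts = defaultdict(Counter)
    for current, following in zip(text, text[1:]):
        counts[current][following] += 1
    return counts


def generate(model, start, length, seed=0):
    """Sample up to `length` characters, starting from `start`."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    out = [start]
    for _ in range(length - 1):
        followers = model.get(out[-1])
        if not followers:  # character was never seen with a successor
            break
        chars = list(followers.keys())
        weights = list(followers.values())
        # Pick the next character in proportion to how often it followed.
        out.append(rng.choices(chars, weights=weights, k=1)[0])
    return "".join(out)


# Hypothetical toy corpus standing in for "raw text data".
corpus = "hello world, hello scala"
model = train_bigram_model(corpus)
sample = generate(model, "h", 12)
print(sample)
```

A real transformer replaces the count table with learned parameters and gradient-based training, but the overall shape (tokenize, fit, sample) stays the same.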
Stars: 34
Forks: 3
Language: Scala
License: MIT
Category:
Last pushed: Apr 09, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/wassemgtk/llm.scala"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
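The same request can be sketched in Python using only the standard library. The URL pattern is taken from the curl command above; treating the `transformers` path segment as a category parameter and the response body as JSON are assumptions, and the helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, repo):
    """Build the endpoint URL; repo is given as 'owner/name'."""
    # quote() keeps '/' by default, so 'owner/name' stays a two-segment path.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category, repo, timeout=10):
    """GET the endpoint and decode the body, assuming it is JSON."""
    url = quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


url = quality_url("transformers", "wassemgtk/llm.scala")
print(url)
```

Calling `fetch_quality("transformers", "wassemgtk/llm.scala")` performs the network request; with no key it counts against the 100 requests/day limit mentioned above.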
Higher-rated alternatives
NX-AI/xlstm: Official repository of the xLSTM.
sinanuozdemir/oreilly-hands-on-gpt-llm: Mastering the Art of Scalable and Efficient AI Model Deployment
DashyDashOrg/pandas-llm: Pandas-LLM
wxhcore/bumblecore: An LLM training framework built from the ground up, featuring a custom BumbleBee architecture...
MiniMax-AI/MiniMax-01: The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model &...