WhereIsAI/BiLLM

Tool for converting LLMs from uni-directional to bi-directional by removing the causal mask, for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.

/ 100

Emerging

This tool helps AI/ML engineers adapt large language models (LLMs) to tasks such as text classification and sentence embedding. It takes an existing uni-directional (causal) LLM, such as LLaMA or Mistral, and converts it to a bi-directional one by removing the causal attention mask, so that each token can attend to the tokens after it as well as those before it. It is aimed at engineers fine-tuning LLMs for downstream tasks.
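To illustrate what removing the causal mask means, here is a minimal pure-Python sketch. It is not BiLLM's code or API: the function name `attention_weights` and the toy score matrix are hypothetical, chosen only to show how attention weights differ with and without the mask.

```python
import math

def attention_weights(scores, causal):
    """Row-wise softmax over a square score matrix, optionally causally masked.

    Hypothetical helper for illustration only; not part of BiLLM or transformers.
    """
    n = len(scores)
    weights = []
    for i in range(n):
        # With a causal mask, token i may only attend to positions 0..i.
        visible = [j for j in range(n) if (not causal) or j <= i]
        row_max = max(scores[i][j] for j in visible)
        exps = {j: math.exp(scores[i][j] - row_max) for j in visible}
        total = sum(exps.values())
        # Masked positions receive zero weight.
        weights.append([exps.get(j, 0.0) / total for j in range(n)])
    return weights

# Toy scores for a 3-token sequence. All scores are equal, so the weights
# are uniform over whichever tokens are visible.
scores = [[0.0] * 3 for _ in range(3)]
uni = attention_weights(scores, causal=True)   # uni-directional
bi = attention_weights(scores, causal=False)   # bi-directional

print(uni[0])  # the first token attends only to itself
print(bi[0])   # the first token attends to all three tokens
```

In the uni-directional case the first token has no access to later context, which is the limitation that bi-directional conversion is meant to address for classification and embedding tasks.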

No commits in the last 6 months.

Use this if you are an AI/ML engineer who wants to improve an LLM's performance on tasks like text classification, named entity recognition, or sentence embeddings.

Not ideal if you are an end-user looking for a ready-to-use application, or if you are not comfortable working with model fine-tuning and the Hugging Face `transformers` library.

natural-language-processing large-language-models machine-learning-engineering text-classification named-entity-recognition

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 14 / 25

Language

Python

License

MIT

Higher-rated alternatives

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...

arcee-ai/mergekit

Tools for merging pretrained large language models.

changyeyu/LLM-RL-Visualized

🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)

mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM
