Qwen-Applications/STAR

STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models

/ 100

Experimental

This project helps create very small, efficient AI models that can understand and execute specific commands or "function calls" from users. It takes a large, capable AI model and a collection of example interactions, then distills that knowledge into a much smaller, faster model. The end result is a compact AI that can accurately interpret user requests to perform actions, ideal for deployment on devices with limited resources or for scenarios where quick, specialized responses are needed. This is for AI developers and researchers who need to deploy function-calling AI at a tiny scale.

Use this if you need to build a highly compact, cost-effective AI model that can accurately interpret and respond to user requests by calling specific functions or tools, while maintaining performance close to much larger models.

Not ideal if you are looking for a general-purpose large language model for broad conversational tasks rather than specialized function calling.

AI-model-compression function-calling on-device-AI efficient-AI language-model-distillation

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 11 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

Apache-2.0

Higher-rated alternatives

scaleapi/llm-engine

Scale LLM Engine public repository

AGI-Arena/MARS

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

modelscope/easydistill

a toolkit on knowledge distillation for large language models

AGI-Edgerunners/LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient...

Wang-ML-Lab/bayesian-peft

Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]

Explore Transformer Models

All categories Trending Transformer directory Insights