raymin0223/fast_robust_early_exit

Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)

/ 100

Emerging

This project helps machine learning engineers and researchers speed up text generation with large language models on tasks such as summarization, question answering, and translation. It applies an early-exiting framework with synchronized parallel decoding to an existing autoregressive language model, aiming to produce predictions faster without losing accuracy.
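The project title refers to early exiting, where a token is emitted from an intermediate layer once the model is confident enough, instead of always running every layer. The sketch below shows only that generic idea with a softmax-confidence threshold. It is not the repository's code: `softmax`, `early_exit_predict`, and the `threshold` parameter are hypothetical names chosen for illustration, and the synchronized parallel decoding named in the title is not modeled here.

```python
import math


def softmax(logits):
    """Convert a list of logits into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def early_exit_predict(layer_logits, threshold=0.9):
    """Pick a token from the first layer whose top probability meets the threshold.

    layer_logits: one list of vocabulary logits per layer, shallowest first
                  (a hypothetical stand-in for per-layer model outputs).
    Returns (token_id, exit_layer), with layers counted from 1. If no layer
    is confident enough, the last layer's prediction is returned.
    """
    if not layer_logits:
        raise ValueError("layer_logits must contain at least one layer")

    token, depth = 0, 0
    for depth, logits in enumerate(layer_logits, start=1):
        probs = softmax(logits)
        confidence = max(probs)
        token = probs.index(confidence)
        if confidence >= threshold:
            break  # confident enough: skip the remaining layers
    return token, depth


# Toy example: three layers, three-token vocabulary.
toy_layers = [
    [0.1, 0.2, 0.0],  # layer 1: nearly uniform, low confidence
    [0.0, 5.0, 0.0],  # layer 2: token 1 clearly preferred
    [0.0, 9.0, 0.0],  # layer 3: token 1 even more strongly preferred
]
result = early_exit_predict(toy_layers, threshold=0.9)
```

In this toy input the second layer already clears the 0.9 threshold, so the third layer is never consulted. Raising the threshold trades speed for caution, since more layers run before a token is accepted.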

No commits in the last 6 months.

Use this if you need to accelerate text generation with large language models for tasks like summarization or translation, and you are comfortable deploying machine learning models.

Not ideal if you are looking for a no-code solution or are unfamiliar with integrating and fine-tuning deep learning models.

natural-language-generation large-language-models model-inference-optimization text-summarization machine-translation

No License · Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 15 / 25

Language: Python

License: None

Higher-rated alternatives

ShiZhengyan/InstructionModelling

[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"

SALT-NLP/Adaptive-Compositional-Modules

Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"

oooranz/Baby-CoThought

🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge)

joisino/zeh

Code for "Even GPT-5.2 Can’t Count to Five: The Case for Zero-Error Horizons in Trustworthy LLMs"

yhy1117/X-Mixup

Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".
