uw-swag/tokdrift

Repository for TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar.


Experimental

TokDrift is a research framework for evaluating how changes in code style or grammar (such as naming variables in `snake_case` versus `camelCase`, or modifying punctuation) affect the performance of large language models on coding tasks. It takes a code transformation rule and a code-related task (e.g., code generation, fixing tests) as input and outputs metrics showing how the rewrite changes the LLM's accuracy. It is intended for researchers and developers working on code-generating LLMs.
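To make the rule-in, metrics-out idea concrete, here is a minimal Python sketch of one rewrite rule (`snake_case` to `camelCase`) and an accuracy comparison. It is an illustration only, not TokDrift's code: the function names `snake_to_camel`, `rewrite_identifiers`, `accuracy`, and `accuracy_delta` are hypothetical, and the regex is deliberately naive (it would also rewrite matches inside string literals and comments).

```python
import re

# Hypothetical helpers for illustration; these are NOT TokDrift's API.

def snake_to_camel(name: str) -> str:
    """Convert one snake_case identifier to camelCase."""
    head, *rest = name.split("_")
    return head + "".join(part.capitalize() for part in rest)

# Lowercase identifiers that contain at least one interior underscore.
_SNAKE = re.compile(r"\b[a-z][a-z0-9]*(?:_[a-z0-9]+)+\b")

def rewrite_identifiers(code: str) -> str:
    """Apply the naming rule to every snake_case identifier in the text."""
    return _SNAKE.sub(lambda m: snake_to_camel(m.group(0)), code)

def accuracy(results: list[bool]) -> float:
    """Fraction of tasks solved; 0.0 for an empty list."""
    return sum(results) / len(results) if results else 0.0

def accuracy_delta(baseline: list[bool], variant: list[bool]) -> float:
    """Change in accuracy after the rewrite (negative means a drop)."""
    return accuracy(variant) - accuracy(baseline)
```

In this sketch, `baseline` and `variant` would hold per-task pass/fail outcomes for the original and rewritten prompts; how those outcomes are produced (model calls, test execution) is left out.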

Use this if you are a researcher or developer who needs to systematically evaluate how semantic-preserving code rewrites, such as changes in naming conventions or operator spacing, influence the accuracy and robustness of large language models on various coding tasks.
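A rewrite such as a change in operator spacing only counts as semantic-preserving if the program's structure is unchanged. One way to check this, assuming the code under test is Python, is to compare syntax trees before and after the rewrite. The helper name `same_ast` is hypothetical and is not taken from the TokDrift repository.

```python
import ast

def same_ast(original: str, rewritten: str) -> bool:
    """Return True when both snippets parse to the same syntax tree.

    ast.dump omits line and column attributes by default, so pure
    whitespace changes do not affect the comparison.
    """
    return ast.dump(ast.parse(original)) == ast.dump(ast.parse(rewritten))

# Spacing around operators changes the text, not the tree.
print(same_ast("x=a+b", "x = a + b"))

# Changing an operator changes the tree, so this is not a valid rewrite.
print(same_ast("x = a + b", "x = a - b"))
```

Note that this check is stricter than needed for naming rewrites: a consistent rename alters identifier names in the tree, so it would need a comparison that maps old names to new ones first.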

Not ideal if you are a practitioner looking for a tool to refactor your existing codebase or automatically improve code style; this is a research framework for analyzing LLM behavior, not a production code refactoring tool.

LLM evaluation, code generation, programming language research, software engineering research, natural language processing

No License, No Package, No Dependents

Maintenance 6 / 25

Adoption 5 / 25

Maturity 7 / 25

Community 8 / 25

Language: Python

License: none

Higher-rated alternatives

Goekdeniz-Guelmez/mlx-lm-lora

Train Large Language Models on MLX.

uber-research/PPLM

Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.

VHellendoorn/Code-LMs

Guide to using pre-trained large language models of source code

ssbuild/chatglm_finetuning

ChatGLM-6B fine-tuning and Alpaca fine-tuning

jarobyte91/pytorch_beam_search

A lightweight implementation of Beam Search for sequence models in PyTorch.
