osoleve/glitchlings
Enemies for your LLM
This project offers tools to intentionally introduce realistic text corruptions into your data. You provide original text, and it outputs versions with typos, homophone substitutions, or confusable characters. It's designed for machine learning engineers and researchers who are building and testing robust language models.
Available on PyPI.
Use this if you need to test how well your language models handle real-world data imperfections or want to train models to be more resilient to errors.
Not ideal if you're looking for tools to clean or correct existing noisy text data.
Stars
35
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 20, 2026
Commits (30d)
0
Dependencies
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/osoleve/glitchlings"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack.
thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
jind11/TextFooler
A Model for Natural Language Attack on Text Classification and Inference
thunlp/OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
thunlp/SememePSO-Attack
Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial...