lingo-iitgn/awesome-code-mixing
A curated list of resources dedicated to Code-mixed Natural Language Processing (NLP).
This is a curated collection of research papers, datasets, and software tools specifically focused on processing and understanding 'code-mixed' and 'code-switched' language. It brings together resources for tasks like identifying languages within a sentence, translating, or performing sentiment analysis on text that blends multiple languages. Anyone working with multilingual text, especially in online conversations or social media, would find this useful for building robust language technologies.
Use this if you are a researcher or practitioner building AI models for text that frequently combines words or phrases from different languages, such as social media posts or informal dialogues.
Not ideal if you primarily work with text in a single language or are looking for a ready-to-use, off-the-shelf software application.
Stars
11
Forks
—
Language
—
License
Apache-2.0
Category
Last pushed
Jan 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/lingo-iitgn/awesome-code-mixing"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
RichardLitt/low-resource-languages
Resources for conservation, development, and documentation of low resource (human) languages.
UCREL/pymusas-models
PyMUSAS Models
ksopyla/awesome-nlp-polish
A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models,...
datanada/Awesome-Korean-NLP
A curated list of resources for NLP (Natural Language Processing) for Korean