sagorbrur/codeswitch
CodeSwitch is an NLP tool for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.
When analyzing social media posts, customer feedback, or other text that mixes English with Hindi, Nepali, or Spanish, this tool helps you understand it better. It takes in sentences with mixed languages and can identify which words belong to which language, tag parts of speech, recognize named entities, and determine the sentiment of the text. This is useful for researchers, marketers, or data analysts working with multilingual text data.
No commits in the last 6 months. Available on PyPI.
Use this if you need to extract meaning, identify languages, or gauge sentiment from text that blends English with Spanish, Hindi, or Nepali.
Not ideal if your text involves languages other than Spanish, Hindi, or Nepali mixed with English, or if you need to analyze entire documents rather than short text snippets.
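The language identification step described above might look like the following minimal sketch. The import path `codeswitch.codeswitch`, the class `LanguageIdentification`, its `identify` method, and the pair codes `spa-eng`, `hin-eng`, and `nep-eng` are assumptions recalled from the project's README and are not confirmed by this listing. The helper `identify_languages` is hypothetical. Because the repository has not been updated since 2020, the package may not work with current dependency versions.

```python
# Sketch only: the import path, class name, method name, and pair codes
# are assumptions based on the project's README, not verified here.
SUPPORTED_PAIRS = ("spa-eng", "hin-eng", "nep-eng")


def identify_languages(text, pair="spa-eng"):
    """Return per-token language labels for a code-mixed sentence.

    This is a hypothetical convenience wrapper, not part of the package.
    """
    if pair not in SUPPORTED_PAIRS:
        raise ValueError(f"unsupported language pair: {pair!r}")

    # Imported lazily so the validation above works even when the
    # package is not installed.
    from codeswitch.codeswitch import LanguageIdentification

    # Constructing the identifier is expected to load a pretrained model,
    # which may require a download on first use.
    lid = LanguageIdentification(pair)
    return lid.identify(text)
```

Calling `identify_languages("your code-mixed sentence", pair="hin-eng")` would be expected to return one label per token, though the exact output structure depends on the installed version.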
Stars: 37
Forks: 6
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Nov 02, 2020
Commits (30d): 0
Dependencies: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sagorbrur/codeswitch"
Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.
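The same request can be made from Python using only the standard library. This is a sketch under two assumptions: that the endpoint follows the `/quality/<category>/<owner>/<repo>` path pattern seen in the curl command, and that it returns a JSON body. The function names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path pattern is inferred from it.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the listing data for one repository.

    Assumes the response body is JSON; this is not confirmed by the page.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example (requires network access and counts against the daily limit):
# data = fetch_quality("nlp", "sagorbrur", "codeswitch")
```

Keeping URL construction separate from the network call makes it easy to check the address without spending one of the daily requests.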
Higher-rated alternatives
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
UBC-NLP/turjuman
TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).
nusnlp/esc
The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper
nusnlp/greco
The official code for the "System Combination via Quality Estimation for Grammatical Error...
mynlp/pnmt
Pre-train support for OpenNMT (PNMT)