chenkovsky/cyac
High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementation!
This tool helps developers efficiently find and replace specific keywords or phrases within large blocks of text. You provide a list of terms, and it quickly identifies all occurrences, even handling case-insensitive matches accurately. It's ideal for developers building applications that require fast text scanning, content moderation, or data anonymization.
No commits in the last 6 months.
Use this if you need to perform high-speed keyword matching and replacement in text, especially with a large number of search terms or very long documents, and need precise control over character offsets.
Not ideal if you're looking for fuzzy matching, complex pattern recognition (like regular expressions for non-keyword tasks), or a ready-to-use application rather than a programming library.
Stars
94
Forks
15
Language
Cython
License
MIT
Category
Last pushed
Oct 16, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/chenkovsky/cyac"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
winkjs/wink-nlp
Developer friendly Natural Language Processing ✨
LSYS/LexicalRichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
mbejda/Node-OpenNLP
Apache OpenNLP wrapper for Nodejs
LanguageMachines/frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for...
winkjs/wink-nlp-utils
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic...