nicolasassi/gomtch
Find text even if it doesn't want to be found
This tool helps you accurately find specific words or phrases within large amounts of text, even if they've been deliberately obscured or misspelled. It takes your text and the terms you're looking for, then identifies all instances, including variations like split words ("s p l i t") or repeated characters ("reeeal"). This is ideal for content moderators, compliance officers, or researchers who need to identify sensitive or particular language.
No commits in the last 6 months.
Use this if you need to reliably detect specific terms in text that might be intentionally disguised through typos, spacing, or special characters, without having to manually normalize the text first.
Not ideal if you only need exact string matches or basic keyword searches, as its advanced matching features might be more than necessary.
Stars: 30
Forks: 1
Language: Go
License: BSD-3-Clause
Category:
Last pushed: Aug 11, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/nicolasassi/gomtch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ikawaha/kagome-dict
Dictionary Library for Kagome v2
aaaton/golem
A lemmatizer implemented in Go
habeanf/yap
Yet Another (natural language) Parser
clipperhouse/uax29
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.
jdkato/prose
A Golang library for text processing, including tokenization, part-of-speech tagging, and...