neo-search/multiple-string-searcher
Java library for fast multiple strings matchings. Uses internally Aho-Corasick or Commentz-Walter.
This is a Java library designed for developers who need to quickly find many specific words or phrases within a larger text. You provide a list of search terms (a 'dictionary') and an input text, and it efficiently returns all occurrences of your terms, including their positions. Developers building applications that require fast text scanning or named entity recognition will find this useful.
No commits in the last 6 months.
Use this if you are a Java developer building an application that needs to quickly identify multiple predefined strings or patterns within large blocks of text, such as for content filtering or information extraction.
Not ideal if you are not a Java developer or if you only need to search for a single string at a time, as simpler methods would suffice.
Stars
18
Forks
4
Language
Java
License
Apache-2.0
Category
Last pushed
Oct 11, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/neo-search/multiple-string-searcher"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
apache/opennlp
Apache OpenNLP
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing,...
dkpro/dkpro-core
Collection of software components for natural language processing (NLP) based on the Apache UIMA...
stanfordnlp/python-stanford-corenlp
Python interface to CoreNLP using a bidirectional server-client interface.
apache/opennlp-sandbox
Apache OpenNLP Sandbox