toltoxgh/CoreNLP-jMWE
Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations
This tool helps researchers and analysts automatically find common phrases and multi-word expressions like "Las Vegas" or "world record" within plain text. You input a text document, and it outputs a list of these identified expressions, which can be crucial for understanding nuanced meaning or improving text analysis. It's ideal for anyone working with large volumes of text who needs to go beyond single words to grasp the full context.
No commits in the last 6 months.
Use this if you need to accurately identify and extract fixed phrases and collocations from text to enhance natural language processing tasks like topic modeling or information retrieval.
Not ideal if your primary goal is basic keyword extraction or if you are not already using or planning to integrate with the Stanford CoreNLP framework.
Stars
15
Forks
8
Language
Java
License
GPL-3.0
Category
Last pushed
Jan 06, 2017
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/toltoxgh/CoreNLP-jMWE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
apache/opennlp
Apache OpenNLP
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing,...
stanfordnlp/python-stanford-corenlp
Python interface to CoreNLP using a bidirectional server-client interface.
dkpro/dkpro-core
Collection of software components for natural language processing (NLP) based on the Apache UIMA...
apache/opennlp-sandbox
Apache OpenNLP Sandbox