Yinghao-Li/CHMM-ALT

Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"

Emerging

This project helps researchers and data scientists automatically identify specific entities, such as disease names or product features, within large collections of text. It takes text annotated with weak labels (noisy, less precise annotations) from multiple sources and combines those labels to produce a dataset with identified named entities. This is useful for anyone working with unstructured text data who needs to extract key information without extensive manual annotation.
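To make the idea of combining weak labels concrete, here is a minimal sketch of the simplest possible aggregation: a per-token vote among the sources that propose an entity. This is a baseline stand-in for illustration only, not the hidden Markov model this project implements; the function name `vote_over_sources`, the label names, and the example sentence are all hypothetical.

```python
from collections import Counter


def vote_over_sources(weak_labels, default="O"):
    """Combine per-token labels from several weak sources by voting.

    weak_labels: list of label sequences, one per source, all the same length.
    A source abstains on a token by emitting `default`; the default label is
    kept only when no source proposes an entity for that token.
    """
    if not weak_labels:
        return []
    n_tokens = len(weak_labels[0])
    if any(len(seq) != n_tokens for seq in weak_labels):
        raise ValueError("all sources must label the same number of tokens")

    combined = []
    for token_votes in zip(*weak_labels):
        # Ignore abstentions so a single confident source is not outvoted by silence.
        entity_votes = [v for v in token_votes if v != default]
        if not entity_votes:
            combined.append(default)
        else:
            # Ties are broken in favour of the label seen first (source order).
            combined.append(Counter(entity_votes).most_common(1)[0][0])
    return combined


# Hypothetical example: three weak sources labelling one four-token sentence.
tokens = ["Aspirin", "may", "worsen", "asthma"]
source_a = ["O", "O", "O", "B-Disease"]
source_b = ["B-Chemical", "O", "O", "B-Disease"]
source_c = ["B-Chemical", "O", "O", "O"]

labels = vote_over_sources([source_a, source_b, source_c])
print(list(zip(tokens, labels)))
```

A vote like this treats every source as equally reliable and every token as independent, which is the kind of limitation that sequence models for weak supervision are meant to address.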

No commits in the last 6 months.

Use this if you need to extract specific named entities from text, have access to multiple sources of weakly labeled data, and want to aggregate those labels with a BERT-based hidden Markov model rather than rely on any single source.

Not ideal if you have a small, perfectly labeled dataset or if you need to perform general text classification rather than named entity recognition.

natural-language-processing biomedical-text-mining information-extraction sentiment-analysis data-labeling

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 17 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

dnanhkhoa/python-vncorenlp

A Python wrapper for VnCoreNLP using a bidirectional communication channel.

datquocnguyen/RDRPOSTagger

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

OpenSextant/SolrTextTagger

A text tagger based on Lucene / Solr, using FST technology

ankane/informers

Fast transformer inference for Ruby

bentrevett/pytorch-pos-tagging

A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.
