Hironsan/neraug
A text augmentation tool for named entity recognition.
When building systems that automatically extract specific information like names, locations, or organizations from text, you often need a lot of examples to train them effectively. This tool helps create more training examples from your existing annotated text by intelligently altering words or phrases. It takes your current text examples with their labeled entities and outputs new, varied examples, helping data scientists and NLP engineers improve the accuracy of their information extraction models.
No commits in the last 6 months. Available on PyPI.
Use this if you have a limited dataset of labeled text for named entity recognition and need to expand it to train more robust models.
Not ideal if you are looking for a pre-trained model or a tool that generates entirely new content rather than augmenting existing text.
Stars
54
Forks
2
Language
Python
License
MIT
Category
Last pushed
Jul 22, 2021
Commits (30d)
0
Dependencies
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Hironsan/neraug"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...