makcedward/nlpaug
Data augmentation for NLP
This tool helps machine learning engineers and data scientists expand small datasets of text or audio. It takes your existing text sentences or audio clips and generates new, subtly varied versions. This process creates more training data, which can significantly improve the performance and robustness of your natural language processing or speech recognition models.
4,650 stars. Used by 6 other packages. No commits in the last 6 months. Available on PyPI.
Use this if you have a limited amount of labeled text or audio data and need to create more diverse examples to train better-performing AI models.
Not ideal if you're looking for a tool to perform data analysis, visualize data, or directly build a machine learning model, as this focuses solely on data augmentation.
Stars
4,650
Forks
473
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jun 24, 2024
Commits (30d)
0
Dependencies
4
Reverse dependents
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/makcedward/nlpaug"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
TorchIO-project/torchio
Medical imaging processing for AI applications.
aleju/imgaug
Image augmentation for machine learning experiments.
mdbloice/Augmentor
Image augmentation library in Python for machine learning.
BloodAxe/pytorch-toolbelt
PyTorch extensions for fast R&D prototyping and Kaggle farming
qubvel/ttach
Image Test Time Augmentation with PyTorch!