Sreyan88/DALE

Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP

/ 100

Emerging

This tool helps legal professionals, researchers, or legal tech developers working with limited legal text data. It takes existing legal documents or case texts and generates new, diverse variations of these documents. This augmentation helps improve the performance of machine learning models used for tasks like legal document classification or information extraction, even when initial data is scarce.

No commits in the last 6 months.

Use this if you need to train or improve an NLP model on legal texts but have a small amount of annotated data, and traditional data augmentation methods aren't effective for complex legal language.

Not ideal if you're looking for a direct, out-of-the-box solution for legal text analysis without any programming or machine learning model integration.

legal-tech legal-research document-analysis regulatory-compliance case-management

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

sdv-dev/SDV

Synthetic data generation for tabular data

sdv-dev/SDGym

Benchmarking synthetic data generation methods.

NVIDIA-NeMo/DataDesigner

🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch...

AlexanderVNikitin/tsgm

Generation and evaluation of synthetic time series datasets (also, augmentations,...

mostly-ai/mostlyai

Synthetic Data SDK ✨

Explore Generative AI Tools

All categories Trending Generative AI directory Insights