JamesLYC88/text_classification_baseline_code

The code for the ACL 2023 paper "Linear Classifier: An Often-Forgotten Baseline for Text Classification".


Experimental

This project helps legal professionals, policymakers, and researchers compare different methods for automatically classifying legal documents. You input a collection of legal texts, such as court cases or regulatory documents, and the project outputs performance metrics for various text classification approaches like Linear SVM and BERT. It is designed for anyone working with large volumes of legal text who needs to understand and evaluate automated categorization methods.

No commits in the last 6 months.

Use this if you need to evaluate the effectiveness of different text classification models on legal documents to categorize them by topic, judgment type, or other attributes.

Not ideal if you want a ready-to-use, production-grade legal document classification system and have no need to compare methods or reproduce research results.

legal-tech document-classification legal-research regulatory-compliance legal-analytics

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 8 / 25


Language: Python

License: None

Higher-rated alternatives

urchade/GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from...

HySonLab/ViDeBERTa

ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023

lgalke/text-clf-baselines

WideMLP for Text Classification

stccenter/Comparative-Analysis-of-BERT-and-GPT-for-Classifying-Crisis-News-with-Sudan-Conflict-as-an-Example

Comparative Analysis of BERT and GPT for Conflict-Related Multiclass Label Classification from...

NLP-AI-Wizards/clef2025-checkthat

Challenge to distinguish whether a sentence from a news article expresses the subjective view of...
