JamesLYC88/text_classification_baseline_code
The code for the ACL 2023 paper "Linear Classifier: An Often-Forgotten Baseline for Text Classification".
This project helps legal professionals, policymakers, and researchers compare different methods for automatically classifying legal documents. You input a collection of legal texts, such as court cases or regulatory documents, and the project outputs performance metrics for various text classification approaches like Linear SVM and BERT. It is designed for anyone working with large volumes of legal text who needs to understand and evaluate automated categorization methods.
No commits in the last 6 months.
Use this if you need to evaluate the effectiveness of different text classification models on legal documents to categorize them by topic, judgment type, or other attributes.
Not ideal if you are looking for a ready-to-use, production-grade legal document classification system without needing to compare or reproduce research results.
Stars
19
Forks
2
Language
Python
License
—
Category
Last pushed
Jun 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/JamesLYC88/text_classification_baseline_code"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from...
HySonLab/ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023
lgalke/text-clf-baselines
WideMLP for Text Classification
stccenter/Comparative-Analysis-of-BERT-and-GPT-for-Classifying-Crisis-News-with-Sudan-Conflict-as-an-Example
Comparative Analysis of BERT and GPT for Conflict-Related Multiclass Label Classification from...
NLP-AI-Wizards/clef2025-checkthat
Challenge to distinguish whether a sentence from a news article expresses the subjective view of...