prrao87/patent-classification
Classify international patents into one of eight categories based on the text of their titles & abstracts using DistilBert & ONNX Runtime
This project helps patent analysts, intellectual property lawyers, and researchers automatically categorize international patents into one of eight main fields like 'Human Necessities' or 'Chemistry'. You provide raw patent XML files, and it processes their titles and abstracts to assign a relevant top-level classification. The output helps streamline the review and organization of large patent datasets.
No commits in the last 6 months.
Use this if you need to quickly sort a large collection of international patents into broad, pre-defined subject matter categories based on their text.
Not ideal if you need to classify patents into very granular subcategories, as it only addresses the top-level classification (A-H).
Stars
10
Forks
4
Language
Python
License
MIT
Category
Last pushed
Jul 21, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/prrao87/patent-classification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
yongzhuo/Pytorch-NLU
中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text...
aniass/Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning...
VinAIResearch/BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
hppRC/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
zhanlaoban/Transformers_for_Text_Classification
基于Transformers的文本分类