prrao87/patent-classification

Classify international patents into one of eight categories based on the text of their titles & abstracts using DistilBert & ONNX Runtime

36
/ 100
Emerging

This project helps patent analysts, intellectual property lawyers, and researchers automatically categorize international patents into one of eight main fields like 'Human Necessities' or 'Chemistry'. You provide raw patent XML files, and it processes their titles and abstracts to assign a relevant top-level classification. The output helps streamline the review and organization of large patent datasets.

No commits in the last 6 months.

Use this if you need to quickly sort a large collection of international patents into broad, pre-defined subject matter categories based on their text.

Not ideal if you need to classify patents into very granular subcategories, as it only addresses the top-level classification (A-H).

patent-analysis intellectual-property document-classification technical-text-processing information-retrieval
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

10

Forks

4

Language

Python

License

MIT

Last pushed

Jul 21, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/prrao87/patent-classification"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.