sebischair/FusionSent

Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes"

31
/ 100
Emerging

This project helps researchers and librarians automatically categorize scientific papers, articles, or other long-form documents. You input the text of a document (like a title and abstract) and it outputs the relevant scientific categories, even if you only have a few examples for each category. This is ideal for academics, research institutions, and digital libraries managing large collections of scientific literature.

No commits in the last 6 months. Available on PyPI.

Use this if you need to accurately assign multiple subject categories to scientific documents, especially when you have many different categories and limited labeled examples for each.

Not ideal if your documents are not scientific or if you have a vast amount of labeled data for every single category.

scientific-document-classification academic-publishing research-indexing digital-libraries knowledge-management
Stale 6m
Maintenance 0 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 0 / 25

How are scores calculated?

Stars

17

Forks

Language

Python

License

Apache-2.0

Last pushed

Jan 07, 2025

Commits (30d)

0

Dependencies

51

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sebischair/FusionSent"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.