wss1996/Name-disambiguation

同名论文消歧的工程化方案(参考2019智源-aminer人名消歧竞赛第一名方案)

33
/ 100
Emerging

This project helps research institutions and large academic publishers accurately identify unique authors within vast datasets of scientific publications, even when authors share the same name. It takes raw scientific paper metadata and author-to-paper records as input, and outputs a CSV file that uniquely identifies each author with a specific author_id. Research administrators, librarians, or data scientists managing large academic databases would find this useful.

No commits in the last 6 months.

Use this if you need to precisely distinguish between multiple authors who share identical names across millions of scientific papers to improve data accuracy and integrity.

Not ideal if you're dealing with smaller datasets or if you don't have the significant computational resources (150GB+ disk space, 16-core 64GB Linux server) and time (2-3 working days) required for processing.

academic-research bibliometrics author-profiling scientific-data-management research-data-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 10 / 25

How are scores calculated?

Stars

25

Forks

3

Language

Jupyter Notebook

License

Apache-2.0

Last pushed

Dec 08, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/wss1996/Name-disambiguation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.