easonanalytica/company_name_matcher
A library for matching and comparing company names using a fine-tuned sentence transformer model
This tool helps consolidate disparate records by accurately matching company names that might be spelled or phrased differently across various data sources (e.g., "Apple Inc" vs. "Apple Incorporated"). You provide a list of company names, and it identifies which names refer to the same entity, even across multiple languages. It's designed for data analysts, portfolio managers, supply chain professionals, and market intelligence specialists.
Available on PyPI.
Use this if you need to reliably link company records from multiple sources to gain a unified view for analytics, reporting, or operational purposes, especially with large datasets or multilingual names.
Not ideal if your matching needs are for very small, one-off comparisons where simple string matching is sufficient and performance for large datasets is not a concern.
Stars
9
Forks
1
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 18, 2025
Commits (30d)
0
Dependencies
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/easonanalytica/company_name_matcher"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
winkjs/wink-nlp
Developer friendly Natural Language Processing ✨
LSYS/LexicalRichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
mbejda/Node-OpenNLP
Apache OpenNLP wrapper for Nodejs
LanguageMachines/frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for...
winkjs/wink-nlp-utils
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic...