winkjs/wink-jaro-distance

An Implementation of Jaro Distance Algorithm by Matthew A. Jaro

46
/ 100
Emerging

This tool helps data analysts and engineers identify and deduplicate short strings like names or product IDs in datasets. You input two strings, and it outputs a score indicating how similar they are. This is particularly useful for cleaning up inconsistent data entries where minor typos or variations exist.

No commits in the last 6 months. Available on npm.

Use this if you need to find highly similar but not identical short strings to merge records or clean data.

Not ideal if you need to compare very long texts or complex phrases, as it's optimized for shorter strings.

data-cleaning data-deduplication record-matching data-quality entity-resolution
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 25 / 25
Community 16 / 25

How are scores calculated?

Stars

13

Forks

6

Language

JavaScript

License

MIT

Last pushed

Aug 11, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/winkjs/wink-jaro-distance"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.