aerdem4/kaggle-quora-dup

Solution to Kaggle's Quora Duplicate Question Detection Competition

48
/ 100
Emerging

This solution helps data scientists or machine learning engineers efficiently identify duplicate questions from a large dataset. It takes pairs of natural language questions as input and determines whether they convey the same meaning, outputting a prediction for each pair. This is ideal for those working on natural language processing tasks who need to de-duplicate or group similar queries.

137 stars. No commits in the last 6 months.

Use this if you need a pre-built machine learning pipeline to detect semantic duplicates among questions, especially in a competitive or research setting.

Not ideal if you are looking for a simple, out-of-the-box API or a solution that doesn't require environment setup and execution of multiple Python scripts.

Natural Language Processing Question Answering Systems Duplicate Detection Text Similarity Machine Learning Competitions
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

137

Forks

51

Language

Python

License

MIT

Last pushed

Feb 18, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/aerdem4/kaggle-quora-dup"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.