aerdem4/kaggle-quora-dup
Solution to Kaggle's Quora Duplicate Question Detection Competition
This solution helps data scientists or machine learning engineers efficiently identify duplicate questions from a large dataset. It takes pairs of natural language questions as input and determines whether they convey the same meaning, outputting a prediction for each pair. This is ideal for those working on natural language processing tasks who need to de-duplicate or group similar queries.
137 stars. No commits in the last 6 months.
Use this if you need a pre-built machine learning pipeline to detect semantic duplicates among questions, especially in a competitive or research setting.
Not ideal if you are looking for a simple, out-of-the-box API or a solution that doesn't require environment setup and execution of multiple Python scripts.
Stars
137
Forks
51
Language
Python
License
MIT
Category
Last pushed
Feb 18, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/aerdem4/kaggle-quora-dup"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
LittletreeZou/Question-Pairs-Matching
第三届魔镜杯 智能客服问题相似性算法设计 第12名解决方案
howardyclo/Kaggle-Quora-Question-Pairs
This is our team's solution report, which achieves top 10% (305/3307) in this competition.
aspk/Quora_question_pairs_NLP_Kaggle
Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn...
shubh24/QuoraDuplicate
Identify duplicate questions on Quora
goodmattg/quora_kaggle
Storage for Kaggle Quora competition