MohammedAly22/Semantify

A detailed comparison between 3 different techniques (TF-IDF, Doc2Vec, and Sentence Transformers) for performing semantic search on a huge dataset

12
/ 100
Experimental

This project helps you find the most relevant information in a large collection of documents by understanding the meaning and context of your search query, rather than just matching keywords. You input a search query and a dataset of text documents, and it outputs a list of documents most semantically similar to your query. Anyone who needs to quickly pinpoint specific information within vast amounts of text, such as a researcher or content analyst, would find this useful.

No commits in the last 6 months.

Use this if you need to perform advanced information retrieval on a large text dataset where traditional keyword searches fail to capture the true intent of your query.

Not ideal if your dataset is very small or if simple keyword matching is sufficient for your search needs.

information-retrieval text-analysis content-discovery document-search knowledge-management
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 0 / 25

How are scores calculated?

Stars

8

Forks

Language

Jupyter Notebook

License

Last pushed

Feb 19, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/MohammedAly22/Semantify"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.