MohammedAly22/Semantify

A detailed comparison of three techniques (TF-IDF, Doc2Vec, and Sentence Transformers) for performing semantic search on a large dataset

/ 100

Experimental

This project helps you find the most relevant information in a large collection of documents by understanding the meaning and context of your search query, rather than just matching keywords. You input a search query and a dataset of text documents, and it outputs a list of documents most semantically similar to your query. Anyone who needs to quickly pinpoint specific information within vast amounts of text, such as a researcher or content analyst, would find this useful.
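The query-in, ranked-documents-out flow described above can be sketched in a few lines. This is a minimal illustration, not the repository's code: it uses a hand-rolled TF-IDF weighting (the simplest of the three techniques the project compares) with cosine similarity, and the function names `tokenize`, `build_idf`, `tfidf_vector`, `cosine`, and `search` are hypothetical. TF-IDF only weights shared terms, so to capture meaning beyond word overlap you would replace the vectors with Doc2Vec or Sentence Transformers embeddings while keeping the same ranking step.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on anything that is not a letter or digit.
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return cleaned.split()


def build_idf(docs_tokens):
    # Smoothed inverse document frequency (an assumed weighting choice).
    n = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(tokens, idf):
    # Sparse vector as a dict; terms unseen in the corpus are ignored.
    counts = Counter(tokens)
    return {t: c * idf[t] for t, c in counts.items() if t in idf}


def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, documents, top_k=3):
    # Rank documents by cosine similarity to the query, highest first.
    docs_tokens = [tokenize(d) for d in documents]
    idf = build_idf(docs_tokens)
    doc_vectors = [tfidf_vector(t, idf) for t in docs_tokens]
    query_vector = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(query_vector, v), i) for i, v in enumerate(doc_vectors)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(documents[i], score) for score, i in scored[:top_k]]


if __name__ == "__main__":
    docs = [
        "The cat sat on the mat",
        "Stock markets fell sharply today",
        "A cat and a dog played in the garden",
    ]
    for text, score in search("cat on a mat", docs):
        print(f"{score:.3f}  {text}")
```

Only `tfidf_vector` would need to change to try a different technique: any function that maps text to a vector can feed the same `cosine` ranking, although dense embeddings would use lists or arrays rather than sparse dicts.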

No commits in the last 6 months.

Use this if you need to perform advanced information retrieval on a large text dataset where traditional keyword searches fail to capture the true intent of your query.

Not ideal if your dataset is very small or if simple keyword matching is sufficient for your search needs.

information-retrieval text-analysis content-discovery document-search knowledge-management

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 0 / 25

Stars

Forks

—

Language

Jupyter Notebook

License

—

Featured in

Things AI Won't Tell You About Building a Voice App

Higher-rated alternatives

patrickfrank1/chesspos

Embedding based chess position search and embedding learning for chess positions

alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using...

freedmand/semantra

Multi-tool for semantic search

koursaros-ai/nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve...

Mubelotix/SimRepo

Shows similar repositories in the sidebar
