kyegomez/Pegasus

PegasusX: The Future of Multimodal Embeddings 🦄 🦄

/ 100

Emerging

This project helps anyone working with diverse data types by turning text, images, video, and audio into 'embeddings' – numerical representations that capture the meaning of the data. You put in your raw data, and it gives you back these smart embeddings, which can then be used for tasks like searching similar content or classifying information. It's designed for researchers, data analysts, or machine learning practitioners who need to process and understand complex, mixed-media datasets.

No commits in the last 6 months.

Use this if you need to convert various types of data (text, images, audio, video) into a unified, meaningful numerical format for analysis or machine learning applications.

Not ideal if you are looking for an out-of-the-box application rather than a tool to generate data representations for further model development.

multimedia-analysis data-embedding information-retrieval machine-learning-data-prep content-understanding

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

ssrajadh/sentrysearch

Semantic search over videos using Gemini Embedding 2.

hayabhay/frogbase

Transform audio-visual content into navigable knowledge.

zilliz-bootcamp/audio_search

This project use PANNs for audio tagging and sound event detection, and finally get audio...

tomfalainen/word_spotting

Semantic and Verbatim Word Spotting in Torch

ashvardanian/SwiftSemanticSearch

Real-time on-device text-to-image and image-to-image Semantic Search with video stream camera...

Explore Embedding Tools

All categories Trending Embeddings directory Insights