rohitrango/objects-that-sound

Unofficial Implementation of Google Deepmind's paper `Objects that Sound`

35
/ 100
Emerging

This project helps you find similar video frames or identify what an audio clip sounds like, even if you don't have descriptions. You provide video and audio clips, and it processes them to allow you to search for visually similar frames or find frames that match a particular sound. This tool is for researchers or media catalogers working with large collections of unlabeled audio-visual data.

No commits in the last 6 months.

Use this if you need to quickly retrieve relevant video frames based on either a visual example or an audio snippet, especially in large, unsorted multimedia archives.

Not ideal if you need precise localization of sound sources within a video or if your data isn't primarily focused on musical instruments or distinct sound events.

multimedia-archiving audio-analysis video-search content-retrieval unlabeled-data-analysis
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 18 / 25

How are scores calculated?

Stars

83

Forks

16

Language

Python

License

Last pushed

May 07, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/rohitrango/objects-that-sound"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.