marl/openl3

OpenL3: Open-source deep audio and image embeddings

/ 100

Established

This tool helps researchers and engineers analyze sound and images by converting them into numerical representations called 'embeddings'. You feed in audio files, video frames, or images, and it outputs these embeddings which capture the semantic meaning of the content. This is useful for anyone working with multimedia data, like a sound engineer categorizing audio events or a data scientist building a content-based recommendation system.

581 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to compare, categorize, or search through large collections of audio or image data based on their semantic content.

Not ideal if you are a non-technical user looking for a ready-to-use application with a graphical interface.

audio-analysis image-processing multimedia-content-analysis data-science research

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 19 / 25

How are scores calculated?

Stars

581

Forks

Language

Jupyter Notebook

License

MIT

Related frameworks

iver56/audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the...

Rikorose/DeepFilterNet

Noise supression using deep filtering

torchsynth/torchsynth

A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.

archinetai/audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

ductho-le/WaveDL

A Scalable Deep Learning Framework for Wave-Based Inverse Problems

Explore ML Frameworks

All categories Trending ML Framework directory Insights