xrenaa/Retriever

[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"

/ 100

Experimental

This project helps researchers and developers explore the core components of various media, like speech or images, by separating 'what it is' from 'how it looks or sounds.' It takes input data (e.g., an audio clip, an image) and outputs disentangled content and style representations. Scientists and engineers working on advanced media manipulation or generation tasks would use this.

No commits in the last 6 months.

Use this if you need to perform unsupervised disentanglement of content and style from various media types, for applications like zero-shot voice conversion, co-part segmentation, or style transfer.

Not ideal if you're looking for a ready-to-use application for end-users, as this provides research code for underlying representation learning.

voice-conversion style-transfer media-synthesis unsupervised-learning computer-vision

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

Westlake-AI/openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

YU1ut/MixMatch-pytorch

Code for "MixMatch - A Holistic Approach to Semi-Supervised Learning"

kamata1729/QATM_pytorch

Pytorch Implementation of QATM:Quality-Aware Template Matching For Deep Learning

nttcslab/msm-mae

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

rgeirhos/generalisation-humans-DNNs

Data, code & materials from the paper "Generalisation in humans and deep neural networks" (NeurIPS 2018)

Explore ML Frameworks

All categories Trending ML Framework directory Insights