yyuncong/TempCLR

[ICLR 2023] TempCLR: Temporal Alignment Representation with Contrastive Learning

Score: 27 / 100 (Experimental)

This project helps machine learning researchers model the temporal order of actions in a video, such as the steps of a recipe or an instructional task. It takes raw video and text descriptions as input and produces models that better align visual events with their corresponding textual explanations. Researchers working on video analysis, action recognition, or multimodal learning will find it useful.

No commits in the last 6 months.

Use this if you are a machine learning researcher aiming to build or evaluate models that understand and align temporal information in videos with accompanying text, especially for tasks like action step localization or video retrieval.

Not ideal if you are looking for a ready-to-use application for end users, or if your primary interest is not in researching and developing video-text alignment algorithms.

video-understanding multimodal-learning action-recognition temporal-analysis machine-learning-research
Signals: Stale (6m), No Package, No Dependents
Maintenance: 0 / 25
Adoption: 7 / 25
Maturity: 16 / 25
Community: 4 / 25


Stars: 27
Forks: 1
Language: Python
License: MIT
Last pushed: Apr 22, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/yyuncong/TempCLR"

Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000 requests/day.
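The endpoint returns JSON. As a minimal Python sketch of how one might consume it, here is the parsing step applied to a hypothetical payload; the field names are assumptions for illustration (the actual response schema is not documented on this page), but the numbers mirror the four 25-point categories above:

```python
import json

# Hypothetical payload mirroring the score breakdown shown on this page.
# The real field names may differ; treat this as a schema assumption.
payload = '{"maintenance": 0, "adoption": 7, "maturity": 16, "community": 4}'

data = json.loads(payload)

# The four categories are each scored out of 25, so their sum is the
# overall score out of 100.
total = data["maintenance"] + data["adoption"] + data["maturity"] + data["community"]
print(total)  # matches the 27 / 100 overall score
```

In a live script, the `payload` string would come from an HTTP GET against the URL in the curl command above.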