IsaacRodgz/multimodal-transformers-movies
Experiments with multimodal deep learning models based on transformers
This project helps AI/ML researchers classify movie genres automatically. It takes multimodal inputs such as video frames, audio, plot text, movie posters, and metadata, and outputs predicted movie genres. Its main audience is machine learning researchers and data scientists working on multimodal deep learning and natural language processing in the entertainment domain.
No commits in the last 6 months.
Use this if you are an AI/ML researcher working with multimodal data to predict movie genres and want to experiment with advanced transformer architectures.
Not ideal if you are a filmmaker or marketer looking for a ready-to-use tool for audience segmentation or content recommendation without deep learning expertise.
Stars: 11
Forks: 1
Language: Jupyter Notebook
License: —
Category:
Last pushed: Oct 09, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/IsaacRodgz/multimodal-transformers-movies"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
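For readers who would rather call the endpoint from a script than from curl, here is a minimal Python sketch using only the standard library. It rests on several assumptions not stated on this page: that the path segments after `quality/` are a category, an owner, and a repository name (inferred from the single URL above), and that the response body is JSON. The function names `build_quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble a URL in the same shape as the curl example.

    Assumption: the path is <category>/<owner>/<repo>; this is inferred
    from one example URL, not from API documentation.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Request the record and parse it.

    Assumption: the endpoint returns a JSON body. If it does not,
    json.loads raises a ValueError (json.JSONDecodeError).
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "IsaacRodgz", "multimodal-transformers-movies")` would request the same URL as the curl command. With the keyless limit of 100 requests/day, it is worth caching results rather than re-fetching in a loop.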
Higher-rated alternatives
dorarad/gansformer
Generative Adversarial Transformers
j-min/VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
rkansal47/MPGAN
The message passing GAN https://arxiv.org/abs/2106.11535 and generative adversarial particle...
Yachay-AI/byt5-geotagging
Confidence and Byt5-based geotagging model predicting coordinates from text alone.