pliang279/awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

48
/ 100
Emerging

This reading list helps AI researchers and students navigate the rapidly evolving field of multimodal machine learning. It curates academic papers, course materials, and workshops, covering core areas like multimodal representations and fusion, along with applications across various domains. Researchers and graduate students interested in developing AI systems that process and understand multiple data types (like text, images, and audio) would find this resource invaluable.

6,835 stars. No commits in the last 6 months.

Use this if you are an AI researcher or student seeking a comprehensive, organized collection of academic resources to deepen your understanding or identify research gaps in multimodal machine learning.

Not ideal if you are looking for ready-to-use open-source code libraries or practical tutorials for implementing multimodal models without a strong theoretical background.

AI-research machine-learning-research computer-vision natural-language-processing speech-recognition
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

6,835

Forks

897

Language

License

MIT

Last pushed

Aug 20, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/pliang279/awesome-multimodal-ml"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.