Mrkomiljon/awesome-generative-ai
Multimodal generative AI resources : talking heads, STT, TTS, image & video generation, and more.
This resource helps developers quickly find and implement modern generative AI technologies for their products. It provides curated links and practical entry points across various domains like speech, vision, and agentic AI. Developers who want to build and ship real products using advanced AI models and APIs will find this valuable.
Use this if you are a developer building agentic applications, voice products, multimodal workflows, or other AI-powered tools and need a focused guide to relevant resources.
Not ideal if you are looking for academic research papers, theoretical discussions, or a comprehensive, unopinionated list of all generative AI resources.
Stars
30
Forks
6
Language
Python
License
MIT
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Mrkomiljon/awesome-generative-ai"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
NVIDIA/Maya-ACE
Maya-ACE: A Reference Client Implementation for NVIDIA ACE Audio2Face Service
OpenVGLab/OmniLottie
[CVPR 2026🔥] 🧑🎨 OmniLottie, an open-sourced multi-modal instructed vector animation generator...
jdh-algo/JoyHallo
JoyHallo: Digital human model for Mandarin
michaelzhang-ai/Speech2Video
ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
Boese0601/Dyadic-Interaction-Modeling
[ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation