sergiosolorzano/TalkomicApp-Unity
Whisper, Stable Diffusion on U-Net, and ChatGPT AI models, bundled in a Unity project. The locally run models are powered by ONNX Runtime. Together they transcribe podcast audio to text and generate contextual images tied to the transcribed text.
This tool transforms your podcast audio into a visually engaging experience. It takes an audio file, transcribes it into text, and then generates relevant images that sync with different sections of your podcast. This is ideal for podcasters, content creators, or educators who want to enhance audio-only content for platforms that support video and images.
No commits in the last 6 months.
Use this if you want to make your podcasts more captivating by automatically adding contextually relevant images to accompany the spoken words.
Not ideal if you're looking for a fully automated, hands-off solution for very long audio files, as some setup and configuration are required.
Stars: 27
Forks: 1
Language: C#
License: MIT
Category:
Last pushed: May 02, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/sergiosolorzano/TalkomicApp-Unity"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
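The same request can be made from a script. The following is a minimal Python sketch, not an official client: the helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which the listing does not state. No response fields are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper).

    Each path segment is percent-encoded so unusual characters
    cannot change the shape of the path.
    """
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires network access; counts against the daily request limit.
    record = fetch_quality("sergiosolorzano", "TalkomicApp-Unity")
    print(json.dumps(record, indent=2))
```

Keeping URL construction separate from the network call makes the former easy to check offline, and a caller working within the keyless 100 requests/day limit may want to cache results rather than refetch.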
Higher-rated alternatives
RageAgainstThePixel/OpenAI-DotNet
A Non-Official OpenAI RESTful API Client for DotNet
jeffdapaz/VisualChatGPTStudio
Add ChatGPT functionality directly in Visual Studio
RageAgainstThePixel/com.openai.unity
A Non-Official OpenAI Rest Client for Unity (UPM)
betalgo/openai
.NET library for the OpenAI service API by Betalgo Ranul
Azure-Samples/azure-search-openai-demo-csharp
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure...