sergiosolorzano/TalkomicApp-Unity

Whisper, Stable Diffusion (on U-Net), and ChatGPT AI models, bundled in a Unity project. The locally run models are powered by ONNX Runtime. Together they transcribe podcast audio to text and generate contextual images tied to the transcribed text.

Score: 29 / 100 (Experimental)

This tool transforms your podcast audio into a visually engaging experience. It takes an audio file, transcribes it into text, and then generates relevant images that sync with different sections of your podcast. This is ideal for podcasters, content creators, or educators who want to enhance audio-only content for platforms that support video and images.
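
The step of syncing images with sections of the podcast can be pictured with a minimal sketch. This is not the project's code (the project itself is a C# Unity application); it only assumes that transcription yields timestamped text segments, and it groups them into sections that could each drive one image prompt. The names `Segment`, `Section`, `group_into_sections`, and the 60-second default are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """One timestamped piece of transcript (assumed transcription output)."""
    start: float  # seconds from the start of the audio
    end: float
    text: str


@dataclass
class Section:
    """A span of the podcast that would be illustrated by a single image."""
    start: float
    end: float
    prompt: str  # text that would be handed to the image model


def _close(segments: List[Segment]) -> Section:
    # Join the segment texts into one prompt covering the whole span.
    text = " ".join(s.text.strip() for s in segments)
    return Section(segments[0].start, segments[-1].end, text)


def group_into_sections(segments: List[Segment],
                        section_seconds: float = 60.0) -> List[Section]:
    """Group consecutive segments into sections of at most section_seconds.

    A single segment longer than section_seconds becomes its own section.
    """
    sections: List[Section] = []
    current: List[Segment] = []
    for seg in segments:
        # Start a new section when adding this segment would exceed the limit.
        if current and seg.end - current[0].start > section_seconds:
            sections.append(_close(current))
            current = []
        current.append(seg)
    if current:
        sections.append(_close(current))
    return sections
```

Each resulting `Section` carries the time range during which its image would be displayed, which is what keeps the visuals aligned with the spoken content.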

No commits in the last 6 months.

Use this if you want to make your podcasts more captivating by automatically adding contextually relevant images to accompany the spoken words.

Not ideal if you're looking for a fully automated, hands-off solution for very long audio files, as some setup and configuration are required.

podcasting content-creation digital-publishing audio-visual-enhancement educational-content
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 4 / 25

Stars: 27
Forks: 1
Language: C#
License: MIT
Last pushed: May 02, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/sergiosolorzano/TalkomicApp-Unity"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
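
The same request can be made from a script. The sketch below uses only the Python standard library and the endpoint shown in the curl command above. It assumes the endpoint returns JSON; the response fields are not documented here, so the result is returned as-is without reading specific keys. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is owner/repo.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the request URL for a repository, percent-encoding each part."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for a repository.

    Assumption: the body is JSON. The parsed value is returned unchanged
    because the response schema is not specified in this listing.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("sergiosolorzano", "TalkomicApp-Unity")` issues the same GET request as the curl command. With no key, requests count against the 100 per day limit noted above.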