sanjifr3/Narrator
An image and video description generator using an CNN-RNN based architecture.
This project helps content creators automatically generate spoken descriptions for images and videos. You provide your visual content, and it outputs audio narration designed to make your content more accessible to people with vision impairment. This is ideal for video producers, educators, or social media managers looking to enhance content inclusivity.
No commits in the last 6 months.
Use this if you need to quickly add descriptive audio tracks to your visual content for accessibility purposes, without manually writing and recording every description.
Not ideal if you require highly nuanced or creative narrative descriptions that only a human can provide, or if your primary need is just text captions rather than audio.
Stars
25
Forks
2
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jul 16, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sanjifr3/Narrator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
AlimTleuliyev/image-to-audio
Image Captioning and Text-to-Speech
sidphbot/visual-to-audio-aid-for-visually-impaired
A system to process visual input on timed frames to produce sensible audio aid in accordance...
Abhradipta/OCR-With-Read-Out-Loud-Using-Python
An Optical Character Recognition (OCR) System designed using Python to read the contents out loud.
SARIT42/image-Annotation-Speech
Explaining the contents of an image in the form of speech through caption generation using...
ahmedgulabkhan/TEI2S
TEI2S is a project which is really helpful for the visually impaired, in a sense that it takes...