vikesh8860/Multitasker

Multitasker is a machine learning project based on python that implements Photo Ocr, Photos to Pdf converter, Text to speech converter and Speech to text converter

/ 100

Experimental

This application helps you manage and convert various media formats for everyday tasks. It takes images, spoken words, or written text and transforms them into editable text, PDF documents, or audio files. Individuals like students, administrative assistants, or anyone needing to quickly convert content across different formats would find this useful.

No commits in the last 6 months.

Use this if you need a quick, all-in-one tool to convert images to text, spoken words to written text, written text to speech, or compile images into a PDF.

Not ideal if you require highly accurate OCR for complex documents with significant noise or advanced features for professional document management and audio production.

document-conversion transcription dictation image-to-text audio-narration

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Featured in

Things AI Won't Tell You About Building a Voice App Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

nateshmbhat/pyttsx3

Offline Text To Speech synthesis for python

KoljaB/RealtimeTTS

Converts text to speech in realtime

pndurette/gTTS

Python library and CLI tool to interface with Google Translate's text-to-speech API

n1teshy/yapper-tts

offline text to speech and free SOTA LLM APIs to let your programs speak to you

dputhier/pygtftk

A python package and a set of shell commands to handle GTF files

Explore Voice AI Tools

All categories Trending Voice AI directory Insights