vikesh8860/Multitasker
Multitasker is a machine learning project based on python that implements Photo Ocr, Photos to Pdf converter, Text to speech converter and Speech to text converter
This application helps you manage and convert various media formats for everyday tasks. It takes images, spoken words, or written text and transforms them into editable text, PDF documents, or audio files. Individuals like students, administrative assistants, or anyone needing to quickly convert content across different formats would find this useful.
No commits in the last 6 months.
Use this if you need a quick, all-in-one tool to convert images to text, spoken words to written text, written text to speech, or compile images into a PDF.
Not ideal if you require highly accurate OCR for complex documents with significant noise or advanced features for professional document management and audio production.
Stars
8
Forks
1
Language
Python
License
—
Category
Last pushed
Apr 28, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/vikesh8860/Multitasker"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
KoljaB/RealtimeTTS
Converts text to speech in realtime
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
n1teshy/yapper-tts
offline text to speech and free SOTA LLM APIs to let your programs speak to you
dputhier/pygtftk
A python package and a set of shell commands to handle GTF files