kritiksoman/Multimodal

Listen. Write. Speak. Read. Think.

21 / 100 (Experimental)

This tool helps content creators, marketers, and analysts process and understand spoken content and documents. It accepts audio recordings (such as speeches or interviews), YouTube video links, or documents (PDF and DOCX files), and outputs summarized audio, sentiment analysis, anonymized speech, or answers to questions about the content. It is designed for anyone working with spoken words or text who needs to quickly extract insights or transform media.

No commits in the last 6 months.

Use this if you need to extract key information, analyze sentiment, anonymize identities, or convert documents into spoken audio from various media sources.

Not ideal if you require advanced video editing capabilities or highly specialized linguistic analysis that goes beyond sentiment and question answering.

content-analysis media-monitoring transcription audio-processing document-review
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25
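
The four category scores above add up to the overall 21 / 100 shown at the top. As a minimal sketch, assuming the overall score is a plain sum of the four categories (the page does not state the formula here, and the variable names are illustrative):

```python
# Category scores as listed on this page, each out of 25.
category_scores = {
    "Maintenance": 0,
    "Adoption": 5,
    "Maturity": 16,
    "Community": 0,
}

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(category_scores.values())
max_total = 25 * len(category_scores)

print(f"{total} / {max_total}")
```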


Stars: 10
Forks:
Language: Jupyter Notebook
License: MIT
Last pushed: Apr 15, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kritiksoman/Multimodal"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
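
The same request can be made from Python using only the standard library. This is a rough sketch rather than documented client code: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category/owner/repo is inferred from the curl example, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path segments are category/owner/repo, as the
    curl example (nlp/kritiksoman/Multimodal) suggests.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data without an API key.

    Assumption: the endpoint returns a JSON body; its fields are not
    documented here, so the parsed result is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "kritiksoman", "Multimodal")` issues the same GET request as the curl command and counts against the keyless daily limit.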