kritiksoman/Multimodal
Listen. Write. Speak. Read. Think.
This project helps content creators, marketers, and analysts process and understand spoken content and documents. You can input audio recordings (such as speeches or interviews), YouTube video links, or documents (PDF or DOCX files), and it outputs summarized audio, sentiment analysis, anonymized speech, or answers to questions about the content. It is designed for anyone working with spoken words or text who needs to quickly extract insights or transform media.
No commits in the last 6 months.
Use this if you need to extract key information, analyze sentiment, anonymize identities, or convert documents into spoken audio from various media sources.
Not ideal if you require advanced video editing capabilities or highly specialized linguistic analysis that goes beyond sentiment and question answering.
Stars: 10
Forks: —
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Apr 15, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kritiksoman/Multimodal"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
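The same request can be sketched in Python using only the standard library. This is a minimal illustration, not a documented client: the path pattern (category, owner, repository) is inferred from the single curl example above, the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, as inferred from
    the one example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response.

    Assumption: the endpoint returns a JSON body; the schema is unknown,
    so the decoded object is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example (performs a network request, so it is left commented out):
# data = fetch_quality("nlp", "kritiksoman", "Multimodal")
# print(data)
```

Keeping URL construction separate from the network call makes it easy to check the generated URL without spending any of the daily request quota.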
Higher-rated alternatives
SlapBot/sounder
An intent recognizing algorithm to predict the intent of a given text.
plkmo/NLP_Toolkit
Library of state-of-the-art models (PyTorch) for NLP tasks
loretoparisi/hf-experiments
Experiments with Hugging Face 🔬 🤗
hjian42/Natural-Language-Processing-Nanodegree
Natural Language Processing Nanodegree from Udacity Platform, in which I implement Hidden Markov...
Philipp2211/Udacity-Natural-Language-Processing-Nanodegree
This repository contains all my solutions to the tutorials/projects of the Udacity Natural...