alibaba-damo-academy/SpokenNLP

A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

49 / 100 (Emerging)

This collection of projects helps analyze spoken language and long documents. It takes lecture videos, spoken dialogue, or long text as input and produces outputs such as topic boundaries, keyphrases, or segmented documents. It is aimed at researchers and data scientists who work with large volumes of conversational data or text.


Use this if you need to extract meaningful segments or information from audio recordings of speeches or meetings, or from long written documents.

Not ideal if you are looking for a simple, out-of-the-box application for general speech-to-text transcription or basic text summarization.

speech analysis, document segmentation, keyphrase extraction, multimodal analysis, natural language processing research
No Package, No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 13 / 25


Stars 124
Forks 12
Language Python
License Apache-2.0
Last pushed Mar 11, 2026
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/alibaba-damo-academy/SpokenNLP"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
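
The same request can be made from Python. The following is a minimal sketch that uses only the standard library and makes several assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the URL pattern (category, owner, repository) is inferred from the curl example above, and the response is assumed to be JSON, since this page does not document the response format. How an API key is passed is also not documented here, so the sketch covers only the keyless request.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    The (category, owner, repo) layout is inferred from that one example.
    """
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumption: the body is JSON. The response schema is not documented
    on this page, so the parsed value is returned without interpretation.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "alibaba-damo-academy", "SpokenNLP")` sends the same request as the curl command. With a keyless limit of 100 requests/day, it is worth caching the result rather than fetching it on every run.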