BUTSpeechFIT/BaySMM

A Bayesian Multilingual Document Model

28
/ 100
Experimental

This project helps researchers and data analysts working with large volumes of text in multiple languages to automatically identify and discover common topics across those documents. You input a collection of text documents, potentially in different languages, and it outputs an understanding of the overarching themes present, without needing pre-labeled examples for every language. It's ideal for anyone analyzing multilingual content for shared insights.

No commits in the last 6 months.

Use this if you need to find common themes and topics in a collection of documents written in several different languages without having to manually label each document for every language.

Not ideal if you only work with single-language documents or if your primary goal is sentiment analysis rather than topic identification.

multilingual-content-analysis topic-discovery cross-lingual-research document-classification natural-language-processing
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

9

Forks

4

Language

Python

License

Last pushed

Mar 23, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/BUTSpeechFIT/BaySMM"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.