ina-foss/twembeddings
Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora
This project helps researchers analyze large streams of Twitter data to identify emerging events without predefined categories. It takes a collection of tweets (English or French) as input and groups them into clusters, indicating distinct events. This tool is designed for academic researchers or data scientists who need to detect novel events from social media feeds.
No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically discover new topics or incidents from raw Twitter data for research or analysis.
Not ideal if you're looking for a simple, out-of-the-box application without any programming or command-line interaction.
Stars
33
Forks
5
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jul 25, 2025
Commits (30d)
0
Dependencies
15
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/ina-foss/twembeddings"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MilaNLProc/contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings...
vinid/cade
Compass-aligned Distributional Embeddings. Align embeddings from different corpora
spcl/ncc
Neural Code Comprehension: A Learnable Representation of Code Semantics
criteo-research/CausE
Code for the Recsys 2018 paper entitled Causal Embeddings for Recommandation.
vintasoftware/entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support...