OlgaChernytska/word2vec-pytorch

Implementation of the first paper on word2vec

/ 100

Emerging

This project helps natural language processing engineers and researchers compare words by meaning. Given a text corpus, it learns a numerical vector (an embedding) for each word, so that words used in similar contexts end up with similar vectors. These embeddings can then be used as input features in downstream NLP applications.
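Comparing words by meaning usually comes down to comparing their embedding vectors, most often with cosine similarity. The snippet below is a minimal sketch of that idea in plain Python. The three-dimensional vectors and the helper names `cosine_similarity` and `most_similar` are made up for illustration; they are not taken from this repository, and real embeddings typically have hundreds of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings, hand-picked for illustration only.
# They are NOT output from this repository or any trained model.
toy_embeddings = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.75, 0.2],
    "apple": [0.1, 0.2, 0.9],
}


def most_similar(word, embeddings):
    """Return (other_word, similarity) for the closest other word."""
    query = embeddings[word]
    candidates = [
        (other, cosine_similarity(query, vec))
        for other, vec in embeddings.items()
        if other != word
    ]
    return max(candidates, key=lambda pair: pair[1])


if __name__ == "__main__":
    # With these toy vectors, "queen" is closer to "king" than "apple" is.
    print(most_similar("king", toy_embeddings))
```

With trained embeddings in place of the toy dictionary, the same lookup would give a nearest-neighbour query over the vocabulary.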

249 stars. No commits in the last 6 months.

Use this if you are a developer looking for a foundational implementation of Word2Vec to experiment with or integrate into your own PyTorch-based NLP projects.

Not ideal if you need a plug-and-play tool for immediate use in a business application without any coding or deep understanding of machine learning models.

natural-language-processing, word-embeddings, text-analysis, machine-learning-engineering, semantic-modeling

No License · Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 24 / 25


Stars: 249

Forks:

Language: Python

License: —

Higher-rated alternatives

facebookresearch/stopes

A library for preparing data for machine translation research (monolingual preprocessing,...

Droidtown/ArticutAPI

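API for Articut Chinese word segmentation (with semantic part-of-speech tagging): word segmentation is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it can achieve...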

rkcosmos/deepcut

A Thai word tokenization library using Deep Neural Network

fukuball/jieba-php

"Jieba" Chinese word segmentation: aiming to be the best PHP component for Chinese word segmentation. / "Jieba" (Chinese for "to stutter") Chinese text segmentation:...

pytorch/text

Models, data loaders and abstractions for language processing, powered by PyTorch
