hassyGo/charNgram2vec
Pre-training character n-gram embeddings
This tool helps natural language processing researchers or practitioners prepare text data for advanced machine learning models. It processes raw text and generates character n-gram embeddings, which are numerical representations of short sequences of characters within words. These embeddings serve as a foundational input for neural networks used in various NLP tasks.
No commits in the last 6 months.
Use this if you are developing or training neural networks for natural language processing and need to create character-level feature representations from text.
Not ideal if you are looking for a pre-packaged solution for a specific NLP task, as this tool focuses on an underlying data preparation step rather than an end-user application.
Stars
23
Forks
4
Language
C++
License
MIT
Category
Last pushed
Nov 01, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/hassyGo/charNgram2vec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/text2vec
text2vec, text to vector....
predict-idlab/pyRDF2Vec
đ Python Implementation and Extension of RDF2Vec
IntuitionEngineeringTeam/chars2vec
Character-based word embeddings model based on RNN for handling real world texts
IITH-Compilers/IR2Vec
Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.