danjohnvelasco/Filipino-Word-Embeddings
This repository contains download links to pretrained static word embeddings (word2vec, fastText) in Filipino.
This project provides pretrained word embeddings for Filipino, including Taglish (mixed Tagalog and English). The embeddings map each word to a numerical vector, which downstream tasks such as sentiment analysis, machine translation, and information retrieval can use as input features. Data scientists, computational linguists, and researchers working with Filipino text may find them useful for natural language processing applications.
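To make the idea of "words as numerical representations" concrete, here is a minimal sketch using only the Python standard library. It assumes the vectors are stored in the common word2vec/fastText text format (a `count dim` header line, then one `word v1 ... vn` line per word); the listing does not say which file format the repository's downloads use. The three-dimensional vectors below are invented toy values, not taken from the actual embeddings, and the function names are hypothetical.

```python
import io
import math


def load_text_vectors(handle):
    """Parse word2vec-style text vectors from an open text handle.

    Assumed format: a 'count dim' header line, followed by one
    'word v1 v2 ... vn' line per vocabulary entry.
    """
    _count, dim = map(int, handle.readline().split())
    vectors = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Toy, made-up vectors for illustration only (NOT real embedding values).
SAMPLE = (
    "3 3\n"
    "maganda 0.9 0.1 0.0\n"
    "marikit 0.8 0.2 0.1\n"
    "aso 0.0 0.1 0.9\n"
)

vectors = load_text_vectors(io.StringIO(SAMPLE))
sim_close = cosine(vectors["maganda"], vectors["marikit"])
sim_far = cosine(vectors["maganda"], vectors["aso"])
print(f"maganda~marikit: {sim_close:.3f}, maganda~aso: {sim_far:.3f}")
```

With real embeddings, words used in similar contexts would be expected to score higher than unrelated words, which is the property that tasks like retrieval and sentiment analysis rely on. For full-size files, a dedicated library is likely a better choice than this hand-rolled parser.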
No commits in the last 6 months.
Use this if you need to analyze, process, or understand text written in Filipino or Taglish for machine learning applications.
Not ideal if your primary language of interest is not Filipino or if you are looking for embeddings that have been explicitly examined for biases.
Stars: 9
Forks: —
Language: Jupyter Notebook
License: —
Category:
Last pushed: Jun 16, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/danjohnvelasco/Filipino-Word-Embeddings"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
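The same request can be sketched in Python with the standard library. The base URL is taken from the curl command above; the `owner/repo` path pattern is inferred from that single example, and the helper names are hypothetical. The response format is not documented here, so this sketch returns the raw body as text rather than assuming JSON.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; treating the last two path
# segments as owner/repo is an assumption based on that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def build_url(owner, repo):
    """Build the request URL for a repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner)}/{quote(repo)}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the raw response body as text; no response schema is assumed."""
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


url = build_url("danjohnvelasco", "Filipino-Word-Embeddings")
print(url)
```

Calling `fetch_quality("danjohnvelasco", "Filipino-Word-Embeddings")` performs the actual network request; given the stated limit of 100 requests per day without a key, it may be worth caching the result locally.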
Higher-rated alternatives
shibing624/text2vec
text2vec, text to vector....
predict-idlab/pyRDF2Vec
Python Implementation and Extension of RDF2Vec
IntuitionEngineeringTeam/chars2vec
Character-based word embeddings model based on RNN for handling real world texts
IITH-Compilers/IR2Vec
Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.