Jimin9401/avocado

AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain


Experimental

This project helps developers improve the performance of natural language processing (NLP) models by tailoring the vocabulary to a specific dataset. It takes your existing text data and generates a specialized vocabulary, which can then improve accuracy on downstream NLP tasks such as text classification. It is aimed at data scientists and machine learning engineers who work with domain-specific text.
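The general idea of adapting a vocabulary to a downstream corpus can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the selection strategy used by this repository: the helper names `select_domain_tokens` and `extend_vocab`, the whitespace tokenization, the frequency threshold, and the toy corpus are all hypothetical.

```python
from collections import Counter


def select_domain_tokens(corpus, base_vocab, max_new_tokens=3, min_count=2):
    """Pick frequent corpus words that are missing from the base vocabulary.

    Hypothetical helper: frequency-based selection is an assumption made
    for illustration, not the method implemented in the repository.
    """
    counts = Counter(
        word
        for line in corpus
        for word in line.lower().split()
    )
    candidates = [
        (word, n)
        for word, n in counts.items()
        if word not in base_vocab and n >= min_count
    ]
    # Sort by descending count, then alphabetically so results are deterministic.
    candidates.sort(key=lambda item: (-item[1], item[0]))
    return [word for word, _ in candidates[:max_new_tokens]]


def extend_vocab(base_vocab, new_tokens):
    """Return a new token-to-id mapping with the new tokens appended.

    Assumes the base ids are contiguous and start at 0.
    """
    extended = dict(base_vocab)
    for token in new_tokens:
        if token not in extended:
            extended[token] = len(extended)
    return extended


# Toy data, invented for this example.
base_vocab = {"[UNK]": 0, "the": 1, "patient": 2, "was": 3, "given": 4}
corpus = [
    "the patient was given ibuprofen",
    "ibuprofen reduced the inflammation",
    "inflammation returned after the ibuprofen was stopped",
]

new_tokens = select_domain_tokens(corpus, base_vocab)
adapted_vocab = extend_vocab(base_vocab, new_tokens)
print(new_tokens)
print(adapted_vocab)
```

With a real pre-trained model, the embedding matrix would also need a row for each added token before fine-tuning; that step is omitted here.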

No commits in the last 6 months.

Use this if you are a developer looking to fine-tune pre-trained language models for better performance on domain-specific text datasets without needing external linguistic resources.

Not ideal if you are a non-technical user seeking a ready-to-use application for text analysis or if you don't have experience with machine learning frameworks like PyTorch.

natural-language-processing machine-learning-engineering text-classification model-optimization domain-adaptation

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 7 / 25


Language: Python

License: None

Higher-rated alternatives

kyzhouhzau/NLPGNN

1. Use BERT, ALBERT and GPT2 as tensorflow2.0's layer. 2. Implement GCN, GAN, GIN and...

IndexFziQ/GNN4NLP-Papers

A list of recent papers about Graph Neural Network methods applied in NLP areas.

qipeng/gcn-over-pruned-trees

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch...

kenqgu/Text-GCN

A PyTorch implementation of "Graph Convolutional Networks for Text Classification." (AAAI 2019)

daiquocnguyen/Graph-Transformer

Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022) (Pytorch and Tensorflow)
