IsaacRodgz/ConcatBERT
Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
This tool helps researchers and data scientists classify content that combines both images and text, such as social media posts, product reviews, or movie data. It takes in an image and its accompanying text, processes them using established AI models, and then outputs a classification or prediction, helping to automate tasks like content moderation or sentiment analysis. The primary users are data scientists and researchers working with mixed media datasets.
No commits in the last 6 months.
Use this if you need to classify items where both an image and related text are crucial for making an accurate prediction, and you want a straightforward, effective baseline model.
Not ideal if your classification task relies solely on text or solely on images, or if you require extremely lightweight models for deployment on resource-constrained devices.
Stars
42
Forks
6
Language
Jupyter Notebook
License
—
Category
Last pushed
Aug 26, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/IsaacRodgz/ConcatBERT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
fidelity/textwiser
[AAAI 2021] TextWiser: Text Featurization Library
RandolphVI/Multi-Label-Text-Classification
About Muti-Label Text Classification Based on Neural Network.
ThilinaRajapakse/pytorch-transformers-classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for...
ntumlgroup/LibMultiLabel
A library for multi-class and multi-label classification
xuyige/BERT4doc-Classification
Code and source for paper ``How to Fine-Tune BERT for Text Classification?``