whwu95/GPT4Vis

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

/ 100

Emerging

This project helps researchers and developers understand what GPT-4 and GPT-4V (GPT-4 with Vision) can do for automatically classifying images, videos, and 3D point clouds without needing specific training data for each task. It takes raw visual data and produces category labels or descriptions for those visuals. Anyone working on computer vision tasks who wants to leverage cutting-edge large language models for 'zero-shot' recognition would use this.

185 stars. No commits in the last 6 months.

Use this if you are a researcher or developer exploring the capabilities of GPT-4 for visual recognition across different data types like images, videos, and point clouds, especially for zero-shot tasks where you lack labeled training data.

Not ideal if you are looking for a ready-to-use, production-grade application or a tool that does not require deep technical understanding of large language models and vision APIs.

computer-vision image-recognition video-analysis 3d-point-cloud-classification zero-shot-learning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

185

Forks

Language

Python

License

MIT

Higher-rated alternatives

vorniches/snap2txt

Convert your project into a text prompt.

promptslab/openai-detector

AI classifier for indicating AI-written text

awekrx/ChatGPT-MidJourney-prompt

This is a ChatGPT based prompt generation model for MidJorney. The purpose of this model is to...

donwany/gpt3datagen

GPT3DataGen is a python package that generates fake data for fine-tuning openai models

flatypus/flowchat

A Python library for building clean and efficient multi-step prompt chains.

Explore Prompt Engineering Tools

All categories Trending Prompt Engineering directory Insights