kyegomez/GPT4o

Community Open Source Implementation of GPT4o in PyTorch

48
/ 100
Emerging

This open-source project helps you create a system similar to advanced AI models that can understand and respond to both images and audio, as well as text. You input a combination of text, images, and sound, and the system can generate human-like text responses, new images, or even spoken audio. It's designed for AI researchers, hobbyists, or innovators looking to build custom multi-modal AI applications.

Use this if you want to experiment with or build your own AI model that can process and generate content across text, images, and audio.

Not ideal if you're looking for a ready-to-use, production-ready multi-modal AI without needing to dive into its underlying architecture or code.

multi-modal-AI AI-development image-generation audio-processing natural-language-generation
No Package No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

26

Forks

5

Language

Shell

License

MIT

Last pushed

Mar 09, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/kyegomez/GPT4o"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.