Amshaker/Mobile-O
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device
This project offers a unified solution for creating and understanding images directly on your mobile device. You input text prompts or existing images, and it generates new images, edits them, or answers questions about their content. It's designed for anyone who needs to quickly generate visual content or analyze images without relying on cloud services.
123 stars.
Use this if you need to perform real-time image generation, editing, or understanding tasks directly on your iPhone without an internet connection.
Not ideal if you require complex, high-resolution image generation beyond 512x512 or if you prefer cloud-based services for processing.
Stars: 123
Forks: 9
Language: Python
License: —
Category:
Last pushed: Feb 24, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Amshaker/Mobile-O"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
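The same request can be sketched in Python using only the standard library. This is a minimal sketch, not a documented client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category/owner/repo is an assumption inferred from the single URL above, and the response format is not stated on this page, so the body is returned as plain text.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the three path segments are category, owner, and repo,
    as suggested by .../quality/diffusion/Amshaker/Mobile-O.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return API_BASE + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the endpoint and return the raw response body as text.

    The response schema is not documented here, so nothing is parsed.
    Raises urllib.error.HTTPError on a non-2xx status (for example,
    if the daily request limit is exceeded).
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset)
```

Calling `fetch_quality("diffusion", "Amshaker", "Mobile-O")` issues the same GET request as the curl command. If the body turns out to be JSON, it can be passed to `json.loads` afterwards.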
Higher-rated alternatives
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
EndlessSora/focal-frequency-loss
[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis
JIA-Lab-research/DreamOmni2
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing...
SkyworkAI/UniPic
Open-source SOTA multi-image editing model