Amshaker/Mobile-O

Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device

/ 100

Emerging

This project offers a unified solution for creating and understanding images directly on your mobile device. You input text prompts or existing images, and it generates new images, edits them, or answers questions about their content. It's designed for anyone who needs to quickly generate visual content or analyze images without relying on cloud services.

123 stars.

Use this if you need to perform real-time image generation, editing, or understanding tasks directly on your iPhone without an internet connection.

Not ideal if you require complex, high-resolution image generation beyond 512x512 or if you prefer cloud-based services for processing.

mobile-content-creation on-device-AI visual-question-answering image-editing text-to-image

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 11 / 25

Community 11 / 25

How are scores calculated?

Stars

123

Forks

Language

Python

License

—

Higher-rated alternatives

Vchitect/VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

EndlessSora/focal-frequency-loss

[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis

JIA-Lab-research/DreamOmni2

This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing...

SkyworkAI/UniPic

Open-source SOTA multi-image editing model

Explore Diffusion Models

All categories Trending Diffusion directory Insights