InternLM/InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

46
/ 100
Emerging

This system helps professionals who need to understand and generate content from various media, combining videos, audio, images, and text. You input raw video footage, audio recordings, or high-resolution images, and it outputs detailed descriptions, generated articles, or even web page code. It's ideal for content creators, researchers, and anyone working with complex multimedia information.

2,922 stars. No commits in the last 6 months.

Use this if you need to analyze long-form video content, engage in multi-turn conversations about multiple images, or create detailed articles and webpages from visual and textual inputs.

Not ideal if your primary need is simple image classification or generating short text captions without complex contextual understanding.

content-creation multimedia-analysis video-understanding digital-publishing web-development
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

2,922

Forks

177

Language

Python

License

Apache-2.0

Last pushed

May 26, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/InternLM/InternLM-XComposer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.