alibaba/AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

48
/ 100
Emerging

This project offers powerful AI models to help with tasks involving language, images, and video. You can feed in text, images, or video and get back things like detailed descriptions, answers to questions, or intelligent conversational responses. It's for researchers, content creators, or business analysts who work with complex multimedia information and need advanced understanding or generation capabilities.

2,051 stars. No commits in the last 6 months.

Use this if you need cutting-edge AI for tasks like analyzing documents, generating video captions, understanding visual content, or building sophisticated dialogue systems.

Not ideal if you are looking for a simple, off-the-shelf application with a user interface, as this project provides foundational models and techniques for developers.

multimodal-AI natural-language-processing computer-vision video-analysis document-understanding
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

2,051

Forks

299

Language

Python

License

Apache-2.0

Last pushed

Mar 19, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/alibaba/AliceMind"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.