alibaba/AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

/ 100

Emerging

This project offers powerful AI models to help with tasks involving language, images, and video. You can feed in text, images, or video and get back things like detailed descriptions, answers to questions, or intelligent conversational responses. It's for researchers, content creators, or business analysts who work with complex multimedia information and need advanced understanding or generation capabilities.

2,051 stars. No commits in the last 6 months.

Use this if you need cutting-edge AI for tasks like analyzing documents, generating video captions, understanding visual content, or building sophisticated dialogue systems.

Not ideal if you are looking for a simple, off-the-shelf application with a user interface, as this project provides foundational models and techniques for developers.

multimodal-AI natural-language-processing computer-vision video-analysis document-understanding

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

2,051

Forks

299

Language

Python

License

Apache-2.0

Higher-rated alternatives

neuml/paperai

📄 🤖 AI for medical and scientific papers

asreview/asreview-makita

Workflow generator for simulation studies using the command line interface of ASReview LAB

supriya46788/Research-Paper-Organizer

Open-source beginner-friendly project

allenai/papermage

library supporting NLP and CV research on scientific papers

Tavris1/AI-Toolkit-Easy-Install

One-click Portable Windows installation of 'AI-Toolkit by Ostris'

Explore ML Frameworks

All categories Trending ML Framework directory Insights