zai-org/VisualGLM-6B
Chinese and English multimodal conversational language model
This project is a multimodal conversational model for understanding and interacting with images and text in both Chinese and English. You can provide an image and ask questions about its content, and the model will generate descriptive answers. It's designed for anyone who needs to quickly extract information from images or engage in multimodal conversations.
4,169 stars. No commits in the last 6 months.
Use this if you need to ask questions about an image's content or engage in text-based conversations that incorporate visual information, especially in Chinese or English.
Not ideal if you require absolute factual accuracy for critical applications, as it's still an early version with known limitations like occasional factual errors or hallucination.
Stars: 4,169
Forks: 425
Language: Python
License: Apache-2.0
Category:
Last pushed: Aug 23, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/zai-org/VisualGLM-6B"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
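The same request can be sketched in Python using only the standard library. This is a minimal sketch, not a documented client: the path layout (`/<owner>/<repo>` appended to the base URL) is inferred from the single curl example above, the JSON response format is an assumption, and the helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Base URL taken from the curl example; everything after it is owner/repo.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (layout inferred from the curl example)."""
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body. The schema is not
    described in this listing, so the decoded value is returned as-is.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("zai-org", "VisualGLM-6B")
# print(json.dumps(data, indent=2))
```

Because unauthenticated access is capped at 100 requests/day, it is worth caching responses locally rather than calling `fetch_quality` in a loop.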
Higher-rated alternatives
hoochanlon/hamuleite
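🏔️ A knowledge base of academic papers in social science, economics, mathematics, game theory, philosophy, and systems engineering from National Taiwan University, the National University of Singapore, Waseda University, the University of Tokyo, Academia Sinica (Taiwan), and key Chinese universities and research institutions.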
JiauZhang/chatchat
Large Language Models Python API
yuanjie-ai/ChatLLM
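Work with LLMs easily; compatible with OpenAI and LangChain, with support for ERNIE Bot (文心一言), iFlytek Spark, Tencent Hunyuan, Zhipu ChatGLM, and others.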
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
mymusise/ChatGLM-Tuning
A fine-tuning approach based on ChatGLM-6B + LoRA