Agent skill

VLM_Expert

Provides vision-based AI chat: analyzes images, describes visual content, and supports multimodal interaction.


Install this agent skill in your project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/vlm-expert

SKILL.md

VLM (Vision Chat) Skill

Enables the AI to understand and respond to input that combines images with text prompts.

Core features

  • Image analysis: identifies the objects and scenes in an image.
  • Multi-image comparison: analyzes several images at once.

CLI example

bash
z-ai vision --prompt "What is in this image?" --image "./photo.jpg"
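
The source does not show how several images are passed in a single call, so the following is only a minimal sketch that reuses the documented command once per image. It relies solely on the `--prompt` and `--image` flags from the example above; the function name `describe_images` and the one-call-per-image approach are assumptions made for illustration, not part of the skill.

```bash
#!/usr/bin/env bash
# Sketch: run the documented `z-ai vision` command once per image.
# Only --prompt and --image (taken from the CLI example) are used.
# `describe_images` is a hypothetical helper, not part of the skill.

describe_images() {
  local prompt="$1"
  shift
  local image
  for image in "$@"; do
    # Skip paths that do not exist instead of passing them to the CLI.
    if [ ! -f "$image" ]; then
      echo "skip: $image (not found)" >&2
      continue
    fi
    echo "== $image =="
    z-ai vision --prompt "$prompt" --image "$image"
  done
}
```

Possible usage: `describe_images "What is in this image?" ./photo1.jpg ./photo2.jpg`. Each image is described separately; this does not produce a true side-by-side comparison, which would depend on multi-image support that the source does not document.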
