Back to all MCPs

Vision MCP Server

Provides powerful GLM-4.6V vision capabilities including image analysis, video understanding, and UI/code extraction for MCP clients.

About

Vision MCP Server is a Z.AI implementation based on MCP, offering image analysis, video understanding, OCR, error diagnosis, diagram interpretation, and more using GLM-4.6V. Supports multiple tools like ui_to_artifact, extract_text_from_screenshot, and video_analysis. Requires Node.js >=22 and Z.AI API key.

Install

claude mcp add -s user zai-vision-mcp --env Z_AI_API_KEY=your_api_key Z_AI_MODE=ZAI -- npx -y @z_ai/mcp-server

Features

Image analysis & OCR
Video understanding (≤8MB)
UI to code/artifact conversion
Error screenshot diagnosis
Technical diagram interpretation
Data visualization analysis

Tags

VisionGLM-4VImageVideoOfficial