Back to all MCPs
Vision MCP Server
Provides powerful GLM-4.6V vision capabilities including image analysis, video understanding, and UI/code extraction for MCP clients.
About
Vision MCP Server is a Z.AI implementation based on MCP, offering image analysis, video understanding, OCR, error diagnosis, diagram interpretation, and more using GLM-4.6V. Supports multiple tools like ui_to_artifact, extract_text_from_screenshot, and video_analysis. Requires Node.js >=22 and Z.AI API key.
Install
claude mcp add -s user zai-vision-mcp --env Z_AI_API_KEY=your_api_key Z_AI_MODE=ZAI -- npx -y @z_ai/mcp-serverFeatures
Image analysis & OCR
Video understanding (≤8MB)
UI to code/artifact conversion
Error screenshot diagnosis
Technical diagram interpretation
Data visualization analysis
Tags
VisionGLM-4VImageVideoOfficial