# Agent Vision Awareness - Configuration ## Current OMO Compatibility This skill is designed to work with **火山方舟 (VolcEngine) API**: - ✅ **No custom agent delegation required** - uses direct API calls - ✅ **Compatible with standard OMO configuration** - ✅ **Works with existing text-only models** - ✅ **Uses 火山方舟 API key** from OpenCode config ## Required Configuration ### 1. API Key Setup 火山方舟 API Key 已配置在 `~/.config/opencode/config.json` 中: - **API Key**: `b0359bed-09f2-49e2-a53c-32ba057412e3` - **Base URL**: `https://ark.cn-beijing.volces.com/api/coding/v3` ### 2. Supported Vision Model **唯一支持的视觉模型**: `doubao-seed-code` **注意**: Coding Plan 不支持专业视觉模型(如 doubao-vision-pro-32k) ### 3. Network Access Ensure network connectivity to: - `https://ark.cn-beijing.volces.com/api/coding/v3` (火山方舟 API) ## Removed Problematic Configurations ❌ **Custom Agent Delegation**: The `@multimodal-looker` approach has been **removed** ❌ **阿里云百炼**: 已停止使用 (API不可用) ## Working Implementation ✅ **Direct API Integration**: Uses 火山方舟 `doubao-seed-code` ✅ **Automatic Detection**: Built-in pattern matching for visual content ✅ **Graceful Degradation**: Clear error messages and fallback options ✅ **Simple Integration**: No special commands needed - just mention images naturally ## Verification To verify the configuration is working: 1. Load the `agent-vision-awareness` skill 2. Test with: "分析这个截图 test.png" (replace with actual image path) 3. Should automatically detect and process the image ## Known Limitations - 响应时间较长 (20-60秒) - 不够稳定,偶尔超时 - 建议图片压缩到1024px可提升速度