# Agent Vision Awareness - Configuration

## Current OMO Compatibility

This skill is designed to work with **火山方舟 (VolcEngine) API**:

- ✅ **No custom agent delegation required** - uses direct API calls
- ✅ **Compatible with standard OMO configuration**
- ✅ **Works with existing text-only models**
- ✅ **Uses 火山方舟 API key** from OpenCode config

## Required Configuration

### 1. API Key Setup
火山方舟 API Key 已配置在 `~/.config/opencode/config.json` 中:

- **API Key**: `b0359bed-09f2-49e2-a53c-32ba057412e3`
- **Base URL**: `https://ark.cn-beijing.volces.com/api/coding/v3`

### 2. Supported Vision Model

**唯一支持的视觉模型**: `doubao-seed-code`

**注意**: Coding Plan 不支持专业视觉模型（如 doubao-vision-pro-32k）

### 3. Network Access
Ensure network connectivity to:
- `https://ark.cn-beijing.volces.com/api/coding/v3` (火山方舟 API)

## Removed Problematic Configurations

❌ **Custom Agent Delegation**: The `@multimodal-looker` approach has been **removed**

❌ **阿里云百炼**: 已停止使用 (API不可用)

## Working Implementation

✅ **Direct API Integration**: Uses 火山方舟 `doubao-seed-code`
✅ **Automatic Detection**: Built-in pattern matching for visual content
✅ **Graceful Degradation**: Clear error messages and fallback options
✅ **Simple Integration**: No special commands needed - just mention images naturally

## Verification

To verify the configuration is working:

1. Load the `agent-vision-awareness` skill
2. Test with: "分析这个截图 test.png" (replace with actual image path)
3. Should automatically detect and process the image

## Known Limitations

- 响应时间较长 (20-60秒)
- 不够稳定，偶尔超时
- 建议图片压缩到1024px可提升速度