04db423416
- 70 skills with code and documentation - Add .gitignore (ignore __pycache__, output/, temp/, venv/) - Clean up test intermediates and caches
55 lines
1.7 KiB
Markdown
55 lines
1.7 KiB
Markdown
# Agent Vision Awareness - Configuration
|
||
|
||
## Current OMO Compatibility
|
||
|
||
This skill is designed to work with **火山方舟 (VolcEngine) API**:
|
||
|
||
- ✅ **No custom agent delegation required** - uses direct API calls
|
||
- ✅ **Compatible with standard OMO configuration**
|
||
- ✅ **Works with existing text-only models**
|
||
- ✅ **Uses 火山方舟 API key** from OpenCode config
|
||
|
||
## Required Configuration
|
||
|
||
### 1. API Key Setup
|
||
火山方舟 API Key 已配置在 `~/.config/opencode/config.json` 中:
|
||
|
||
- **API Key**: `b0359bed-09f2-49e2-a53c-32ba057412e3`
|
||
- **Base URL**: `https://ark.cn-beijing.volces.com/api/coding/v3`
|
||
|
||
### 2. Supported Vision Model
|
||
|
||
**唯一支持的视觉模型**: `doubao-seed-code`
|
||
|
||
**注意**: Coding Plan 不支持专业视觉模型(如 doubao-vision-pro-32k)
|
||
|
||
### 3. Network Access
|
||
Ensure network connectivity to:
|
||
- `https://ark.cn-beijing.volces.com/api/coding/v3` (火山方舟 API)
|
||
|
||
## Removed Problematic Configurations
|
||
|
||
❌ **Custom Agent Delegation**: The `@multimodal-looker` approach has been **removed**
|
||
|
||
❌ **阿里云百炼**: 已停止使用 (API不可用)
|
||
|
||
## Working Implementation
|
||
|
||
✅ **Direct API Integration**: Uses 火山方舟 `doubao-seed-code`
|
||
✅ **Automatic Detection**: Built-in pattern matching for visual content
|
||
✅ **Graceful Degradation**: Clear error messages and fallback options
|
||
✅ **Simple Integration**: No special commands needed - just mention images naturally
|
||
|
||
## Verification
|
||
|
||
To verify the configuration is working:
|
||
|
||
1. Load the `agent-vision-awareness` skill
|
||
2. Test with: "分析这个截图 test.png" (replace with actual image path)
|
||
3. Should automatically detect and process the image
|
||
|
||
## Known Limitations
|
||
|
||
- 响应时间较长 (20-60秒)
|
||
- 不够稳定,偶尔超时
|
||
- 建议图片压缩到1024px可提升速度 |