04db423416
- 70 skills with code and documentation - Add .gitignore (ignore __pycache__, output/, temp/, venv/) - Clean up test intermediates and caches
1.7 KiB
1.7 KiB
Agent Vision Awareness - Configuration
Current OMO Compatibility
This skill is designed to work with 火山方舟 (VolcEngine) API:
- ✅ No custom agent delegation required - uses direct API calls
- ✅ Compatible with standard OMO configuration
- ✅ Works with existing text-only models
- ✅ Uses 火山方舟 API key from OpenCode config
Required Configuration
1. API Key Setup
火山方舟 API Key 已配置在 ~/.config/opencode/config.json 中:
- API Key:
b0359bed-09f2-49e2-a53c-32ba057412e3 - Base URL:
https://ark.cn-beijing.volces.com/api/coding/v3
2. Supported Vision Model
唯一支持的视觉模型: doubao-seed-code
注意: Coding Plan 不支持专业视觉模型(如 doubao-vision-pro-32k)
3. Network Access
Ensure network connectivity to:
https://ark.cn-beijing.volces.com/api/coding/v3(火山方舟 API)
Removed Problematic Configurations
❌ Custom Agent Delegation: The @multimodal-looker approach has been removed
❌ 阿里云百炼: 已停止使用 (API不可用)
Working Implementation
✅ Direct API Integration: Uses 火山方舟 doubao-seed-code
✅ Automatic Detection: Built-in pattern matching for visual content
✅ Graceful Degradation: Clear error messages and fallback options
✅ Simple Integration: No special commands needed - just mention images naturally
Verification
To verify the configuration is working:
- Load the
agent-vision-awarenessskill - Test with: "分析这个截图 test.png" (replace with actual image path)
- Should automatically detect and process the image
Known Limitations
- 响应时间较长 (20-60秒)
- 不够稳定,偶尔超时
- 建议图片压缩到1024px可提升速度