Files
skills/agent-vision-awareness/CONFIG.md
T
hmo 04db423416 Initial commit: skills library
- 70 skills with code and documentation
- Add .gitignore (ignore __pycache__, output/, temp/, venv/)
- Clean up test intermediates and caches
2026-04-26 19:27:40 +08:00

1.7 KiB
Raw Blame History

Agent Vision Awareness - Configuration

Current OMO Compatibility

This skill is designed to work with 火山方舟 (VolcEngine) API:

  • No custom agent delegation required - uses direct API calls
  • Compatible with standard OMO configuration
  • Works with existing text-only models
  • Uses 火山方舟 API key from OpenCode config

Required Configuration

1. API Key Setup

火山方舟 API Key 已配置在 ~/.config/opencode/config.json 中:

  • API Key: b0359bed-09f2-49e2-a53c-32ba057412e3
  • Base URL: https://ark.cn-beijing.volces.com/api/coding/v3

2. Supported Vision Model

唯一支持的视觉模型: doubao-seed-code

注意: Coding Plan 不支持专业视觉模型(如 doubao-vision-pro-32k

3. Network Access

Ensure network connectivity to:

  • https://ark.cn-beijing.volces.com/api/coding/v3 (火山方舟 API)

Removed Problematic Configurations

Custom Agent Delegation: The @multimodal-looker approach has been removed

阿里云百炼: 已停止使用 (API不可用)

Working Implementation

Direct API Integration: Uses 火山方舟 doubao-seed-code Automatic Detection: Built-in pattern matching for visual content Graceful Degradation: Clear error messages and fallback options Simple Integration: No special commands needed - just mention images naturally

Verification

To verify the configuration is working:

  1. Load the agent-vision-awareness skill
  2. Test with: "分析这个截图 test.png" (replace with actual image path)
  3. Should automatically detect and process the image

Known Limitations

  • 响应时间较长 (20-60秒)
  • 不够稳定,偶尔超时
  • 建议图片压缩到1024px可提升速度