Files
skills/agent-vision-awareness/CONFIG.md
T
hmo 04db423416 Initial commit: skills library
- 70 skills with code and documentation
- Add .gitignore (ignore __pycache__, output/, temp/, venv/)
- Clean up test intermediates and caches
2026-04-26 19:27:40 +08:00

55 lines
1.7 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Agent Vision Awareness - Configuration
## Current OMO Compatibility
This skill is designed to work with **火山方舟 (VolcEngine) API**:
-**No custom agent delegation required** - uses direct API calls
-**Compatible with standard OMO configuration**
-**Works with existing text-only models**
-**Uses 火山方舟 API key** from OpenCode config
## Required Configuration
### 1. API Key Setup
火山方舟 API Key 已配置在 `~/.config/opencode/config.json` 中:
- **API Key**: `b0359bed-09f2-49e2-a53c-32ba057412e3`
- **Base URL**: `https://ark.cn-beijing.volces.com/api/coding/v3`
### 2. Supported Vision Model
**唯一支持的视觉模型**: `doubao-seed-code`
**注意**: Coding Plan 不支持专业视觉模型(如 doubao-vision-pro-32k
### 3. Network Access
Ensure network connectivity to:
- `https://ark.cn-beijing.volces.com/api/coding/v3` (火山方舟 API)
## Removed Problematic Configurations
**Custom Agent Delegation**: The `@multimodal-looker` approach has been **removed**
**阿里云百炼**: 已停止使用 (API不可用)
## Working Implementation
**Direct API Integration**: Uses 火山方舟 `doubao-seed-code`
**Automatic Detection**: Built-in pattern matching for visual content
**Graceful Degradation**: Clear error messages and fallback options
**Simple Integration**: No special commands needed - just mention images naturally
## Verification
To verify the configuration is working:
1. Load the `agent-vision-awareness` skill
2. Test with: "分析这个截图 test.png" (replace with actual image path)
3. Should automatically detect and process the image
## Known Limitations
- 响应时间较长 (20-60秒)
- 不够稳定,偶尔超时
- 建议图片压缩到1024px可提升速度