hmo/skills

Files

T

hmo 04db423416 Initial commit: skills library

- 70 skills with code and documentation
- Add .gitignore (ignore __pycache__, output/, temp/, venv/)
- Clean up test intermediates and caches

2026-04-26 19:27:40 +08:00

1.7 KiB

Raw Blame History

Agent Vision Awareness - Configuration

Current OMO Compatibility

This skill is designed to work with 火山方舟 (VolcEngine) API:

✅ No custom agent delegation required - uses direct API calls
✅ Compatible with standard OMO configuration
✅ Works with existing text-only models
✅ Uses 火山方舟 API key from OpenCode config

Required Configuration

1. API Key Setup

火山方舟 API Key 已配置在 ~/.config/opencode/config.json 中:

API Key: b0359bed-09f2-49e2-a53c-32ba057412e3
Base URL: https://ark.cn-beijing.volces.com/api/coding/v3

2. Supported Vision Model

唯一支持的视觉模型: doubao-seed-code

注意: Coding Plan 不支持专业视觉模型（如 doubao-vision-pro-32k）

3. Network Access

Ensure network connectivity to:

https://ark.cn-beijing.volces.com/api/coding/v3 (火山方舟 API)

Removed Problematic Configurations

❌ Custom Agent Delegation: The @multimodal-looker approach has been removed

❌ 阿里云百炼: 已停止使用 (API不可用)

Working Implementation

✅ Direct API Integration: Uses 火山方舟 doubao-seed-code ✅ Automatic Detection: Built-in pattern matching for visual content ✅ Graceful Degradation: Clear error messages and fallback options ✅ Simple Integration: No special commands needed - just mention images naturally

Verification

To verify the configuration is working:

Load the agent-vision-awareness skill
Test with: "分析这个截图 test.png" (replace with actual image path)
Should automatically detect and process the image

Known Limitations

响应时间较长 (20-60秒)
不够稳定，偶尔超时
建议图片压缩到1024px可提升速度

1.7 KiB Raw Blame History Unescape Escape