
OmniSight System
OmniSight is SkyrimNet’s AI-powered image-to-text system that gives the LLM real, continuous visual awareness of the game world. It captures in-game screenshots automatically or on demand, analyzes them with a vision model, and generates detailed descriptions , injected in the NPC surrounding context and dynamic behaviors.
Core Features
📸 Automatic Screenshot Capture
- Captures images during player-initiated dialogue, when entering new areas, or via the background worker.
- Provides passive, continuous world awareness without any user interaction.
🎯 Hotkey Target Scan
- Press a hotkey to capture and describe the NPC or object under the crosshair.
- Hold the hotkey to capture the player character.
- Actor captures override the actor’s static bio “appearance” text with real visual data.
🧠 Vision-Enhanced LLM Awareness
- AI can “see” NPCs, posture, clothing, objects, structures, clutter, terrain, and architecture.
- Automatically respects first-person vs third-person camera perspective so the model never misinterprets the player model.
🖼️ Dashboard Integration
- Dashboard displays the latest screenshot and the AI-generated description.
- All captures are stored on a dedicated OmniSight page for browsing.
- Useful for debugging and understanding AI/NPC reactions.
Configuration Options
General
- Enable or disable OmniSight globally.
Background Worker
- Passive continuous capturing.
- Adjustable:
- Check interval
- Max concurrent captures
- Cooldown between captures
- Extremely low-cost by default: < $0.0004 per capture.
Capture Conditions
- Capture only when player is idle.
- Require fully loaded 3D models.
- Optional daylight-only outdoor capturing.
Subjects
- Toggle Actor captures (hotkey-driven).
- Toggle Location captures (automatic).
Vision Prompt System
OmniSight uses specialized prompt templates for:
- Actor descriptions (appearance, visible equipment, build, posture)
- Location descriptions (architecture, objects, terrain, layout)
- Immediate action scans (real-time environment + NPC activity)
All templates enforce:
- Strict grounding in what is visible
- No UI-based naming or unseen detail inference
- Authoritative NPC-visible list rules
- First-/third-person perspective logic
- Objective, lore-appropriate language
What OmniSight Enables
- Dynamic AI responses based on real visual context
- More lifelike NPC interactions and situational awareness
- Reduced need for data integration
- A world where the AI actually sees what is happening