
SYSTEM OVERVIEW.
IRIS is a voice-first system automation engine engineered to execute operations directly on your local workstation. Utilizing a low-latency Gemini 3.1 Live API integration with bidirectional WebRTC audio, IRIS translates natural language intent into local system actions.
What is Voice-First?
Traditional AI models are text-first: you type, wait, read. IRIS operates bidirectionally with real-time WebRTC audio streaming, bringing latency under 500ms. Speak naturally, interrupt anytime—IRIS listens, thinks, and executes dynamically.
What makes IRIS different?
- •Proprietary Orchestration: Protected, high-performance agent loops utilizing LangGraph state machines.
- •Pure Local Execution: Unlike web wrappers, IRIS executes CLI commands, manipulates desktop windows, and operates hardware.
- •System-Level Access: Sandboxed but deep access to directories, galleries, active processes, and ADB bridges.
Open Core Architecture
IRIS follows an open-core licensing model. The public repository controls the frontend shell, electron layout, and standard UI widgets. The core voice engine, neural orchestration loops, and system-level actions are packaged as protected main process modules to secure intellectual property.