Documentation Core

SYSTEM OVERVIEW.

IRIS is a voice-first system automation engine that executes operations directly on your local workstation. Using a low-latency Gemini 3.1 Live API integration with bidirectional WebRTC audio, IRIS translates natural-language intent into local system actions.

What is Voice-First?

Traditional AI assistants are text-first: you type, wait, then read. IRIS instead streams audio in both directions over WebRTC in real time, bringing latency under 500 ms. Speak naturally and interrupt at any time; IRIS listens, reasons, and executes while the conversation is still in progress.
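The "interrupt anytime" behaviour is often called barge-in. The following is a minimal sketch of the idea, not IRIS's actual audio code: it assumes playback is modelled as an asyncio task that is cancelled as soon as the user starts speaking. The names `speak` and `turn_with_barge_in`, and the use of sleeps to simulate audio, are hypothetical.

```python
import asyncio
from typing import List


async def speak(chunks: List[str], spoken: List[str], chunk_seconds: float) -> None:
    """Simulate playing a response chunk by chunk (no real audio involved)."""
    for chunk in chunks:
        await asyncio.sleep(chunk_seconds)  # stand-in for playing one audio chunk
        spoken.append(chunk)


async def turn_with_barge_in(
    chunks: List[str], interrupt_after: float, chunk_seconds: float = 0.05
) -> List[str]:
    """Start playback, then cancel it when the (simulated) user speaks."""
    spoken: List[str] = []
    playback = asyncio.create_task(speak(chunks, spoken, chunk_seconds))
    await asyncio.sleep(interrupt_after)  # stand-in for detecting user speech
    playback.cancel()  # has no effect if playback already finished
    try:
        await playback
    except asyncio.CancelledError:
        pass  # playback was cut short by the interruption
    return spoken


if __name__ == "__main__":
    heard = asyncio.run(turn_with_barge_in(["Opening", "your", "files"], 0.0))
    print(heard)
```

Because the two directions run concurrently, the response can be cut off mid-sentence rather than only between turns, which is what a full-duplex stream makes possible.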

1. Voice Input (Full Duplex Audio) ➔ 2. Gemini Live API (Inference & Intent) ➔ 3. Native OS Exec (LangGraph Tooling)
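The three stages above can be pictured as a small state machine in which each node updates a shared state and names the next node. The sketch below is illustrative only: it uses plain Python rather than the LangGraph library, keyword matching stands in for the Gemini Live API call, and every name (`TurnState`, `run_turn`, the intents, the handler strings) is a hypothetical placeholder, not part of IRIS.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class TurnState:
    """State carried through one voice turn (all fields are illustrative)."""
    transcript: str                 # stand-in for the streamed audio input
    intent: Optional[str] = None    # filled in by the inference stage
    result: Optional[str] = None    # filled in by the execution stage
    trace: List[str] = field(default_factory=list)


def voice_input(state: TurnState) -> str:
    state.trace.append("voice_input")
    state.transcript = state.transcript.strip().lower()
    return "infer_intent"


def infer_intent(state: TurnState) -> str:
    state.trace.append("infer_intent")
    # Keyword matching is a placeholder for the model's intent inference.
    if "open" in state.transcript:
        state.intent = "open_app"
    elif "volume" in state.transcript:
        state.intent = "set_volume"
    else:
        state.intent = "unknown"
    return "execute"


def execute(state: TurnState) -> str:
    state.trace.append("execute")
    # Handlers only describe the action; nothing is run on the machine.
    handlers: Dict[str, Callable[[], str]] = {
        "open_app": lambda: "would launch application",
        "set_volume": lambda: "would adjust volume",
    }
    state.result = handlers.get(state.intent or "", lambda: "no-op")()
    return "done"


NODES: Dict[str, Callable[[TurnState], str]] = {
    "voice_input": voice_input,
    "infer_intent": infer_intent,
    "execute": execute,
}


def run_turn(transcript: str) -> TurnState:
    """Walk the graph from the entry node until a node returns 'done'."""
    state = TurnState(transcript=transcript)
    node = "voice_input"
    while node != "done":
        node = NODES[node](state)
    return state


if __name__ == "__main__":
    print(run_turn("  Open the terminal ").result)
```

Having each node return the name of its successor keeps the routing explicit, so adding a branch (for example, a confirmation step before execution) only means adding a node and changing one return value.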

What makes IRIS different?

• Proprietary Orchestration: Protected, high-performance agent loops utilizing LangGraph state machines.
• Pure Local Execution: Unlike web wrappers, IRIS executes CLI commands, manipulates desktop windows, and operates hardware.
• System-Level Access: Sandboxed but deep access to directories, galleries, active processes, and ADB bridges.
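One common way to combine deep file access with sandboxing is an allowlist of root directories, checked after path normalization. The sketch below shows that general technique; it is an assumption about how such a check could look, not a description of IRIS's sandbox. The function name `is_path_allowed` and the `/iris_demo/...` paths are hypothetical.

```python
from pathlib import Path
from typing import Iterable


def is_path_allowed(candidate: str, allowed_roots: Iterable[str]) -> bool:
    """Return True if candidate resolves to an allowed root or a path beneath it."""
    # resolve() normalizes ".." segments and follows symlinks where they exist,
    # so traversal tricks are judged by where the path really points.
    resolved = Path(candidate).resolve()
    for root in allowed_roots:
        root_resolved = Path(root).resolve()
        if resolved == root_resolved or root_resolved in resolved.parents:
            return True
    return False


if __name__ == "__main__":
    roots = ["/iris_demo/gallery"]
    print(is_path_allowed("/iris_demo/gallery/cat.png", roots))
    print(is_path_allowed("/iris_demo/gallery/../secrets/key", roots))
```

Comparing whole path components, rather than using a string prefix test, avoids treating a sibling such as `/iris_demo/gallery_backup` as if it were inside `/iris_demo/gallery`.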

Open Core Architecture

IRIS follows an open-core licensing model. The public repository contains the frontend shell, Electron layout, and standard UI widgets. The core voice engine, neural orchestration loops, and system-level actions are packaged as protected main-process modules to secure intellectual property.