Gemini Live Agent Challenge 2026

LORE

The World Is Your Documentary
Point your camera at history...|

Point your camera at any place. Speak any question. LORE generates a living documentary — narrated, illustrated, cinematic — in real time. History has never been this immersive.

Watch Demo
Scroll

Four inputs.
One living documentary.

Every way you interact with the world becomes a doorway into history.

Features that emerge
from the fusion.

Six capabilities that make LORE unlike anything that came before.

🧠
Gemini Live API
Native multimodal interaction. Gemini hears your voice and sees your camera frames in a single bidirectional stream for sub-second responses.
Live Intelligence
📽️
Google Maps Platform
Places API and Directions API provide the ground truth for your location, ensuring every historical claim is tied to the exact coordinates you stand upon.
Spatial Grounding
🎞️
Veo 3.1 & Gemini 3.1
Cinematic video and high-fidelity illustrations generated on-the-fly to visualize historical events, architectural reconstructions, and alternate futures.
Generative Media
🎙️
Historical Characters
At significant landmarks, LORE narrations shift into dramatic interpretations of the people who shaped history, bringing voices from the past into the present.
Persona Engine
🌍
Alternate History
Ask "What if?" and watch the world transform. Speculative scenarios are grounded in physical visual context through generative fusion.
Lore Exclusive
Direct Multimodal Flow
A streamlined transparent proxy coordinates between the mobile app and Gemini Live API to ensure zero-friction documentary generation.
High-Performance Bridge

How LORE thinks.

A transparent, high-performance pipeline from your senses to the screen.

SYSTEM TOPOLOGY
Flutter Mobile (Frontend)
Gemini Live Proxy — Port 8090
Gemini Live API
Image Server
Port 8091
Video Server
Port 8092
Google Cloud Stack — Vertex AI · Cloud Run
01
Multimodal Capture
Real-time camera frames and audio stream via WebSocket to the Gemini Live Proxy. Vision and voice are processed concurrently.
02
Spatial Grounding
GPS coordinates and reverse-geocoding provide the location context, ensuring narration is grounded in physical reality.
03
Tool Call Orchestration
Gemini Live API triggers generate_image and generate_video tools, which are executed by the specialized backend servers.
04
Real-time Delivery
Interleaved content is streamed back to the mobile app for an immersive, zero-friction documentary experience.

Built with the full
Google Gemini stack.

Every layer purpose-chosen for real-time multimodal performance.

Live Intelligence
PRIMARY Gemini 2.5 Flash Live
Native Audio: vision + audio + session memory
Generation Layer
VIDEO Veo 3.1
1080p cinematic video generation
IMAGE Gemini 3.1 Flash Image
Rapid high-fidelity image generation
Spatial & Navigation
NAV Google Directions API
Walking navigation for GPS Tracking mode
MAP Google Maps SDK
Landmark visualization and route rendering
Infrastructure
COMPUTE Google Cloud Run
Serverless compute hosting the proxy and generation servers