@hci-uwb

CECA - A Configurable Framework for Embodied Conversational AI Agents in Extended Reality

, , , , , and . 2026 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW), (2026)To be published.

Abstract

We present CECA, a configurable framework for embodied conversational AI agents in Unity-based extended reality (XR) applications. CECA employs a client–server architecture to decouple agent logic from game engine–based embodiment. Built on LiveKit Agents, our approach integrates speech-to-text (STT), large language models (LLMs), and text-to-speech (TTS) into a unified, streaming voice-to-voice pipeline configured via metadata rather than code changes. We outline how this architecture flexibly integrates local and cloud AI providers while mitigating limited provider SDK support in Unity. Finally, we highlight opportunities for future work, including multi-agent scenarios, higher-level templates for XR research, and systematic user studies.

Links and resources

Tags