In this talk, we’ll explain everything you need to know to build your own voice agent running entirely on a stock Raspberry Pi 5, with no internet connection required. The end result will be a system that enables you to speak commands or questions and hear responses in just a second or two. Going beyond speech recognition, the system includes intent recognition, enabling it to understand the purpose or goal behind a user’s input. Supported questions and responses are entirely customizable for your use cases. The system is built on the Moonshine Voice open-source library, which handles speech-to-text and intent recognition.

