Voice Mode
Hermes Agent supports full voice interaction across CLI and messaging platforms. Talk to the agent using your microphone, hear spoken replies, and have live voice conversations in Discord voice channels. If you want a practical setup walkthrough with recommended configurations and real usage patterns, see Use Voice Mode with Hermes. …
What this page covers
- Prerequisites
- Overview
- Requirements
  - Python Packages
  - System Dependencies
  - API Keys
- CLI Voice Mode
  - Quick Start
  - How It Works
  - Silence Detection
  - Streaming TTS
  - Hallucination Filter
- Gateway Voice Reply (Telegram & Discord)
  - Discord: Channels vs DMs
  - Commands
  - Modes
  - Platform Delivery
- Discord Voice Channels
  - Setup
  - Start the Gateway
  - Commands
  - How It Works
  - Text Channel Integration
  - Echo Prevention
  - Access Control
- Configuration Reference
  - config.yaml
  - Environment Variables
  - STT Provider Comparison
  - TTS Provider Comparison
- Troubleshooting
  - "No audio device found" (CLI)
  - Bot doesn't respond in Discord server channels
  - Bot joins VC but doesn't hear me
  - Bot hears me but doesn't respond
  - Bot responds in text but not in voice channel
  - Whisper returns garbage text
Section outline mirrored from the official Hermes Agent documentation. Follow any heading to read the complete text on the source site.