Assistant mode: voice only by richiejp · Pull Request #40 · richiejp/VoxInput

richiejp · 2025-12-31T16:03:15Z

Eventually Assistant Mode will allow you to control your desktop with voice using natural speech and also allow a VLM to describe what is on the desktop. We can use MCP servers (tool calls) or a VLM which will be able to locate the coordinates of items on the desktop and click them.

Initially though this PR will just allow you to speak with an LLM using audio both ways over the OpenAI realtime API.

For LocalAI support this requires mudler/LocalAI#6245 which will implement the conversational parts of the API before we move onto tool calls and multi-modal support needed for a full desktop assistant.

feat: Add experimental assistant mode with audio conversations

0d0da49

richiejp force-pushed the feat/assistant-mode branch from 7d35423 to 0d0da49 Compare January 7, 2026 14:31

richiejp changed the title ~~Assistant mode~~ Assistant mode: voice only Jan 7, 2026

richiejp merged commit 6dd5344 into main Jan 7, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Assistant mode: voice only#40

Assistant mode: voice only#40
richiejp merged 1 commit intomainfrom
feat/assistant-mode

richiejp commented Dec 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

richiejp commented Dec 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant