Web Interface Skill for Voice Assistant

Build a production-ready Next.js web interface that connects to the Voice Assistant backend.

Architecture Overview

┌─────────────────────────────────────────────────────────────────┐
│                      BROWSER (Next.js)                          │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────┐  │
│  │ Voice Record │  │ Chat UI      │  │ Push-to-Talk/        │  │
│  │ (MediaRecorder)│ │ (React)     │  │ Wake Word (optional) │  │
│  └──────────────┘  └──────────────┘  └──────────────────────┘  │
│           │                │                    │               │
│           └────────────────┴────────────────────┘               │
│                            │ WebSocket                          │
└────────────────────────────┼────────────────────────────────────┘
                             │
┌────────────────────────────┼────────────────────────────────────┐
│                      BACKEND (FastAPI)                          │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────────────┐  │
│  │ WebSocket    │  │ REST API     │  │ Audio Streaming      │  │
│  │ Handler      │  │ Endpoints    │  │ Processor            │  │
│  └──────────────┘  └──────────────┘  └──────────────────────┘  │
│           │                │                    │               │
│           └────────────────┴────────────────────┘               │
│                            │                                    │
│  ┌──────────────────────────────────────────────────────────┐  │
│  │              Voice Assistant Core                         │  │
│  │  STT → Intent → Memory → Planner → Tools → LLM → TTS     │  │
│  └──────────────────────────────────────────────────────────┘  │
└─────────────────────────────────────────────────────────────────┘

Quick Start

1. Backend Setup (FastAPI with WebSocket)

Create src/api/websocket_server.py:

python

from fastapi import FastAPI, WebSocket, WebSocketDisconnect
from fastapi.middleware.cors import CORSMiddleware
import asyncio
import json
import base64

app = FastAPI(title="Voice Assistant API")

app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],  # Configure for production
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
)

class ConnectionManager:
    def __init__(self):
        self.active_connections: list[WebSocket] = []

    async def connect(self, websocket: WebSocket):
        await websocket.accept()
        self.active_connections.append(websocket)

    def disconnect(self, websocket: WebSocket):
        self.active_connections.remove(websocket)

    async def send_json(self, websocket: WebSocket, data: dict):
        await websocket.send_json(data)

manager = ConnectionManager()

@app.websocket("/ws/voice")
async def websocket_endpoint(websocket: WebSocket):
    await manager.connect(websocket)
    try:
        while True:
            data = await websocket.receive_json()
            # Process audio/text and return response
            response = await process_message(data)
            await manager.send_json(websocket, response)
    except WebSocketDisconnect:
        manager.disconnect(websocket)

2. Next.js Frontend Setup

Initialize project:

bash

npx create-next-app@latest web --typescript --tailwind --app --src-dir
cd web && npm install

3. Docker Deployment

Use docker-compose.yml from assets/docker/.

Implementation Steps

Phase 1: Backend API

Create FastAPI WebSocket server - see references/backend-api.md
Add audio streaming endpoints
Integrate with existing Voice Assistant services

Phase 2: Frontend

Set up Next.js project - see references/nextjs-setup.md
Create voice recording component with MediaRecorder API
Implement WebSocket client for real-time communication
Build chat UI with message history

Phase 3: Wake Word Alternative

Since Picovoice requires API key, use these alternatives:

Push-to-Talk: Space bar or button hold (recommended for web)
Voice Activity Detection: Browser-based VAD
OpenWakeWord: Open-source wake word (requires backend processing)

Phase 4: Deployment

Containerize with Docker - see assets/docker/
Configure nginx reverse proxy
Set up SSL/TLS for production
Deploy to cloud (AWS/GCP/Azure) or self-host

Key Components

Component	Technology	Purpose
Frontend	Next.js 14 + TypeScript	Web UI
Styling	Tailwind CSS + shadcn/ui	Modern UI components
Voice	MediaRecorder API	Browser audio capture
Real-time	WebSocket	Bidirectional communication
Backend	FastAPI	API server
Container	Docker + docker-compose	Deployment
Proxy	nginx	SSL termination, load balancing

Reference Files

references/backend-api.md - Complete FastAPI implementation
references/nextjs-setup.md - Next.js project structure and components
references/deployment.md - Docker and cloud deployment guide

Assets

assets/docker/ - Docker configuration files
assets/web-template/ - Next.js boilerplate (if needed)

Search AI Tools

web-interface

Install this agent skill to your Project

SKILL.md