bijit/VoiceAgent

mirror of https://github.com/Bijit-Mondal/VoiceAgent.git synced 2026-03-02 18:36:39 +00:00

Files

Bijit Mondal 77bac597e4 init:

2026-02-13 17:33:22 +05:30

1.2 KiB

Raw Blame History

voice-agent-ai-sdk

Minimal voice/text agent SDK built on AI SDK with optional WebSocket transport.

Current status

Text flow works via sendText() (no WebSocket required).
WebSocket flow works when connect() is used with a running WS endpoint.
Voice streaming is not implemented yet.

Prerequisites

Node.js 20+
pnpm
OpenAI API key

Setup

Install dependencies:

pnpm install
Configure environment variables in .env:

OPENAI_API_KEY=your_openai_api_key VOICE_WS_ENDPOINT=ws://localhost:8080

Run (text-only check)

This validates model + tool calls without requiring WebSocket:

pnpm demo

Expected logs include text events and optional tool_start.

Run (WebSocket check)

Start local WS server:

pnpm ws:server
In another terminal, run demo:

pnpm demo

The demo will:

run sendText() first (text-only sanity check), then
connect to VOICE_WS_ENDPOINT if provided.

Scripts

pnpm build – build TypeScript
pnpm dev – watch TypeScript
pnpm demo – run demo client
pnpm ws:server – run local test WebSocket server

Notes

If VOICE_WS_ENDPOINT is empty, WebSocket connect is skipped.
The sample WS server sends a mock transcript message for end-to-end testing.