bijit/VoiceAgent

Fork 0

mirror of https://github.com/Bijit-Mondal/VoiceAgent.git synced 2026-03-02 10:36:37 +00:00

Go to file

Bijit Mondal c1cd705d49 voice agent works

2026-02-13 17:33:22 +05:30

example

voice agent works

2026-02-13 17:33:22 +05:30

src

voice agent works

2026-02-13 17:33:22 +05:30

.gitignore

init:

2026-02-13 17:33:22 +05:30

LICENSE

Initial commit

2026-02-13 17:32:22 +05:30

output.mp3

voice agent works

2026-02-13 17:33:22 +05:30

package.json

init:

2026-02-13 17:33:22 +05:30

pnpm-lock.yaml

init:

2026-02-13 17:33:22 +05:30

README.md

init:

2026-02-13 17:33:22 +05:30

tsconfig.json

init:

2026-02-13 17:33:22 +05:30

README.md

voice-agent-ai-sdk

Minimal voice/text agent SDK built on AI SDK with optional WebSocket transport.

Current status

Text flow works via sendText() (no WebSocket required).
WebSocket flow works when connect() is used with a running WS endpoint.
Voice streaming is not implemented yet.

Prerequisites

Node.js 20+
pnpm
OpenAI API key

Setup

Install dependencies:

pnpm install
Configure environment variables in .env:

OPENAI_API_KEY=your_openai_api_key VOICE_WS_ENDPOINT=ws://localhost:8080

Run (text-only check)

This validates model + tool calls without requiring WebSocket:

pnpm demo

Expected logs include text events and optional tool_start.

Run (WebSocket check)

Start local WS server:

pnpm ws:server
In another terminal, run demo:

pnpm demo

The demo will:

run sendText() first (text-only sanity check), then
connect to VOICE_WS_ENDPOINT if provided.

Scripts

pnpm build – build TypeScript
pnpm dev – watch TypeScript
pnpm demo – run demo client
pnpm ws:server – run local test WebSocket server

Notes

If VOICE_WS_ENDPOINT is empty, WebSocket connect is skipped.
The sample WS server sends a mock transcript message for end-to-end testing.

README.md Unescape Escape

voice-agent-ai-sdk

Current status

Prerequisites

Setup

Run (text-only check)

Run (WebSocket check)

Scripts

Notes

README.md