Voice Agent Prototype

A real-time voice AI agent using LiveKit and Google's Gemini Realtime API for natural conversation.

What It Does

- Real-time bidirectional voice conversation with AI
- Natural speech processing and response generation
- Web-based interface for easy access
- Continuous conversation flow (not just single responses)

Built With

- LiveKit - Real-time audio streaming
- Google Gemini API - AI conversation model
- Flask - Web backend for token generation
- HTML/JavaScript - Browser-based voice interface

Quick Start

Option 1: Docker (Recommended)

Prerequisites: Install Docker Desktop

Set up environment:
```
cp env.example .env
```
Edit .env with your LiveKit and Google Cloud credentials.
Run with Docker:
```
docker-compose up --build
```
Start conversation:
- Open http://localhost:5000
- Click "Join Conversation"
- Allow microphone access
- Start talking with the AI agent

Option 2: Local Development

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment:
```
cp env.example .env
```
Edit .env with your LiveKit and Google Cloud credentials.
Run the application:
```
python run_webui.py
```
Start conversation:
- Open http://localhost:5000
- Click "Join Conversation"
- Allow microphone access
- Start talking with the AI agent

Environment Variables

LIVEKIT_URL=wss://your-livekit-server.livekit.cloud
LIVEKIT_API_KEY=your_livekit_api_key
LIVEKIT_API_SECRET=your_livekit_api_secret
GOOGLE_API_KEY=your_google_api_key
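
A missing or empty variable tends to surface later as a confusing connection or authentication error, so it can help to check all four at startup. The following is a minimal sketch of such a check, assuming the variables have already been loaded into the process environment. The helper names `missing_env_vars` and `require_env` are hypothetical and do not come from this repository.

```python
import os

# The four variables listed above.
REQUIRED_VARS = (
    "LIVEKIT_URL",
    "LIVEKIT_API_KEY",
    "LIVEKIT_API_SECRET",
    "GOOGLE_API_KEY",
)


def missing_env_vars(env=None):
    """Return the required variable names that are unset or blank."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]


def require_env(env=None):
    """Raise a RuntimeError naming every missing variable at once."""
    missing = missing_env_vars(env)
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
```

Calling `require_env()` before starting the web server or the agent would report every missing name in one message rather than failing on the first one.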
