glowing-telegram

A comprehensive stream and video management platform with real-time capabilities.

This is a full-featured platform for managing live streams and recordings, with real-time Twitch chat integration, AI-powered transcription and analysis, semantic search, stream widgets for OBS, and automated video processing pipelines. The system ingests stream recordings into a database, provides a web interface for searching and analyzing content, and includes WebSocket APIs for real-time UI updates.

I'm developing this tool live on Twitch, so why not come check it out sometime? The project started as a way to practice my Rust skills and to automate video processing tasks that would take hours to do manually, and it has since evolved from a simple recording manager into a comprehensive streaming platform with AI integration, real-time features, and semantic search capabilities.

Features

Stream Management

  1. Track locally recorded clips from a stream with comprehensive metadata
  2. Generate "episodes" from streams based on speech detection and silence analysis
  3. Archive stream videos to cloud storage (AWS S3)
  4. Render episodes directly from selected video segments with automated processing

AI-Powered Analysis

  1. Audio Transcription - Automated transcription using HuggingFace Whisper models with silence detection
  2. Semantic Search - Vector embeddings with pgvector for intelligent content discovery
  3. AI Summaries - Automatic episode summaries via GPT-4
  4. Embedding Service - Aurora Serverless v2 with pgvector for similarity search

Real-Time Features

  1. Twitch Chat Integration - Capture chat messages with EventSub webhooks
  2. WebSocket API - Real-time UI updates and bidirectional communication
  3. Stream Widgets - Dynamic OBS browser sources (countdown timers, overlays, etc.)
  4. Live Processing - Real-time chat processing and message storage

Video Processing

  1. Review interface for transcriptions and episode editing
  2. Select and arrange video segments for episode creation
  3. In-app video rendering with customizable CutLists
  4. Automatic chapter marker generation
  5. Silence detection for improved transcription accuracy
  6. Audio remixing capabilities with track selection and filtering

Platform Integration

  1. YouTube Upload - Automated upload of rendered episodes with metadata
  2. Twitch EventSub - Webhook-based event processing
  3. CloudFront CDN - Dynamic content delivery with versioning

Architecture

Multi-Repository Structure

The platform is divided into specialized repositories:

  1. glowing-telegram - Backend services and infrastructure (this repository)
  2. glowing-telegram-frontend - React-based web interface
  3. glowing-telegram-video-editor - React component for video review and episode generation

System Architecture

API Layer:

  • HTTP API - RESTful CRUD operations with Cognito authentication
  • WebSocket API - Real-time bidirectional communication
  • EventSub Webhooks - Twitch event processing

Processing Layer:

  • Lambda Functions - Serverless API handlers and event processors
  • AWS Batch - GPU-accelerated transcription and video processing
  • SQS Queues - Asynchronous message processing
  • DynamoDB Streams - Change data capture for real-time updates

Storage Layer:

  • DynamoDB - Primary datastore for streams, episodes, widgets, chat
  • Aurora PostgreSQL - Vector embeddings with pgvector
  • S3 - Video files, audio tracks, transcripts, and assets
  • EFS - ML model caching for HuggingFace transformers

Delivery Layer:

  • CloudFront - CDN for frontend assets with dynamic origin updates
  • API Gateway - HTTP and WebSocket endpoint management

Recent Architectural Improvements

Performance Optimizations:

  • CloudFront invalidation moved to run immediately after video ingestion (not after transcription)
  • Silence detection integration reduces transcription time and improves accuracy
  • EFS model caching eliminates repeated downloads (~3GB per job)
  • DynamoDB streams trigger real-time WebSocket broadcasts
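
As a sketch of that last pattern (the endpoint variable and connection lookup below are illustrative placeholders; the real handler lives in websocket_lambda), a stream-triggered Lambda fans records out to WebSocket connections roughly like this:

import {
  ApiGatewayManagementApiClient,
  PostToConnectionCommand,
} from "@aws-sdk/client-apigatewaymanagementapi";
import type { DynamoDBStreamEvent } from "aws-lambda";

// Hypothetical sketch: WEBSOCKET_API_ENDPOINT and lookupConnections are
// illustrative names, not taken from this repository.
const client = new ApiGatewayManagementApiClient({
  endpoint: process.env.WEBSOCKET_API_ENDPOINT,
});

export async function handler(event: DynamoDBStreamEvent): Promise<void> {
  for (const record of event.Records) {
    const image = record.dynamodb?.NewImage;
    if (!image) continue;
    for (const connectionId of await lookupConnections(image)) {
      // Push the change to each subscribed WebSocket connection.
      await client.send(
        new PostToConnectionCommand({
          ConnectionId: connectionId,
          Data: Buffer.from(JSON.stringify({ type: "WIDGET_STATE_UPDATE", image })),
        }),
      );
    }
  }
}

// Placeholder: the real implementation would query the connections table.
async function lookupConnections(_image: unknown): Promise<string[]> {
  return [];
}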

Reliability Enhancements:

  • 10-minute timeout protection for Whisper processes
  • 1KB minimum audio size threshold to skip empty files
  • Consistent CloudWatch logging with /glowing-telegram/* prefix
  • HMAC signature verification for Twitch EventSub webhooks
  • User-scoped CRUD API with automatic user_id injection
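
The user-scoping idea in that last item is straightforward: the caller's Cognito subject is injected server-side as the partition key, so a request can never touch another user's rows. A hypothetical TypeScript sketch of the concept (the actual crud_api is written in Rust):

import type { APIGatewayProxyEventV2WithJWTAuthorizer } from "aws-lambda";

// Illustrative only: field access follows the API Gateway JWT authorizer
// event shape; the real service implements this in Rust.
export function scopedKey(
  event: APIGatewayProxyEventV2WithJWTAuthorizer,
  id: string,
): { user_id: string; id: string } {
  // user_id comes from the verified JWT, never from the request body.
  const userId = event.requestContext.authorizer.jwt.claims.sub as string;
  return { user_id: userId, id };
}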

Scalability Improvements:

  • Aurora Serverless v2 auto-scales from 0.5 to 4 ACU
  • AWS Batch with spot instances for cost-effective GPU compute
  • SQS-based chat processing decouples ingestion from storage
  • WebSocket connection pooling for real-time features

This repository is organized into the following directories:

Lambda Functions (Containerized)

  1. ai_chat_lambda - OpenAI API wrapper for chat completion
  2. chat_processor_lambda - Processes Twitch chat messages from SQS queue
  3. crud_api - RESTful API for DynamoDB CRUD operations with user-scoped access control
  4. media_lambda - Handles media-related operations
  5. summarize_transcription - Episode summarization using OpenAI's API
  6. twitch_lambda - Twitch authentication and EventSub webhook handling
  7. websocket_lambda - WebSocket API for real-time features (Docker-based)
  8. youtube_uploader_lambda - YouTube upload automation (Python/Docker)

Batch Processing Services

  1. audio_transcriber - Transcribes audio using HuggingFace Whisper transformers with EFS model caching
  2. embedding_service - Generates vector embeddings and stores them in Aurora PostgreSQL with pgvector
  3. render_job - Video rendering pipeline
  4. upload_video - Video upload processing
  5. video_ingestor - Analyzes videos for silence detection, extracts audio tracks and keyframes

Shared Libraries

  1. gt_app - Common application utilities
  2. gt_axum - Shared Axum web framework components
  3. gt_ffmpeg - FFmpeg interaction library
  4. gt_secrets - AWS Secrets Manager integration
  5. types - Shared types generated from JSON schemas (used by both backend and frontend)

Infrastructure & Tooling

  1. cdk - AWS CDK infrastructure as code (TypeScript)
  2. docs - JSON schemas, ER diagrams, and workflow documentation
  3. scripts - Deployment scripts, data migration tools, S3 import utilities

Key Technologies

Infrastructure

  • AWS Lambda - Serverless compute for API handlers and processing
  • AWS Batch - GPU-accelerated transcription and video processing
  • Aurora Serverless v2 - PostgreSQL with pgvector for semantic search
  • DynamoDB - NoSQL database for streams, episodes, chat messages, and widgets
  • S3 - Object storage for videos, audio, and assets
  • CloudFront - CDN with dynamic origin updates
  • EFS - Model caching for HuggingFace transformers
  • API Gateway - HTTP and WebSocket APIs
  • EventBridge - Event-driven processing

AI & ML

  • HuggingFace Transformers - Whisper large-v3 for audio transcription
  • OpenAI GPT-4 - Episode summarization and chat completion
  • OpenAI Embeddings - text-embedding-3-small for semantic search
  • pgvector - Vector similarity search in PostgreSQL

Development Stack

  • Rust - Primary backend language for performance-critical services
  • TypeScript - CDK infrastructure and type definitions
  • Python - Lambda functions for YouTube and media operations
  • Docker - Containerized deployments for all services

Database Schema

DynamoDB Tables

  • streams - Stream metadata and configurations
  • video_clips - Individual video segments with timestamps
  • episodes - Generated episodes with transcriptions and summaries
  • projects - Project groupings for content organization
  • chat_messages - Twitch chat messages with TTL (30 days; see the sketch after this list)
  • stream_widgets - Widget configurations and state (with GSIs for user_id, access_token, and type)
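
The 30-day TTL on chat_messages works by storing an expiry time, as epoch seconds, in a designated attribute that DynamoDB watches. A sketch of writing a message with that TTL (the item shape and attribute name are assumptions, not the repository's schema):

import { DynamoDBClient } from "@aws-sdk/client-dynamodb";
import { DynamoDBDocumentClient, PutCommand } from "@aws-sdk/lib-dynamodb";

const doc = DynamoDBDocumentClient.from(new DynamoDBClient({}));

export async function storeChatMessage(message: { id: string; text: string }) {
  const thirtyDaysInSeconds = 30 * 24 * 60 * 60;
  await doc.send(
    new PutCommand({
      TableName: "chat_messages",
      Item: {
        ...message,
        // DynamoDB TTL expects an epoch-seconds number in the TTL attribute.
        ttl: Math.floor(Date.now() / 1000) + thirtyDaysInSeconds,
      },
    }),
  );
}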

Aurora PostgreSQL

  • embeddings - Vector embeddings with pgvector extension for semantic search

Real-Time Features

WebSocket API

The WebSocket API provides bidirectional communication for real-time UI updates.

Authentication Methods:

  • Cognito JWT - Full access to all of the user's widgets, including executing actions
  • Widget Token - Read-only access to single widget (for OBS browser sources)

Message Types:

  • WIDGET_SUBSCRIBE - Subscribe to widget updates
  • WIDGET_UNSUBSCRIBE - Unsubscribe from widget
  • WIDGET_ACTION - Execute widget action (authenticated users only)
  • WIDGET_INITIAL_STATE - Initial widget state on subscription
  • WIDGET_CONFIG_UPDATE - Configuration change broadcast
  • WIDGET_STATE_UPDATE - State change broadcast
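
As a hypothetical client sketch (the endpoint URL and payload fields beyond type are assumptions; the authoritative message schemas live in the shared types package), a browser source might subscribe like this:

// Illustrative sketch only; field names are assumed.
const ws = new WebSocket("wss://your-api-id.execute-api.us-west-2.amazonaws.com/prod");

ws.addEventListener("open", () => {
  // Subscribe to one widget's updates after connecting.
  ws.send(JSON.stringify({ type: "WIDGET_SUBSCRIBE", widgetId: "uuid" }));
});

ws.addEventListener("message", (event) => {
  const message = JSON.parse(event.data);
  switch (message.type) {
    case "WIDGET_INITIAL_STATE":
    case "WIDGET_CONFIG_UPDATE":
    case "WIDGET_STATE_UPDATE":
      render(message); // application-specific rendering
      break;
  }
});

function render(message: unknown): void {
  // Placeholder for widget-specific drawing logic.
  console.log(message);
}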

Stream Widgets

Stream widgets are synchronized UI components for OBS browser sources and web interfaces.

Widget Types:

  • Countdown timers
  • Text overlays
  • Custom interactive elements

Widget Configuration:

{
  "id": "uuid",
  "title": "Countdown Timer",
  "type": "countdown",
  "access_token": "uuid-for-obs-auth",
  "config": {
    "duration": 300,
    "text": "Starting soon",
    "title": "Stream Starting"
  },
  "state": {
    "duration_left": 300,
    "enabled": false,
    "last_tick_timestamp": "2025-11-22T10:00:00Z"
  }
}
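
A sketch of how a client might derive the displayed time from that state (assuming last_tick_timestamp marks when duration_left was last updated and that the timer only counts down while enabled):

interface CountdownState {
  duration_left: number; // seconds remaining at the last tick
  enabled: boolean;
  last_tick_timestamp: string; // ISO 8601
}

function remainingSeconds(state: CountdownState, now: Date = new Date()): number {
  // Assumption: a disabled timer is paused at duration_left.
  if (!state.enabled) return state.duration_left;
  const elapsed = (now.getTime() - Date.parse(state.last_tick_timestamp)) / 1000;
  return Math.max(0, state.duration_left - elapsed);
}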

Usage in OBS:

  1. Create widget via web interface
  2. Copy widget access URL (includes token)
  3. Add browser source in OBS
  4. Widget updates in real-time via WebSocket

Twitch Chat Integration

Real-time chat message capture using Twitch EventSub webhooks.

Features:

  • EventSub webhook subscription management
  • Message validation with HMAC signature verification
  • SQS-based message processing pipeline
  • Chat message storage with 30-day TTL
  • Automatic subscription status tracking

Architecture:

  1. Twitch sends EventSub webhook to API Gateway
  2. twitch_lambda validates the signature (see the sketch after this list) and queues the message to SQS
  3. chat_processor_lambda processes messages and stores in DynamoDB
  4. Messages expire after 30 days via DynamoDB TTL
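
The signature check in step 2 follows Twitch's documented EventSub scheme: an HMAC-SHA256 over the message ID, timestamp, and raw body, compared against the Twitch-Eventsub-Message-Signature header. A TypeScript sketch of that check (the actual twitch_lambda is written in Rust):

import { createHmac, timingSafeEqual } from "node:crypto";

// Twitch sends the message ID, timestamp, and signature in the
// Twitch-Eventsub-Message-* headers; the signature is "sha256=<hex>".
function verifyEventSubSignature(
  secret: string,
  messageId: string,
  timestamp: string,
  rawBody: string,
  signatureHeader: string,
): boolean {
  const expected =
    "sha256=" +
    createHmac("sha256", secret).update(messageId + timestamp + rawBody).digest("hex");
  const a = Buffer.from(expected);
  const b = Buffer.from(signatureHeader);
  // Constant-time comparison to avoid timing side channels.
  return a.length === b.length && timingSafeEqual(a, b);
}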

Development

Deployment

The project supports multi-environment deployments with separate infrastructure for dev, staging, and production.

Environments

  • Production: Main production environment for live users (deployed via releases)
  • Dev: Development environment for testing features and changes

Each environment has isolated infrastructure including separate stacks, S3 buckets, DynamoDB tables, Lambda functions, and other AWS resources. See Initial Setup Guide for first-time deployment and Normal Operation Guide for day-to-day workflows.

Production Deployment (Automated via Releases)

Production deployments are automated when you publish a release on GitHub:

  1. Create a release on GitHub:

    • Go to the GitHub repository
    • Click "Releases" → "Create a new release"
    • Create a new tag (e.g., v1.2.3)
    • Publish the release
  2. Automated deployment happens:

    • GitHub Actions triggers the deploy.yml workflow
    • Docker images are built and pushed to ECR with the release tag
    • CDK deployment automatically updates production infrastructure with the new image version
    • All services are updated to use the new images

Details:

  • Trigger: GitHub release events (when published)
  • Registry: Amazon ECR (159222827421.dkr.ecr.us-west-2.amazonaws.com)
  • Tagging: Uses the git tag from the release (e.g., v1.2.3)
  • Environment: Production (uses default stack names without suffix)

Development/Staging Deployment (Manual Workflow)

To deploy a specific branch to dev or staging:

  1. Go to GitHub repository → Actions → "Deploy to Environment"
  2. Click "Run workflow"
  3. Select:
    • Environment: dev or staging
    • Branch: The branch to deploy (e.g., feature/my-feature, main)
    • Image tag (optional): Existing Docker image tag, or leave empty to build from branch
  4. Click "Run workflow"

The workflow will:

  • Build Docker images from the specified branch (if no image tag provided)
  • Tag images as {env}-{branch}-{timestamp} (e.g., dev-feature-auth-20231201-143022)
  • Deploy infrastructure stacks with environment suffix (e.g., AppStack-dev, FrontendStack-staging)

Workflow: .github/workflows/deploy-environment.yml

Docker Images

Available Services

All services are built as container images:

  • ai-chat-lambda - OpenAI chat completion wrapper
  • audio-transcription - HuggingFace Whisper-based transcription
  • chat-processor-lambda - Twitch chat message processing
  • crud-api - DynamoDB CRUD API with user scoping
  • embedding-service - Vector embeddings with Aurora/pgvector
  • media-lambda - Media operations handler
  • render-job - Video rendering pipeline
  • summarize-transcription - AI-powered summarization
  • twitch-lambda - Twitch EventSub integration
  • upload-video - Video upload processing
  • video-ingestor - Video analysis and silence detection
  • websocket-lambda - WebSocket API for real-time features
  • youtube-lambda - YouTube upload automation

Local Development

To build locally:

# Build all images with latest tag
docker buildx bake -f docker-bake.hcl all

# Build all images with custom version
IMAGE_TAG=v1.2.3 docker buildx bake -f docker-bake.hcl -f docker-bake.override.hcl all

# Build a specific image
docker buildx bake -f docker-bake.hcl crud_api

Manual CDK Deployment

For development or manual deployment, the CDK can be deployed with a specific environment and image version:

cd cdk

# Install dependencies
npm ci

# Build CDK code
npm run build

# Deploy to dev environment
ENVIRONMENT=dev IMAGE_VERSION=latest npm run cdk deploy --all

# Deploy to staging environment
ENVIRONMENT=staging IMAGE_VERSION=v1.2.3 npm run cdk deploy --all

# Deploy to production environment (default)
IMAGE_VERSION=v1.2.3 npm run cdk deploy --all

# Deploy specific stack to dev
ENVIRONMENT=dev IMAGE_VERSION=latest npm run cdk deploy AppStack-dev

Environment Configuration: Environments are defined in cdk/config/environments.json. Each environment specifies AWS account, region, and default frontend version.

Note: While manual CDK deployment is possible, deployments are normally performed via automated GitHub Actions workflows for consistency and to ensure proper environment configuration. See Initial Setup Guide for first-time deployment and Normal Operation Guide for deployment workflows.

Testing

Integration Tests

The project includes integration tests for various services. Use the run_integration_tests.sh script in the root directory to run tests for any service:

# Run integration tests for audio_transcriber
./run_integration_tests.sh audio_transcriber

# Run tests with verbose output
./run_integration_tests.sh audio_transcriber --verbose

# Build Docker image and run tests
./run_integration_tests.sh audio_transcriber --build

# Run tests without cleanup (for debugging)
./run_integration_tests.sh audio_transcriber --no-cleanup

# Run tests for other services
./run_integration_tests.sh video_ingestor
./run_integration_tests.sh embedding_service

The script automatically detects the service type (Rust, Node.js, Python) and runs the appropriate test commands. It also handles Docker image building and provides extensive configuration options.

For more information about available options:

./run_integration_tests.sh --help

Unit Tests

For Rust services, run unit tests with:

# Run all tests in workspace
cargo test --workspace

# Run tests for specific service
cd <service_directory>
cargo test

Audio Transcription

The audio_transcriber service uses HuggingFace Whisper transformers for speech-to-text.

Model: openai/whisper-large-v3

Features:

  • Silence detection integration to skip non-speech segments
  • 1KB minimum audio size threshold
  • 10-minute timeout protection
  • EFS-based model caching across AWS Batch jobs
  • GPU acceleration (g4dn instances)

EFS Model Caching:

Models are cached on EFS to avoid downloading on every job:

  • First run: Downloads model (~3GB), takes ~5-10 minutes
  • Subsequent runs: Uses cached model, much faster
  • Cache location: /mnt/efs/huggingface/ (mounted via EFS)
  • Environment: HF_HOME=/mnt/efs/huggingface

Troubleshooting:

Job hangs during model download:

# Check EFS mount in AWS Console
# Verify mount target exists in job's subnet
# Check security group allows NFS (port 2049)

Corrupted model cache:

# Delete cached model via AWS EFS Console or:
# 1. Start EC2 instance in same VPC
# 2. Mount EFS: sudo mount -t nfs4 <efs-dns>:/ /mnt/efs
# 3. Remove model: rm -rf /mnt/efs/huggingface/hub/models--openai--whisper-large-v3

Force model re-download:

# Set environment variable in batch job definition:
# HF_HUB_OFFLINE=0

Semantic Search with Embeddings

The embedding_service generates vector embeddings for semantic search capabilities.

Infrastructure:

  • Aurora Serverless v2 PostgreSQL with pgvector extension
  • Minimum capacity: 0.5 ACU, Maximum: 4 ACU
  • Automatic scaling based on load
  • VPC-isolated database cluster

Features:

  • Text embedding generation using OpenAI text-embedding-3-small (sketched below)
  • Vector similarity search with pgvector
  • Automatic embedding updates via DynamoDB streams
  • Integration tests with testcontainers
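
A sketch of the embedding call using the official OpenAI Node SDK (the embedding_service itself is a Rust batch job, so this only illustrates the API request it makes):

import OpenAI from "openai";

const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

export async function embed(text: string): Promise<number[]> {
  const response = await openai.embeddings.create({
    model: "text-embedding-3-small",
    input: text,
  });
  return response.data[0].embedding;
}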

Usage:

  1. Episodes/transcripts automatically generate embeddings
  2. Embeddings stored in Aurora with pgvector
  3. Search via vector similarity queries
  4. Results ranked by cosine similarity
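
As an illustrative query sketch (the column names are assumed, not taken from the repository's schema): pgvector's <=> operator computes cosine distance, so ordering by it ascending ranks rows by cosine similarity, most similar first.

import { Client } from "pg";

export async function searchSimilar(queryEmbedding: number[], limit = 10) {
  const client = new Client({ connectionString: process.env.DATABASE_URL });
  await client.connect();
  try {
    const { rows } = await client.query(
      `SELECT episode_id, content, embedding <=> $1::vector AS distance
         FROM embeddings
        ORDER BY embedding <=> $1::vector
        LIMIT $2`,
      // pgvector accepts the "[0.1,0.2,...]" text form, which JSON.stringify produces.
      [JSON.stringify(queryEmbedding), limit],
    );
    return rows;
  } finally {
    await client.end();
  }
}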

CloudWatch Logging

All logs use consistent naming conventions:

  • /glowing-telegram/lambda/* - Lambda functions
  • /glowing-telegram/apigateway/* - API Gateway access logs
  • /glowing-telegram/stepfunctions/* - Step Functions
  • /glowing-telegram/batch/* - Batch jobs

Retention: 1 week for all log groups
Removal Policy: Destroy on stack deletion

Environment Variables

Services are configured via environment variables injected by CDK:

Common Variables:

  • AWS_REGION - AWS region (us-west-2)
  • DYNAMODB_TABLE - Main DynamoDB table name
  • INPUT_BUCKET - S3 bucket for input files
  • OUTPUT_BUCKET - S3 bucket for processed output

Service-Specific:

audio_transcriber:

  • HF_HOME=/mnt/efs/huggingface - HuggingFace model cache location
  • DEVICE=cuda - Use GPU acceleration (auto-detected)

embedding_service:

  • DATABASE_URL - Aurora PostgreSQL connection string
  • OPENAI_API_KEY - Retrieved from AWS Secrets Manager

twitch_lambda:

  • EVENTSUB_SECRET - Twitch EventSub webhook secret
  • CHAT_QUEUE_URL - SQS queue for chat messages

websocket_lambda:

  • STREAM_WIDGETS_TABLE - DynamoDB table for widgets
  • CONNECTIONS_TABLE - WebSocket connection tracking

youtube_uploader_lambda:

  • YOUTUBE_SECRETS_BASE_PATH - Base path for user YouTube credentials in Secrets Manager

Configuration Files

CDK Configuration:

  • cdk/config/version.json - Frontend version for CloudFront
  • cdk/cdk.context.json - CDK context values

Type Definitions:

  • types/src/types.rs - Rust types from JSON schemas
  • types/src/types.ts - TypeScript types from JSON schemas
  • Generated via ./types/import.sh

Troubleshooting

EFS Issues

Mount target not found:

  • Verify EFS mount target exists in job's availability zone
  • Check VPC subnet configuration
  • Ensure security groups allow NFS traffic

Permission denied on EFS:

  • Check EFS access point permissions
  • Verify IAM role has required EFS permissions
  • Ensure POSIX permissions are correctly set

Aurora Serverless

Connection timeout:

  • Verify security group allows PostgreSQL (port 5432)
  • Check VPC configuration and routing
  • Ensure Lambda/Batch is in same VPC as Aurora

Cold start issues:

  • Aurora Serverless v2 can pause when idle
  • First query may take 10-30 seconds
  • Consider setting minimum capacity higher for production

WebSocket API

Authentication failures:

  • Verify Cognito JWT is valid and not expired
  • Check widget access token matches database
  • Ensure authorizer Lambda has proper permissions

Messages not received:

  • Check CloudWatch logs for WebSocket Lambda
  • Verify DynamoDB stream is enabled and connected
  • Check connection ID is still valid (connections expire)

Contributing

Adding a New Container Service

When adding a new container-based service to the project, you must update all of the following files to ensure proper deployment:

  1. Dockerfile - Add a new build stage for your service
  2. docker-bake.hcl - Add a target for your service in the appropriate batch group
  3. docker-bake.override.hcl - Add a target with ${IMAGE_TAG} variable for release tagging
  4. cdk/lib/repoStack.ts - Add the repository name to the names array in RepoConstruct
  5. scripts/push_image.sh - Add a case statement for individual deployments (if needed)
  6. README.md - Add the service to the "Available Services" list

Example: Adding a service called my-service

// cdk/lib/repoStack.ts
new RepoConstruct(this, 'RepoConstruct', {
  namespace: 'glowing-telegram',
  names: [
    'ai-chat-lambda',
    'audio-transcription',
    'chat-processor-lambda',
    'crud-lambda',
    'embedding-service',
    'media-lambda',
    'render-job',
    'summarize-transcription-lambda',
    'twitch-lambda',
    'upload-video',
    'video-ingestor',
    'websocket-lambda',
    'youtube-lambda',
    'youtube-uploader-lambda',
    'my-service',  // Add your service here
  ],
});

# docker-bake.hcl
target "my_service" {
  dockerfile = "Dockerfile"
  context = "."
  target = "my_service"
  tags = ["159222827421.dkr.ecr.us-west-2.amazonaws.com/glowing-telegram/my-service:latest"]
}

# docker-bake.override.hcl
target "my_service" {
  tags = [
    "159222827421.dkr.ecr.us-west-2.amazonaws.com/glowing-telegram/my-service:${IMAGE_TAG}"
  ]
}

# scripts/push_image.sh
case $SERVICE in
  # ... other services ...
  my-service)
    docker buildx bake -f docker-bake.hcl -f docker-bake.override.hcl my_service --push
    ;;
esac

Why all these files?

  • Dockerfile - Multi-stage build definition for your service
  • docker-bake.hcl - Defines how to build the image locally
  • docker-bake.override.hcl - Enables versioned tagging for releases
  • cdk/lib/repoStack.ts - Creates the ECR repository in AWS
  • scripts/push_image.sh - Allows manual deployment of individual services
  • README.md - Documents the service in "Available Services" list

Deployment Flow:

  1. Release published on GitHub triggers automated workflow
  2. Docker images built and pushed to ECR with release tag
  3. CDK deployment updates infrastructure with new image versions
  4. Services automatically updated with new images

Important Notes:

  • All Lambda functions must use containerized deployments
  • Python lambdas should use the Python 3 runtime base
  • Rust services compile in Docker with cargo-lambda
  • GPU services (transcription) use g4dn instances in AWS Batch

If you're interested in contributing beyond adding services, please reach out to me on Twitch or Bluesky.

Version History

For a complete version history, see Releases.
