AI Video Generator

Version: 3.3 | Last Updated: 2025-12-06

An automated video creation tool that transforms a simple topic into a complete, share-ready video with an AI-generated script, professional voiceover, relevant visuals, and an eye-catching thumbnail.

Overview

The AI Video Generator transforms video content creation from a multi-hour, multi-tool process into a streamlined 20-minute workflow. It's designed for content creators who value production speed and quality but lack the time, budget, or technical skills for traditional video production.

The key differentiator is a local-first, privacy-focused architecture. Unlike cloud-dependent solutions, the system runs primarily on your own hardware using free and open-source software (FOSS) such as Ollama and a local TTS engine. For higher-quality output, it can optionally use cloud services such as Google Gemini and ElevenLabs. This hybrid "FOSS-first, cloud-enhanced" approach carries no mandatory subscription costs.
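
To make the "FOSS-first, cloud-enhanced" idea concrete, here is a minimal sketch of how a provider choice could be made. It is an illustration only: the type names, the `preferCloud` flag, and the `chooseProviders` function are hypothetical and are not taken from the project's source.

```typescript
// Hypothetical names throughout: none of these types or functions come from the project.
type LlmProvider = "ollama" | "gemini";
type TtsProvider = "kokoro" | "elevenlabs";

interface ProviderPrefs {
  preferCloud: boolean;       // assumed user opt-in for cloud-enhanced output
  geminiApiKey?: string;      // assumed to be read from an env file
  elevenLabsApiKey?: string;  // assumed to be read from an env file
}

interface ProviderChoice {
  llm: LlmProvider;
  tts: TtsProvider;
}

// Local engines are the default. A cloud service is chosen only when the
// user has opted in AND a non-empty key for that specific service is present.
function chooseProviders(prefs: ProviderPrefs): ProviderChoice {
  const hasKey = (key?: string): boolean =>
    typeof key === "string" && key.trim().length > 0;

  return {
    llm: prefs.preferCloud && hasKey(prefs.geminiApiKey) ? "gemini" : "ollama",
    tts: prefs.preferCloud && hasKey(prefs.elevenLabsApiKey) ? "elevenlabs" : "kokoro",
  };
}
```

With this shape, a missing or blank API key never blocks a run: the corresponding step simply stays on the local engine.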

Core Features (Complete)

Conversational AI Agent: Brainstorm and refine video ideas through a natural chat interface.
Automated Script Generation: Get a professional, human-sounding script divided into logical scenes.
LLM Configuration & Script Personas: Choose between local (Ollama) or cloud (Gemini) models and select from personas like 'Scientific Analyst' or 'Blackpill Realist' to define your content's tone.
Voice Selection & Synthesis: Select from a diverse catalog of local (FOSS) or cloud (ElevenLabs) voices to narrate your script.
AI-Powered Visual Sourcing: Automatically finds relevant B-roll from YouTube, using Google Cloud Vision to filter out talking heads, captions, and irrelevant content.
Visual Curation UI: Review and select the perfect video clip for each scene from AI-powered suggestions.
Automated Video Assembly: Combines your selected visuals and voiceover into a final MP4 video using FFmpeg.
Automated Thumbnail Generation: Instantly get a compelling thumbnail with a relevant background and your video's title.
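
As an illustration of the assembly step, the sketch below builds an FFmpeg argument list that pairs one selected clip with one voiceover file. The `SceneInput` type and `buildMuxArgs` function are hypothetical; the project's actual FFmpeg invocation may differ (for example, it may re-encode video or concatenate several scenes).

```typescript
// Hypothetical helper: the project's real assembly code is not shown in this README.
interface SceneInput {
  videoPath: string;   // selected B-roll clip
  audioPath: string;   // generated voiceover
  outputPath: string;  // resulting MP4 file
}

function buildMuxArgs(scene: SceneInput): string[] {
  return [
    "-y",                    // overwrite the output file if it already exists
    "-i", scene.videoPath,   // input 0: the clip
    "-i", scene.audioPath,   // input 1: the voiceover
    "-map", "0:v:0",         // take the video stream from the clip
    "-map", "1:a:0",         // take the audio stream from the voiceover only
    "-c:v", "copy",          // keep the video stream without re-encoding
    "-c:a", "aac",           // encode the audio as AAC for the MP4 container
    "-shortest",             // stop at the end of the shorter stream
    scene.outputPath,
  ];
}
```

The returned array can be passed to `spawn("ffmpeg", args)` from `node:child_process`. Copying the video stream assumes the clip's codec is already MP4-compatible; otherwise `-c:v copy` would need to be replaced with an encoder.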

Enhancement Features (In Development)

Automated Background Music: Automatically selects, mixes, and applies topic-appropriate background music.
AI-Generated SEO Toolkit: A VidIQ-style command center that generates optimized titles, descriptions, and tags, plus keyword research and pre-upload SEO audits.
Automate Mode: A full, one-click automation pipeline from topic confirmation to final export.
ElevenLabs TTS Integration: Use premium cloud-based voices from ElevenLabs as an alternative to the local TTS engine.
Unified API Usage Dashboard: A single dashboard to monitor your usage and quotas for all integrated APIs (Gemini, YouTube, ElevenLabs).

Technology Stack

Frontend & Framework

Next.js 15 - React framework with App Router
TypeScript - Type-safe development
Tailwind CSS v4 - Utility-first styling
shadcn/ui - Accessible component library

AI & Data Processing

LLM: Ollama (Llama 3.2) for local processing, Google Gemini for cloud-enhanced generation.
TTS: KokoroTTS (local, FOSS), ElevenLabs (cloud).
Visuals: YouTube Data API for sourcing, Google Cloud Vision API for filtering.
Audio/Video: FFmpeg for assembly, yt-dlp for downloading.
Databases: SQLite for project data, ChromaDB/LanceDB for future RAG capabilities.
State Management: Zustand for lightweight client state.
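
For orientation, a local script request to Ollama could be shaped roughly as follows. This sketch targets Ollama's `/api/generate` endpoint on its default port (11434); the function names, the prompt wording, and the use of a persona as the `system` prompt are assumptions for illustration, not the project's actual implementation.

```typescript
// Hypothetical helpers: names and prompt text are illustrative only.
interface OllamaRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

function buildScriptRequest(
  topic: string,
  personaPrompt: string,                // e.g. instructions for a chosen persona (assumed)
  model = "llama3.2",
  baseUrl = "http://localhost:11434",   // Ollama's default local address
): OllamaRequest {
  return {
    url: `${baseUrl}/api/generate`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({
        model,
        system: personaPrompt,
        prompt: `Write a short video script about: ${topic}`,
        stream: false,                  // ask for a single JSON response
      }),
    },
  };
}

// With stream=false, the generated text is returned in the `response` field.
function extractScript(responseJson: unknown): string {
  if (typeof responseJson === "object" && responseJson !== null && "response" in responseJson) {
    const text = (responseJson as { response: unknown }).response;
    if (typeof text === "string") return text.trim();
  }
  throw new Error("Unexpected Ollama response shape");
}
```

A caller would pass `url` and `init` to `fetch`, parse the body as JSON, and hand the result to `extractScript`.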

Prerequisites

Required

Node.js 18+
Python 3.9+
uv package manager (for Python dependencies)
Ollama (with a pulled model, e.g., llama3.2)
FFmpeg 7.1+
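
The version requirements above can be checked programmatically. The sketch below is a hypothetical helper, not part of the project: it compares the text printed by a tool's version command against the minimums listed here. It only understands plain `major.minor` version strings, so nightly or custom FFmpeg builds with date-based version labels would need separate handling.

```typescript
// Minimum versions copied from the prerequisites list; helper names are hypothetical.
const MINIMUMS: Record<string, [number, number]> = {
  node: [18, 0],
  python: [3, 9],
  ffmpeg: [7, 1],
};

// Extracts the first "major.minor" pair from typical version output,
// e.g. "v18.17.0", "Python 3.11.4", or "ffmpeg version 7.1 Copyright ...".
function parseVersion(output: string): [number, number] | null {
  const match = output.match(/(\d+)\.(\d+)/);
  return match ? [Number(match[1]), Number(match[2])] : null;
}

// Returns false for unknown tools and for output with no recognizable version.
function meetsMinimum(tool: string, output: string): boolean {
  const min = MINIMUMS[tool];
  const found = parseVersion(output);
  if (!min || !found) return false;
  const [major, minor] = found;
  return major > min[0] || (major === min[0] && minor >= min[1]);
}
```

For example, the output of `node --version`, `python --version`, and `ffmpeg -version` could each be passed through `meetsMinimum` before starting the development server.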

Setup

# Clone repository
git clone https://github.com/AIfriendly/AIvideogen.git
cd AIvideogen/ai-video-generator

# Install Node.js dependencies
npm install

# Install Python dependencies (from parent directory)
cd ..
uv pip install -r requirements.txt
cd ai-video-generator

# Start development server
npm run dev

Open http://localhost:3000 to use the application.

Future Enhancements

Planned additions include:

Domain-Specific Video Sources: Adding official sources like DVIDS for military footage.
Local Computer Vision: A FOSS alternative to Google Vision using MediaPipe and Tesseract.js for zero-cost, private analysis.
RAG-Powered Channel Intelligence: A VidIQ-style system to analyze competitors, monitor trends, and generate scripts informed by your specific niche.

License

MIT License - See LICENSE file for details.
