AI Document Organizer

An intelligent document organization application powered by AI, supporting both Google Gemini and OpenAI models.

Overview

The AI Document Organizer helps you automatically organize your documents by analyzing their content with advanced AI models. The application can process and categorize multiple file formats, creating a structured folder system with meaningful categories based on document content.

Project Structure

AI-Document-Organizer/
├── src/                      # All source code
│   ├── ai_analyzer.py        # Google Gemini AI integration
│   ├── openai_analyzer.py    # OpenAI integration
│   ├── ai_service_factory.py # Factory for creating AI services
│   ├── file_analyzer.py      # Document scanning and analysis
│   ├── file_organizer.py     # Document organization
│   ├── file_parser.py        # Content extraction from files
│   ├── gui.py                # User interface
│   ├── settings_manager.py   # Application settings
│   ├── media_analyzer.py     # Audio/video file analysis
│   ├── transcription_service.py # Audio transcription service
│   ├── cloud_integration.py  # Cloud storage integration
│   ├── organization_scheme.py # Organization scheme management
│   ├── templates/            # Organization templates
│   └── utils.py              # Helper utilities
├── docs/                     # Documentation
│   ├── README.md             # User guide
│   ├── QUICK_START_GUIDE.md  # Quick start guide
│   ├── DEVELOPER_GUIDE.md    # Developer documentation
│   └── ALTERNATIVE_AI_MODELS.md # AI model information
├── assets/                   # Application assets
│   └── generated-icon.png    # Application icon
├── packaging/                # Packaging-related files
│   ├── ai_document_organizer.spec # PyInstaller specification
│   ├── installer.nsi         # NSIS installer script
│   └── build_exe.py          # Build script
├── tests/                    # Test files
├── main.py                   # Main entry point
├── requirements.txt          # Dependencies
└── README.md                 # This file
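
As an illustration of the role `ai_service_factory.py` plays in this layout, here is a minimal factory sketch. The names `GeminiAnalyzer`, `OpenAIAnalyzer`, and `create_analyzer` are hypothetical and are not taken from the project's source; the stand-in classes make no network calls.

```python
from dataclasses import dataclass


@dataclass
class GeminiAnalyzer:
    """Hypothetical stand-in for the Gemini integration in ai_analyzer.py."""
    api_key: str
    provider: str = "gemini"


@dataclass
class OpenAIAnalyzer:
    """Hypothetical stand-in for the OpenAI integration in openai_analyzer.py."""
    api_key: str
    provider: str = "openai"


# Registry mapping a service name to the class that handles it.
_ANALYZERS = {"gemini": GeminiAnalyzer, "openai": OpenAIAnalyzer}


def create_analyzer(service: str, api_key: str):
    """Return an analyzer for the requested service (case-insensitive)."""
    try:
        analyzer_cls = _ANALYZERS[service.strip().lower()]
    except KeyError:
        raise ValueError(f"Unsupported AI service: {service!r}") from None
    return analyzer_cls(api_key=api_key)
```

With a registry like this, supporting another provider would only require adding one entry to the dictionary; the calling code stays unchanged.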

Features

- Smart Document Analysis: Uses Google Gemini or OpenAI models to understand document content
- Multiple AI Model Support:
  - Google Gemini models (2.0 Flash, 1.5 Flash, 1.5 Pro, etc.)
  - OpenAI models (GPT-4, GPT-4 Turbo, GPT-3.5 Turbo, etc.)
- In-App API Key Management: Enter and save API keys directly in the settings
- Model Selection: Choose from available AI models for each service
- Automatic Categorization: Creates a logical folder structure based on document topics and content
- Multi-Format Support: Works with various file types:
  - Documents: CSV, Excel, HTML, Markdown, Text, Word
  - Images: JPG, PNG, GIF, BMP, TIFF, WebP
  - Audio: MP3, WAV, FLAC, AAC, OGG, M4A
  - Video: MP4, AVI, MKV, MOV, WMV, WebM, FLV
- Content Extraction: Automatically extracts and analyzes text from all supported formats
- Media Analysis:
  - Audio file analysis (duration, bitrate, channels, etc.)
  - Video file analysis (resolution, frame rate, codecs, etc.)
  - Audio waveform generation
  - Video thumbnail generation
  - Audio transcription with multiple providers
- Cloud Storage Integration:
  - Support for Google Drive, OneDrive, and Dropbox
  - Bidirectional synchronization
  - Conflict resolution
  - Selective sync by file type
  - Bandwidth control
- Organization Schemes:
  - Import/export organization rules
  - Predefined templates for common use cases
  - Custom rule creation
  - Rule conflict detection
  - Scheme merging and validation
- Windows-Optimized: Native Windows interface with proper file handling
- Batch Processing: Processes files in configurable batches to optimize performance
- Rate Limiting Controls: Configure batch size and delay to avoid API rate limits
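
The batch size and delay settings mentioned above can be pictured with a short sketch. This is not the application's actual implementation: `batched`, `process_in_batches`, and their parameters are hypothetical names, and `analyze` stands for whatever call sends a file to the AI service.

```python
import time
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def batched(items: Iterable[T], batch_size: int) -> Iterator[List[T]]:
    """Yield consecutive lists of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    batch: List[T] = []
    for item in items:
        batch.append(item)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly shorter batch


def process_in_batches(
    items: Iterable[T],
    analyze: Callable[[T], R],
    batch_size: int = 5,
    delay_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[R]:
    """Apply analyze to every item, pausing between batches."""
    results: List[R] = []
    for index, batch in enumerate(batched(items, batch_size)):
        if index > 0:
            sleep(delay_seconds)  # pause between batches, not before the first
        results.extend(analyze(item) for item in batch)
    return results
```

A smaller batch size or a longer delay lowers the request rate at the cost of total processing time. Passing `sleep` as a parameter keeps the pause easy to replace in tests.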

Requirements

- Windows 10/11
- Python 3.8 or higher
- API key for Google Gemini or OpenAI
- FFmpeg for audio/video processing
- Optional: Cloud storage provider credentials
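
Two of these requirements can be checked from Python before the first run. The snippet below is a convenience sketch, not part of the project; `check_requirements` is a hypothetical name, and it assumes the FFmpeg executable is called `ffmpeg`.

```python
import shutil
import sys


def check_requirements(min_python=(3, 8)):
    """Return a list of human-readable problems; empty if none were found."""
    problems = []
    if sys.version_info < min_python:
        problems.append("Python %d.%d or higher is required" % min_python)
    # shutil.which searches the directories on PATH, like the shell does.
    if shutil.which("ffmpeg") is None:
        problems.append("FFmpeg was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_requirements():
        print("Missing requirement:", problem)
```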

Installation

Clone the repository:

git clone https://github.com/yourusername/ai-document-organizer.git
cd ai-document-organizer

Install dependencies:
```
pip install -r requirements.txt
```

Install FFmpeg:
- Download FFmpeg from the official FFmpeg website
- Add FFmpeg to your system PATH

Set up your API keys, either in the application settings (recommended) or as environment variables:

REM For Google Gemini API
set GOOGLE_API_KEY=your_api_key_here

REM OR for OpenAI API
set OPENAI_API_KEY=your_api_key_here

REM For cloud storage (optional)
set GOOGLE_DRIVE_CREDENTIALS=path_to_credentials.json
set ONEDRIVE_CLIENT_ID=your_client_id
set ONEDRIVE_CLIENT_SECRET=your_client_secret
set DROPBOX_APP_KEY=your_app_key
set DROPBOX_APP_SECRET=your_app_secret
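
For reference, environment variables like these are typically read in Python as sketched below. The function `find_api_key` is hypothetical, and preferring Gemini when both keys are set is an assumption of this sketch; the application may resolve keys differently.

```python
import os
from typing import Mapping, Optional, Tuple

# Variable names as listed above; the preference order is an assumption.
_KEY_VARIABLES = (("gemini", "GOOGLE_API_KEY"), ("openai", "OPENAI_API_KEY"))


def find_api_key(env: Mapping[str, str] = os.environ) -> Optional[Tuple[str, str]]:
    """Return (service, key) for the first non-empty key, or None."""
    for service, variable in _KEY_VARIABLES:
        value = env.get(variable, "").strip()
        if value:
            return service, value
    return None
```

Note that variables defined with `set` exist only in the current Command Prompt session, so `python main.py` has to be started from that same window to see them.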

Running the Application

python main.py

Documentation

Organization Templates

The application comes with predefined organization templates:

- Media Organization: Rules for organizing audio and video files by type, metadata, resolution, and duration
- Cloud Storage Sync: Configuration for synchronizing files with cloud storage providers
- Custom Templates: Create and share your own organization schemes
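
To give a feel for rule-based organization, the sketch below matches files to target folders by extension. The `Rule` structure, the folder names, and `match_rule` are all hypothetical; the real template format in `src/templates/` is not documented here and may look quite different.

```python
from dataclasses import dataclass
from pathlib import PurePath
from typing import Iterable, Optional


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule: files with one of these extensions go to target_folder."""
    name: str
    extensions: frozenset
    target_folder: str


def match_rule(filename: str, rules: Iterable[Rule]) -> Optional[Rule]:
    """Return the first rule whose extensions include the file's suffix."""
    suffix = PurePath(filename).suffix.lower()
    for rule in rules:
        if suffix in rule.extensions:
            return rule
    return None


# Example rules in the spirit of the Media Organization template.
MEDIA_RULES = [
    Rule("audio", frozenset({".mp3", ".wav", ".flac"}), "Media/Audio"),
    Rule("video", frozenset({".mp4", ".mkv", ".mov"}), "Media/Video"),
]
```

Here the first matching rule wins, so rule order acts as priority. This is one simple way to avoid ambiguity when two rules claim the same extension.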

Cloud Storage Support

Supported cloud storage providers:

- Google Drive: Full integration with Google Drive API
- OneDrive: Integration with Microsoft Graph API
- Dropbox: Integration with Dropbox API

Features:

- Bidirectional synchronization
- Selective sync by file type
- Conflict resolution
- Version control
- Bandwidth management
- Progress tracking
- Error handling and retry mechanisms
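
A retry mechanism of the kind listed above is often built on exponential backoff. The following is a generic sketch of that technique, not the code in `cloud_integration.py`; the function name `retry` and its parameters are assumptions.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

R = TypeVar("R")


def retry(
    operation: Callable[[], R],
    attempts: int = 3,
    base_delay: float = 1.0,
    retry_on: Tuple[Type[BaseException], ...] = (OSError,),
    sleep: Callable[[float], None] = time.sleep,
) -> R:
    """Call operation, retrying on the given exceptions with doubling delays."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return operation()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)  # 1x, 2x, 4x, ... base_delay
    raise AssertionError("unreachable")
```

Network errors such as `ConnectionError` and `TimeoutError` are subclasses of `OSError`, so the default `retry_on` covers them. Errors that will not succeed on retry, such as invalid credentials, should be left out of `retry_on` so they fail immediately.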

Packaging for Distribution

To create a standalone Windows executable and installer, see the Packaging Guide.

License

MIT License - See LICENSE.txt for details.
