Note: GenerALL.AI refers to a "General Purpose AI", not a military general.
GALL.AI (General AI) is a multimodal agent system with a Telegram bot interface, powered by AI models such as Claude 3.7 Sonnet and GPT-4o. It lets users interact with AI capabilities through Telegram using text, voice, images, documents, and more.
- Multimodal Agent System: Process and respond to text, voice, images, and documents within a single conversation
- Tool-Based Architecture: Powered by specialized modules for tasks like web search, file operations, code execution, and more
- Extensible Framework: Built on a modular system that can be expanded with new capabilities
- Natural Interface: Communicate with advanced AI models through the familiar Telegram messaging platform
- Secure Execution: All operations run in isolated environments with proper permission controls
- Multimodal Support: Process and respond to text, voice messages, images, PDFs, and other document types
- Voice Processing: Transcribe voice messages and generate voice responses with customizable voices via ElevenLabs
- Image Analysis: Describe and analyze images using state-of-the-art vision models
- Multiple Photo Processing: Analyze multiple images simultaneously to describe, compare differences, and redraw content
- Image Generation: Create new images based on text descriptions or modifications of existing images
- Generate with DALL-E 3: Create single images with OpenAI's DALL-E 3 model
- Multimodal Generation with Gemini: Generate stories with matching images using Google's Gemini model
- Image-to-Image Transformation: Edit and transform existing images with powerful AI editing capabilities
- Document Processing: Extract and analyze content from PDFs and other documents with deep understanding of the content
- Reminder System: Create and manage reminders with natural language processing for both specific times and contextual events
- Task Scheduling: Schedule tasks to execute based on specific times or event triggers like "after a match ends"
- Secure Container System: Run unsafe commands in isolated Docker containers
- Terminal Access: Execute system commands securely through the bot interface
- Package Management: Install and run packages in a secure containerized environment
- File Sharing: Send files directly to users through Telegram
- SMS Messaging: Send SMS text messages to phone numbers via Twilio integration
- Customizable Settings: Fine-tune the assistant's behavior through an interactive settings menu
- Persistent Memory: Maintain conversation context across sessions
- Advanced Agent Architecture: Powered by a modular agent system with specialized tools for different tasks
- Memory Management: Smart conversation history handling with summarization capabilities
- Web Search: Search the web for up-to-date information using multiple providers (Perplexity, Tavily)
- Reasoning Systems: Multiple stages of reasoning with critique and judge mechanisms
- Cloud Storage: Upload and manage files on S3-compatible storage
- Code Execution: Write and execute Python and Bash scripts in secure environments
- In-Process Communication: Send Telegram messages, files, or reactions directly during thinking operations through tool-based interactions, not just as final bot output
- SSH Connections: Securely connect to remote servers via SSH to perform system administration tasks and retrieve information
- Shodan API Integration: Leverage Shodan's powerful internet scanning capabilities to gather intelligence on networks, devices, and potential security issues
- Advanced Tools & Techniques: Utilize specialized cybersecurity, networking, and data analysis tools for comprehensive assessments and operations
- Invite System: Control access through user invites with admin oversight
  - Users can create and share a limited number of invite codes (5 per user)
  - Admins can generate unlimited invites
  - Track invite usage and user access via admin commands
  - Automatically notify admins when new users join
- Memory Control: Manage every aspect of the assistant's memory
  - Long-term memory
  - Short-term memory
  - Full thinking context from previous conversations
  - RAG-based automated memory
  - Summarization memory
Here are some advanced scenarios that demonstrate the power and versatility of GenerALL.AI:
User: "Research the latest developments in quantum computing in 2023-2024. Save the results as a PDF and send it to me."
Bot: "I'll research the latest developments in quantum computing and prepare a comprehensive report for you."
[Conducts research using Perplexity and Tavily integrations]
[Generates a structured report in Markdown format]
[Converts to PDF]
[Uploads to S3 storage]
[Sends file to user via Telegram]
"Here's your research report on quantum computing advancements in 2023-2024. I've included sections on hardware breakthroughs, algorithm developments, and industry applications."
User: "Connect to my production server at ssh://user@example.com and check disk usage on all volumes."
Bot: "I'll connect to your production server and check the disk usage."
[Establishes secure SSH connection]
[Executes df -h command]
[Formats results in a readable table]
"Here's the current disk usage on your production server:
Filesystem Size Used Avail Use% Mounted on
/dev/sda1 50G 23G 25G 48% /
/dev/sdb1 1T 750G 230G 77% /data"
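The "formats results in a readable table" step in this scenario could be approached roughly as follows. This is only a sketch of parsing `df -h` style output; `parse_df_output` is a hypothetical helper introduced for illustration and is not part of the project.

```python
def parse_df_output(text: str) -> list:
    """Parse `df -h` style output into a list of row dictionaries."""
    lines = [line for line in text.strip().splitlines() if line.strip()]
    # The header's last column is "Mounted on" (two words), so columns are named explicitly.
    columns = ["filesystem", "size", "used", "avail", "use_percent", "mounted_on"]
    rows = []
    for line in lines[1:]:  # skip the header line
        # Split into at most 6 fields so mount points containing spaces stay intact.
        parts = line.split(None, 5)
        if len(parts) == len(columns):
            rows.append(dict(zip(columns, parts)))
    return rows


# Sample text copied from the scenario output.
sample = """Filesystem Size Used Avail Use% Mounted on
/dev/sda1 50G 23G 25G 48% /
/dev/sdb1 1T 750G 230G 77% /data"""

for row in parse_df_output(sample):
    print(f"{row['mounted_on']:<8} {row['used']:>5} of {row['size']:>4} ({row['use_percent']})")
```

Naming the columns by hand avoids splitting the "Mounted on" header into two fields.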
User: "Scan my website https://example.com for common security vulnerabilities."
Bot: "I'll perform a security scan of your website. This might take a few minutes."
[Sets up a secure container for scanning]
[Installs and runs security tools]
[Analyzes results and compiles a security report]
"I've completed the security scan of your website. I found:
- 2 potential XSS vulnerabilities in your contact form
- SSL certificate close to expiry (expires in 15 days)
- Missing HTTP security headers
I've attached a detailed report with recommendations for fixing these issues."
User: "Create a story about a space explorer discovering an ancient alien civilization. Include images for each key scene."
Bot: "I'll create a multimodal story about a space explorer discovering an ancient alien civilization."
[Generates a complete story with matching images for each key scene]
[Sends text and images together in a coherent narrative]
"The Stellar Archaeologist - A space explorer's journey to the lost civilization of Proxima Centauri IV..."
[Image of a space explorer approaching an alien planet]
"As Captain Elara approached the crystalline structures, their surfaces shimmered with an otherworldly light..."
[Image of alien crystal structures with mysterious glowing patterns]
User: "This is great! Can you transform the second image to add more alien symbols and make it look more mysterious?"
Bot: "I'll transform that image to add more alien symbols and enhance the mysterious atmosphere."
[Takes the existing image and applies the requested transformations]
[Sends the modified image with new alien symbols and enhanced mysterious elements]
"Here's the transformed image with additional alien symbols and a more mysterious atmosphere. I've added ancient hieroglyphs that glow with an eerie blue light and enhanced the shadowy elements to create a sense of the unknown."
User: "Monitor the NASA website and alert me when the next SpaceX launch is scheduled. Remind me 1 day before and again 1 hour before the launch."
Bot: "I'll monitor the NASA and SpaceX websites for upcoming launch information and set up the reminders for you."
[Sets up regular monitoring of relevant websites]
[Identifies next launch date]
[Creates two context-based reminders]
"I've set up monitoring for the next SpaceX launch. Currently, the next launch is scheduled for June 15th at 3:00 PM EDT. I'll remind you on June 14th at 3:00 PM and again on June 15th at 2:00 PM."
User: "Write a Python script that analyzes a CSV file of stock prices, calculates moving averages, and generates a visualization. Use pandas and matplotlib."
Bot: "I'll create a Python script for stock price analysis with moving averages visualization."
[Writes Python code using pandas and matplotlib]
[Installs required libraries in a secure container]
[Executes the code with sample data]
[Generates visualization]
[Sends the resulting graph and script file to the user]
"I've created and executed a stock analysis script that:
1. Loads CSV data using pandas
2. Calculates 7-day and 30-day moving averages
3. Generates a visualization with original prices and both moving averages
4. Highlights potential buy/sell signals
I've attached both the visualization and the Python script. You can modify the script to use your own data by changing the file path on line 12."
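The moving-average step from this scenario can be illustrated without any dependencies. The sketch below is plain Python rather than the pandas code the bot would write in the scenario (where a rolling mean would play the same role); the function name and the sample prices are made up for illustration.

```python
from typing import List, Optional


def moving_average(prices: List[float], window: int) -> List[Optional[float]]:
    """Simple moving average; positions without a full window are None."""
    if window <= 0:
        raise ValueError("window must be positive")
    result: List[Optional[float]] = []
    running_sum = 0.0
    for i, price in enumerate(prices):
        running_sum += price
        if i >= window:
            # Drop the value that just left the window.
            running_sum -= prices[i - window]
        result.append(running_sum / window if i >= window - 1 else None)
    return result


prices = [10.0, 11.0, 12.0, 13.0, 14.0]  # hypothetical closing prices
print(moving_average(prices, 3))  # [None, None, 11.0, 12.0, 13.0]
```

The 7-day and 30-day averages mentioned in the scenario would be the same call with `window=7` and `window=30`.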
User: "/invite"
Bot: "New Invite Created
Share this link: https://t.me/YourBotName?start=invite_a1b2c3d4
Or use this command:
/invite a1b2c3d4
Invites remaining: 4/5"
[Later, when someone uses the invite]
New User: "/start invite_a1b2c3d4"
Bot: "Invite accepted! You now have access to the bot.
Welcome to Generall.AI bot! Use me to get AI assistance.
You can send me messages, voice recordings, or images to analyze."
[Admin receives notification]
Bot to Admin: "New user joined!
User ID: `123456789`
Invited by: `987654321`
Total users: 3"
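The invite flow in this scenario might be modeled along the lines below. This is a minimal in-memory sketch, not the project's actual implementation: the `InviteManager` class, its method names, and the 8-character code format are assumptions made for illustration. Only the limit of 5 invites per user and the unlimited admin invites come from the feature description.

```python
import secrets

MAX_INVITES_PER_USER = 5  # limit stated in the invite feature description


class InviteManager:
    """Hypothetical in-memory invite tracking; a real bot would persist this state."""

    def __init__(self, admin_ids):
        self.admin_ids = set(admin_ids)
        self.allowed = set(admin_ids)  # users who may talk to the bot
        self.created = {}              # user id -> invites created so far
        self.pending = {}              # unused invite code -> inviter id

    def create_invite(self, user_id: int) -> str:
        if user_id not in self.allowed:
            raise PermissionError("user has no access to the bot")
        used = self.created.get(user_id, 0)
        # Admins are exempt from the per-user limit.
        if user_id not in self.admin_ids and used >= MAX_INVITES_PER_USER:
            raise ValueError("invite limit reached")
        code = secrets.token_hex(4)  # 8 hex characters
        self.pending[code] = user_id
        self.created[user_id] = used + 1
        return code

    def redeem(self, code: str, new_user_id: int) -> int:
        """Grant access to new_user_id and return the inviter's id."""
        inviter = self.pending.pop(code)  # KeyError if unknown or already used
        self.allowed.add(new_user_id)
        return inviter


manager = InviteManager(admin_ids={987654321})
code = manager.create_invite(987654321)
inviter = manager.redeem(code, 123456789)
print(f"New user joined! Invited by: {inviter}. Total users: {len(manager.allowed)}")
```

Counting an invite at creation time rather than at redemption matches the "Invites remaining: 4/5" message shown right after `/invite`.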
Customize voice parameters including voice model selection, stability, clarity, and style. The bot can both listen to your voice messages and respond with generated voice using ElevenLabs.
Upload PDFs to extract and analyze content. The bot can understand complex documents, summarize contents, answer questions about the document, and provide insights.
GenerALL.AI offers multiple powerful image generation and transformation capabilities:
Generate high-quality single images with OpenAI's DALL-E 3 model, with customizable parameters:
- Control image size and quality
- Detailed prompt capabilities
- Supports art styles, photography styles, and concept visualization
Create rich stories with matching images using Google's advanced Gemini model:
- Generate text and images simultaneously within a single cohesive experience
- Create multiple images that match the narrative flow
- Support for various artistic styles (3D digital art, photorealistic, cartoon, anime, etc.)
- Perfect for storytelling, educational content, and creative projects
Transform and edit existing images with AI:
- Add or remove elements from images
- Change styles, colors, or artistic approaches
- Apply creative modifications based on text instructions
- Useful for design iterations, creative exploration, and visual problem-solving
- Works with images the user has sent previously
GenerALL.AI supports processing and analysis of various file formats including:
- Documents: PDF, TXT, DOCX, MD
- Data Files: JSON, JSONL, CSV, XLSX, XLS
- Code Files: PY, JS, HTML, CSS, PHP, SQL
- Configuration Files: XML, YAML, YML, TOML, INI, CONF
- Shell Scripts: SH, BAT, PS1
- System Files: LOG
The bot can extract content, analyze structure, and help you understand the information contained within these supported file types.
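One way to picture the supported-format list is as a lookup from file extension to category. The sketch below copies the extensions from the list above; the category keys and the `classify_file` helper are hypothetical names chosen for illustration.

```python
from pathlib import Path
from typing import Optional

# Extensions taken from the supported file types list; category keys are illustrative.
SUPPORTED_TYPES = {
    "document": {"pdf", "txt", "docx", "md"},
    "data": {"json", "jsonl", "csv", "xlsx", "xls"},
    "code": {"py", "js", "html", "css", "php", "sql"},
    "config": {"xml", "yaml", "yml", "toml", "ini", "conf"},
    "shell": {"sh", "bat", "ps1"},
    "system": {"log"},
}


def classify_file(filename: str) -> Optional[str]:
    """Return the category for a filename, or None if the type is unsupported."""
    extension = Path(filename).suffix.lower().lstrip(".")
    for category, extensions in SUPPORTED_TYPES.items():
        if extension in extensions:
            return category
    return None


for name in ["report.PDF", "prices.csv", "deploy.sh", "movie.mp4"]:
    print(name, "->", classify_file(name))
```

Lowercasing the suffix means `report.PDF` and `report.pdf` are treated the same way.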
GenerALL.AI supports processing and analysis of various media formats including:
- Images: JPG, JPEG, PNG, GIF*, BMP*, WEBP*
- Audio: Telegram Voice Message, mp3*, ogg*
- Video: -/-
The bot can:
- Images: Analyze content, detect objects, read text (OCR), and describe scenes
- Audio: Transcribe speech, analyze audio content, detect the spoken language, and answer using voice generation
- Video: -/-
- Python Development: Create, edit, and execute Python scripts in a secure environment
- Bash Scripting: Run Bash scripts and system commands safely
- Package Installation: Install Python libraries and dependencies as needed
- Data Analysis: Process and visualize data with popular libraries like pandas, numpy, and matplotlib
- Automated Workflows: Create scripts for repetitive tasks and automated data processing
- Secure Execution: All code runs in isolated containers for security
- Code Editing: Iteratively improve code based on requirements and feedback
- Python 3.12+
- Docker and Docker Compose
- FFmpeg (for audio processing)
- Git
- Telegram Bot Token
- Anthropic API Key (for Claude 3.7)
- OpenAI API Key (for GPT-4o and Whisper)
- ElevenLabs API Key (for voice synthesis)
- Google API Key (for Gemini image generation/transformation)
To run with Docker:

1. Clone the repository: `git clone https://github.com/your-username/generall.ai.git`, then `cd generall.ai`
2. Create an `.env` file based on the provided `.env.example`: `cp .env.example .env`
3. Edit the `.env` file with your API keys and configuration.
4. Build and start the Docker container: `docker-compose up --build`

To run locally without Docker:

1. Clone the repository: `git clone https://github.com/your-username/generall.ai.git`, then `cd generall.ai`
2. Create and activate a Python virtual environment: `python -m venv .venv`, then `source .venv/bin/activate` (on Windows: `.venv\Scripts\activate`)
3. Install the dependencies: `pip install -r app/requirements.txt`
4. Create and configure your `.env` file.
5. Install FFmpeg system-wide or use the included ffmpeg-downloader.
6. Run the bot: `python app/main_bot.py`
- Operating Systems: Linux (recommended for production), macOS, Windows
- Deployment: Docker-based deployment supported across all major platforms
- Python Version: 3.12+ required
- Hardware Requirements:
  - Minimum: 4GB RAM, 2 CPU cores
  - Recommended: 8GB+ RAM, 4+ CPU cores (especially for handling multiple conversations)
- Network: Requires internet connection for API access
The application is configured via environment variables in the .env file. Key configuration options include:
- `ANTHROPIC_API_KEY`: API key for Anthropic Claude models
- `OPENAI_API_KEY`: API key for OpenAI GPT models
- `GOOGLE_API_KEY`: API key for Gemini AI models
- `OPENAI_API_KEY_WHISPER`: API key for OpenAI Whisper (voice transcription)
- `TAVILY_API_KEY`: API key for Tavily search integration
- `PERPLEXITY_API_KEY`: API key for Perplexity search integration
- `ELEVENLABS_API_KEY`: API key for ElevenLabs voice synthesis
- `TELEGRAM_BOT_TOKEN`: Your Telegram bot token from BotFather
- `TELEGRAM_CHAT_ID`: Comma-separated list of initially allowed Telegram chat IDs
- `TELEGRAM_ADMIN_ID`: Telegram chat ID for the admin user (has unlimited invites and can list all users)
- `MAX_AGENT_TOOLS_ITERATIONS`: Maximum number of tool use iterations (default: 65)
- `MAX_AGENT_CRITIQUE_ITERATIONS`: Maximum number of critique iterations (default: 0)
- `S3_HOST`: S3-compatible storage host URL
- `S3_ACCESS_KEY`: S3 access key for authentication
- `S3_SECRET_KEY`: S3 secret key for authentication
- `S3_BUCKET_NAME`: Name of the S3 bucket to use
- `S3_PATH_TO_STORE`: Path within the bucket to store files
- `TWILIO_ACCOUNT_SID`: Twilio account SID for SMS messaging
- `TWILIO_AUTH_TOKEN`: Twilio authentication token
- `TWILIO_FROM_NUMBER`: Twilio phone number to send messages from
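As a rough illustration of how these variables might be read at startup, the sketch below loads a few of them into a typed object. The `BotConfig` class and `load_config` function are hypothetical and not taken from the project; only the variable names, the comma-separated format of `TELEGRAM_CHAT_ID`, and the defaults of 65 and 0 come from the list above.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class BotConfig:
    telegram_bot_token: str
    allowed_chat_ids: frozenset
    admin_id: int
    max_tool_iterations: int
    max_critique_iterations: int


def load_config(env=None) -> BotConfig:
    """Build a BotConfig from a mapping of environment variables."""
    env = os.environ if env is None else env
    # TELEGRAM_CHAT_ID is a comma-separated list; empty entries are ignored.
    chat_ids = frozenset(
        int(part) for part in env.get("TELEGRAM_CHAT_ID", "").split(",") if part.strip()
    )
    return BotConfig(
        telegram_bot_token=env["TELEGRAM_BOT_TOKEN"],  # required, KeyError if missing
        allowed_chat_ids=chat_ids,
        admin_id=int(env["TELEGRAM_ADMIN_ID"]),
        max_tool_iterations=int(env.get("MAX_AGENT_TOOLS_ITERATIONS", "65")),
        max_critique_iterations=int(env.get("MAX_AGENT_CRITIQUE_ITERATIONS", "0")),
    )


# Placeholder values for demonstration only.
sample_env = {
    "TELEGRAM_BOT_TOKEN": "placeholder-token",
    "TELEGRAM_CHAT_ID": "111, 222",
    "TELEGRAM_ADMIN_ID": "111",
}
config = load_config(sample_env)
print(sorted(config.allowed_chat_ids), config.max_tool_iterations)
```

Passing the mapping in as a parameter keeps the function easy to test without touching the real environment.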
- The application uses secure Docker containers to run potentially unsafe commands
- User access is restricted to specified Telegram chat IDs
- API keys are stored securely in environment variables
- Docker socket is mounted to allow container management
- Start a conversation with your bot on Telegram
- Send text messages, voice recordings, images, or documents
- The bot will process your input and respond accordingly
- Use `/settings` to customize the bot's behavior
- Use `/reminders` to manage your reminders
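For time-based reminders, a request such as "remind me 1 day before and again 1 hour before" an event comes down to simple date arithmetic. The sketch below is an assumption about how that could be computed; `reminder_times` is a hypothetical helper, and the year in the sample date is an arbitrary placeholder.

```python
from datetime import datetime, timedelta
from typing import List


def reminder_times(event: datetime, offsets: List[timedelta]) -> List[datetime]:
    """Return reminder times, earliest first, for the given offsets before the event."""
    return sorted(event - offset for offset in offsets)


# Hypothetical event: June 15th at 3:00 PM (the year is a placeholder).
launch = datetime(2025, 6, 15, 15, 0)
for when in reminder_times(launch, [timedelta(hours=1), timedelta(days=1)]):
    print(when.strftime("%B %d, %I:%M %p"))
```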
- Bot not responding: Check your Telegram token and allowed chat IDs
- Voice features not working: Ensure FFmpeg is properly installed
- Container issues: Verify Docker is running and the user has appropriate permissions
- API errors: Check your API keys and network connection
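Several of these checks can be automated before starting the bot. The following is a hedged sketch of such a preflight script, not something shipped with the project: `find_problems` is a hypothetical helper, and the choice of which tools and variables to treat as required is an assumption. It only checks that executables are on `PATH` and that variables are non-empty; it does not verify that Docker is running or that a key is valid.

```python
import os
import shutil

# Assumed minimum set for illustration; adjust to the features you actually use.
REQUIRED_TOOLS = ["ffmpeg", "docker", "git"]
REQUIRED_ENV = ["TELEGRAM_BOT_TOKEN", "TELEGRAM_CHAT_ID"]


def find_problems(env, which=shutil.which) -> list:
    """Return human-readable descriptions of missing tools and variables."""
    problems = []
    for tool in REQUIRED_TOOLS:
        if which(tool) is None:
            problems.append(f"{tool} not found on PATH")
    for name in REQUIRED_ENV:
        if not env.get(name):
            problems.append(f"{name} is not set")
    return problems


if __name__ == "__main__":
    issues = find_problems(os.environ)
    for issue in issues:
        print("Problem:", issue)
    if not issues:
        print("Basic checks passed")
```

The `which` parameter exists so the lookup can be replaced in tests; by default it uses `shutil.which`.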
GenerALL.AI is released under a custom license with the following terms:
- Personal Use: You may use this software for personal, non-commercial purposes.
- Modification: You may modify the software and create derivative works.
- Distribution: You may distribute copies of the original or modified software.
- Attribution: You must give appropriate credit to the original authors.
- Commercial Use: Commercial use requires explicit permission from the copyright holders.
See the LICENSE file for complete details.
- Built with python-telegram-bot
- Uses AI models from Anthropic and OpenAI
- Voice synthesis powered by ElevenLabs
Become a Patron: https://www.patreon.com/zorg
Donate via PayPal: http://paypal.me/mezorg/15
bc1qyymy3ufvq3c3uq2q4927ll6x4rhvdw8gxlydwc
0x0213A705065B193D14f1A3cd075977e28Da8F9B3
TG2efcamZ1767TkBfeGUn8QWaRGrLpHUxD
B7faayiFUqM64Dgt4iUtpSfWbo4VANBs8bknSXv3e53E
UQDg07heLBcWdYO_sP6_Hc9hCu24E3v05sBJuRqc_DyWKreq
D9RbkgazaGhkT4FHkJtHoh4hDxkzAZwQnK
You can make a donation / subscription, or say thanks in Telegram: https://t.me/ZorgeR
Boosty (Donate / Subscription) : https://boosty.to/zorgg
