NitroGen Server

NitroGen Server is a specialized inference server for the NitroGen foundation model (originally by MineDojo). It provides a high-performance backend for generalist gaming agents, allowing them to play games by processing visual input and generating controller commands.

This project dockerizes the original NitroGen implementation and extends it with a dual-protocol architecture, enabling connections from both Python-based clients and external tools like BizHawk (Lua).

Important

License & Usage Restrictions: This project is based on NVIDIA's Work and is licensed under the NVIDIA License. It is strictly for non-commercial research purposes only. Use for military, surveillance, nuclear technology, or biometric processing is expressly prohibited.

✨ Features

Foundation Model: Powered by nvidia/NitroGen, a large multimodal model for game control.
Dual Protocol Support:
- ZeroMQ + Pickle: Fast, native communication for Python clients.
- TCP + JSON: Universal standard for connecting from Lua, C#, or other languages (perfect for Emulator integration).
Dockerized: No Python or CUDA environment setup on the host; the container handles it. The host still needs Docker, an NVIDIA driver, and the NVIDIA Container Toolkit (see Prerequisites).
Auto-Healing: Automatically downloads the model weights (ng.pt) on the first run if they are missing.
Persistent Caching: Uses a local volume for models to avoid re-downloading.
LoRA Support: Use custom fine-tuned LoRA adapters with auto-merging capabilities.

🚀 Quick Start (Docker)

This is the recommended way to run the server.

Prerequisites

Host with an NVIDIA GPU (Linux or Windows)
Docker & Docker Compose
NVIDIA Container Toolkit (required for GPU access)

1. Clone Repository

git clone https://github.com/artryazanov/nitrogen-server.git
cd nitrogen-server

2. Start the Server

docker-compose up --build

On the first run, the server will automatically download the ~2GB model checkpoint. This may take a few minutes. Once ready, you will see:

ZMQ Server running on port 5555
Simple TCP Server (JSON+Bytes) running on port 5556
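
If a client is launched from a script, it can help to wait until both ports accept connections before sending anything. The following is a minimal sketch using only the Python standard library; `wait_for_port` is a hypothetical helper (not part of this repository), and it assumes that a plain connect-and-close probe is harmless to both servers.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 300.0, interval: float = 2.0) -> bool:
    """Return True once a TCP connection to host:port succeeds, False on timeout.

    Hypothetical helper: it only checks that something is listening, not that
    the model has finished loading.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            # create_connection raises OSError while nothing is listening yet.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

For example, `wait_for_port("127.0.0.1", 5555)` and `wait_for_port("127.0.0.1", 5556)` would block until the ZeroMQ and TCP/JSON ports are reachable on the local machine. The generous default timeout is a guess meant to cover the first-run model download.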

💻 Usage

Connecting Clients

The server exposes two ports by default:

| Protocol | Port | Description | Target Use Case |
| --- | --- | --- | --- |
| ZeroMQ | `5555` | Serialized Python objects (Pickle) | Python Clients (e.g., `scripts/play.py`) |
| TCP/JSON | `5556` | JSON Header + Image (BMP/PNG) or Raw Bytes | BizHawk / Emulators / Non-Python |

Python Client Example

We provide a play.py script to connect a game running on a client (e.g. Windows) to the NitroGen server.

# On your Windows Gaming Machine
python scripts/play.py --process "celeste.exe" --ip <SERVER_IP> --port 5555

BizHawk (Lua) Integration

For emulators like BizHawk, use the TCP protocol on port 5556.

We provide a ready-to-use client script in a separate repository: NitroGen BizHawk AI Agent.

Please refer to the repository documentation for setup and usage instructions.

Protocol Details

The server detects the payload format automatically on port 5556. It accepts encoded images (PNG, BMP, JPG) as well as raw pixels.
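
When debugging a client, it can be useful to check what kind of payload is about to be sent. The sketch below is a client-side illustration only, not the server's detection logic (the server relies on cv2.imdecode for encoded images). `guess_payload_kind` is a hypothetical name. It assumes that a payload of exactly 256 x 256 x 3 bytes is read as raw RGB, and otherwise looks at the well-known leading signature bytes of PNG, BMP, and JPEG files.

```python
RAW_RGB_LEN = 256 * 256 * 3  # 196,608 bytes of raw RGB pixels

# Standard leading signature bytes of the three encoded formats.
_SIGNATURES = (
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"BM", "bmp"),
    (b"\xff\xd8\xff", "jpeg"),
)


def guess_payload_kind(payload: bytes) -> str:
    """Classify a payload as 'raw', 'png', 'bmp', 'jpeg', or 'unknown'."""
    # The size check comes first: an exact-length payload counts as raw pixels.
    if len(payload) == RAW_RGB_LEN:
        return "raw"
    for magic, kind in _SIGNATURES:
        if payload.startswith(magic):
            return kind
    return "unknown"
```

A result of `"unknown"` before sending usually means the capture step produced something other than an image file, which is cheaper to catch on the client than in a failed request.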

Steps:

1. Open Socket: Connect to <SERVER_IP>:5556.
2. Send Request: Send a JSON header terminated by \n. Because \n ends the header, serialize the JSON on a single line (it is shown on several lines here for readability). Including the len field (payload size in bytes) is strongly recommended so that the server stays in sync with the stream.
   ```
   {
       "type": "predict",
       "len": 12345,
       "resize_mode": "pad"
   }
   ```
   Resize Modes (resize_mode):
   - pad (Default): Pads the image to a square with black bars, preserving the aspect ratio, then resizes to 256x256.
   - crop: Center-crops a square from the image, then resizes to 256x256.
   - stretch: Stretches the image to fit 256x256 (may distort the aspect ratio).
3. Send Image:
   - Option A (Recommended): Send a standard image file (PNG, BMP, JPG). The server uses cv2.imdecode to parse it automatically.
   - Option B (Fallback): Send 196,608 bytes of raw RGB pixel data (256 x 256 pixels x 3 bytes). If len is exactly 196608, the payload is treated as a raw buffer.
4. Receive Response: Read the JSON response terminated by \n.
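
The four steps above can be sketched as a small Python client using only the standard library. This is an illustration under stated assumptions, not the project's reference client: `build_header`, `read_line`, and `predict` are hypothetical names, one request is sent per connection, and the response is returned as parsed JSON without assuming any particular fields, since the response schema is not described here.

```python
import json
import socket
from typing import Any

RESIZE_MODES = ("pad", "crop", "stretch")


def build_header(payload_len: int, resize_mode: str = "pad") -> bytes:
    """Serialize the request header as one JSON line ending in a newline."""
    if resize_mode not in RESIZE_MODES:
        raise ValueError(f"resize_mode must be one of {RESIZE_MODES}")
    header = {"type": "predict", "len": payload_len, "resize_mode": resize_mode}
    # json.dumps emits a single line by default, so the only "\n" is the terminator.
    return json.dumps(header).encode("utf-8") + b"\n"


def read_line(sock: socket.socket) -> bytes:
    """Read from the socket until the first newline and return the line without it."""
    buf = bytearray()
    while b"\n" not in buf:
        chunk = sock.recv(4096)
        if not chunk:
            raise ConnectionError("connection closed before a full response arrived")
        buf.extend(chunk)
    line, _, _ = bytes(buf).partition(b"\n")
    return line


def predict(host: str, port: int, image_bytes: bytes,
            resize_mode: str = "pad", timeout: float = 30.0) -> Any:
    """Send one image (encoded file bytes or raw RGB) and return the parsed JSON reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:  # step 1
        sock.sendall(build_header(len(image_bytes), resize_mode))          # step 2
        sock.sendall(image_bytes)                                          # step 3
        return json.loads(read_line(sock))                                 # step 4
```

For example, reading a screenshot file into `image_bytes` and calling `predict("<SERVER_IP>", 5556, image_bytes)` would send it with the default pad mode. Setting len from `len(image_bytes)` keeps the header and payload consistent by construction.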

🐞 Debugging Mode

You can enable debug mode to save detailed artifacts for every request (received image, JSON parameters, processed image, model response).

Enable via CLI (Manual):

python scripts/serve.py models/nvidia/NitroGen/ng.pt --debug --debug-dir debug_output

Enable via Docker: To run with debug mode in Docker, use docker-compose run to pass the flag and map the ports:

docker-compose run --service-ports nitrogen-server --debug

Note: This will output artifacts to the debug/ folder on your host machine (mapped in docker-compose.yml).

Artifacts generated:

*_1_received.png: The original image received from the client.
*_2_params.json: The JSON parameters of the request.
*_3_processed.png: The preprocessed image (resized/padded) sent to the model.
*_4_response.json: The model's prediction response.
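
To inspect a debug run, the four artifacts of each request can be grouped by their shared prefix. The sketch below assumes that the part of the filename matched by `*` is identical across the four files of one request; `group_debug_artifacts` is a hypothetical helper, not a script shipped with the server.

```python
from collections import defaultdict
from pathlib import Path

# Filename suffixes listed above, mapped to a short stage name.
STAGES = {
    "_1_received.png": "received",
    "_2_params.json": "params",
    "_3_processed.png": "processed",
    "_4_response.json": "response",
}


def group_debug_artifacts(debug_dir: str) -> dict[str, dict[str, Path]]:
    """Map each request prefix to {stage name: file path}; unrelated files are skipped."""
    groups: dict[str, dict[str, Path]] = defaultdict(dict)
    for path in sorted(Path(debug_dir).iterdir()):
        for suffix, stage in STAGES.items():
            if path.name.endswith(suffix):
                prefix = path.name[: -len(suffix)]
                groups[prefix][stage] = path
                break
    return dict(groups)
```

A request whose group has fewer than four stages likely failed partway through, which makes incomplete groups a reasonable first place to look.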

Run with LoRA Adapter: To use a different model (like a LoRA adapter) with Docker, use docker-compose run to override the start command arguments:

# Ensure your checkpoint is in the `models/` directory (e.g. models/checkpoints/final_model)
docker-compose run --service-ports nitrogen-server models/checkpoints/final_model --base-model models/nvidia/NitroGen/ng.pt

🛠 Manual Installation (Development)

If you prefer to run the server without Docker (e.g., for development):

# 1. Install dependencies
# Quote the extras so shells such as zsh do not treat the brackets as a glob pattern
pip install -e ".[serve]" peft
pip install "huggingface_hub[cli]"

# 2. Download Model
# The server requires the model weights (~2GB) to be downloaded locally.
huggingface-cli download nvidia/NitroGen ng.pt --local-dir models/nvidia/NitroGen

# 3. Run Server
python scripts/serve.py models/nvidia/NitroGen/ng.pt [--debug]

# 4. Run with LoRA Adapter
# Point to the LoRA directory. Ensure the base model is also available.
python scripts/serve.py models/checkpoints/final_model --base-model models/nvidia/NitroGen/ng.pt
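
The download step can also be scripted from Python so that the checkpoint is fetched only when it is missing. This is a minimal sketch, not the logic used by start.sh: `ensure_weights` and `download_from_hub` are hypothetical names, and the only library call is `hf_hub_download` from the huggingface_hub package installed in step 1.

```python
from pathlib import Path
from typing import Callable


def download_from_hub(target: Path) -> None:
    """Fetch the checkpoint into target's directory (needs network access)."""
    # Imported lazily so the rest of this sketch works without huggingface_hub.
    from huggingface_hub import hf_hub_download

    hf_hub_download(
        repo_id="nvidia/NitroGen",
        filename=target.name,
        local_dir=str(target.parent),
    )


def ensure_weights(target: Path,
                   download: Callable[[Path], None] = download_from_hub) -> bool:
    """Download the checkpoint only if it is missing; return True if a download ran."""
    if target.is_file():
        return False
    target.parent.mkdir(parents=True, exist_ok=True)
    download(target)
    if not target.is_file():
        raise FileNotFoundError(f"download finished but {target} does not exist")
    return True
```

Calling `ensure_weights(Path("models/nvidia/NitroGen/ng.pt"))` before starting the server would mirror the manual download in step 2. The downloader is passed in as a parameter so the check can be exercised without network access.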

📂 Project Structure

nitrogen/: Core library code (model definition, inference logic).
scripts/: Executable scripts.
- serve.py: The main server entry point.
- play.py: Python client script for running agents.
- start.sh: Entrypoint script for Docker.
models/: Directory for storing downloaded model weights (gitignored).
tests/: Unit and integration tests.
Dockerfile: Definition for the server container.

🔗 Credits

This project is a fork and extension of the original work by MineDojo.

Original Repository: MineDojo/NitroGen
Hugging Face Model: nvidia/NitroGen

Check README_ORIGINAL.md for the original documentation.
