Sigma Zero

License: Apache 2.0


Sigma (Zero) Rule Evaluator

A high-performance Rust application for evaluating Sigma detection rules against large volumes of security logs with parallel processing capabilities.

Features

  • ⚑ Parallel Processing: Leverages all CPU cores using Rayon for maximum throughput
  • πŸ“Š Scalable: Efficiently handles huge log files with streaming and batch processing
  • 🎯 Flexible Rule Support: Supports standard Sigma rule YAML format
  • πŸ” Pattern Matching: Includes wildcard matching, regex support, and IP/domain detection
  • πŸš€ Fast: Optimized for speed with zero-copy parsing where possible
  • πŸ“ JSON Output: Results in structured JSON format for easy integration

Installation

Prerequisites

A recent Rust toolchain with cargo (e.g. installed via rustup).

Build from Source

# Clone or download the project
git clone https://github.com/ping2A/sigmazero.git
cd sigmazero

# Build in release mode for maximum performance
cargo build --release

# The binary will be at target/release/sigma-zero

Usage

Basic Usage

sigma-zero --rules-dir ./examples/rules --logs ./examples/logs

Command Line Options

Options:
  -r, --rules-dir <RULES_DIR>     Path to directory containing Sigma rules (YAML files)
  -l, --logs <LOGS>               Path to log file or directory containing log files (JSON format)
  -c, --correlation-rules <DIR>   Path to directory containing correlation rules (optional)
  -w, --workers <WORKERS>         Number of parallel workers (defaults to number of CPU cores)
  -o, --output <OUTPUT>           Output file for matches (defaults to stdout)
  -f, --format <FORMAT>           Output format: json, jsonl, or text [default: text]
      --validate                  Validate rules only (parse and exit; no log evaluation)
      --filter-tag <TAG>          Filter rules by tag (can be repeated)
      --filter-level <LEVEL>      Filter rules by level (can be repeated)
      --filter-id <ID>            Filter rules by id (can be repeated)
      --field-map <MAP>           Field mapping rule_field:log_field (e.g. CommandLine:command_line). Comma-separated or repeated
  -v, --verbose                   Enable verbose logging
  -h, --help                      Print help
  -V, --version                   Print version

Examples

Process a single log file:

sigma-zero -r ./rules -l ./logs/security.json

Process a directory of logs with 8 parallel workers:

sigma-zero -r ./rules -l ./logs -w 8

Output as JSON or JSONL:

sigma-zero -r ./rules -l ./logs -f json -o matches.json
sigma-zero -r ./rules -l ./logs -f jsonl

Validate rules only (no log evaluation):

sigma-zero -r ./rules --validate

Filter rules by tag or level:

sigma-zero -r ./rules -l ./logs --filter-tag attack.execution --filter-level high

Map rule field names to log field names (e.g. Windows rule fields to your log schema):

sigma-zero -r ./rules -l ./logs --field-map CommandLine:command_line,ProcessName:process_name
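
The same mapping can also be passed as repeated flags instead of a comma-separated list:

sigma-zero -r ./rules -l ./logs --field-map CommandLine:command_line --field-map ProcessName:process_name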

Save results to a file:

sigma-zero -r ./rules -l ./logs -o matches.json

Enable verbose logging for debugging:

sigma-zero -r ./rules -l ./logs -v

Streaming mode

For real-time or pipe-based evaluation, use sigma-zero-streaming. It reads JSON logs from stdin and evaluates them as they arrive:

tail -f /var/log/app.json | sigma-zero-streaming -r ./rules
journalctl -f -o json | sigma-zero-streaming -r ./rules

Streaming options:

  • -r, --rules-dir – Path to Sigma rules
  • -c, --correlation-rules – Optional correlation rules directory
  • -b, --batch-size <N> – Process logs in batches of N (default: 1 for real-time)
  • -f, --output-format <json|text|silent> – Output format (default: text)
  • -m, --min-level <LEVEL> – Only output matches at or above this level (low, medium, high, critical)

Throughput: Use a larger batch size (e.g. -b 100) to trade latency for higher throughput when reading from a pipe or file.
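
For example, replaying a captured file (events.jsonl is a placeholder for your own data) in batches of 100, printing only high-or-above matches as JSON:

cat events.jsonl | sigma-zero-streaming -r ./rules -b 100 -m high -f json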

Log Format

Logs must be in JSON format with one log entry per line (JSONL). Each entry is a JSON object with arbitrary fields, for example (pretty-printed here for readability; in the actual file, each object occupies a single line):

{
  "timestamp": "2025-11-06T10:15:30Z",
  "event_type": "process_creation",
  "process_name": "powershell.exe",
  "command_line": "powershell.exe -enc ZQBjAGgAbwAgACIASABlAGwAbABvACIACgA=",
  "user": "john.doe",
  "source_ip": "192.168.1.50"
}

Sigma Rule Format

Rules follow the standard Sigma format. Here's an example:

title: Suspicious Process Execution
id: 12345678-1234-1234-1234-123456789abc
description: Detects execution of suspicious processes
status: experimental
level: high
detection:
  selection:
    process_name:
      - '*powershell.exe'
      - '*cmd.exe'
      - '*mimikatz*'
    command_line:
      - '*-enc*'
      - '*bypass*'
  condition: selection
tags:
  - attack.execution
  - attack.t1059

Supported Features

  • Field matching: Exact match, substring match, wildcard (*) support
  • Field modifiers:
    • startswith - Match values that start with pattern
    • endswith - Match values that end with pattern
    • contains - Match values containing pattern (default)
    • all - Require all values to match (instead of any)
    • re - Regular expression matching
    • base64 - Match base64-decoded content
    • lt/lte/gt/gte - Numeric comparisons
  • Advanced Conditions:
    • AND - All conditions must match
    • OR - At least one condition must match
    • NOT - Negate/exclude conditions
    • Parentheses () for grouping
    • 1 of them, all of them - Pattern-based selection
    • 1 of selection_* - Wildcard selection matching
    • Threshold/count conditions: selection_name | count > 5 or | count >= N – the rule fires when the number of logs matching the selection (within the current batch) satisfies the threshold. Evaluated only in batch mode (file input or evaluate_log_batch).
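
A minimal sketch of a count rule using the syntax above (the id and the event_type value are illustrative):

title: Repeated Authentication Failures
id: 87654321-4321-4321-4321-cba987654321
level: medium
detection:
  selection:
    event_type: authentication_failure
  condition: selection | count > 5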

πŸ“– See CONDITION_OPERATORS.md for complete documentation on all operators and modifiers.

  • Multiple values: Arrays of values for OR logic
  • Conditions:
    • Single selection
    • AND conditions (all selections must match)
    • OR conditions (at least one selection must match)
  • Wildcards: Use * for wildcard matching (e.g., *powershell*)

See FIELD_MODIFIERS.md for complete field modifier documentation.
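
A sketch combining several of the modifiers above (field names follow the log example earlier; the id is illustrative, and modifiers are assumed to chain with | as in standard Sigma):

title: Encoded PowerShell Download Cradle
id: 11111111-2222-3333-4444-555555555555
level: high
detection:
  selection_proc:
    process_name|endswith: 'powershell.exe'
  selection_cli:
    command_line|contains|all:
      - '-enc'
      - 'http'
  condition: selection_proc and selection_cli
tags:
  - attack.execution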

Example Rules Included

  1. suspicious_process.yml: Detects suspicious process executions like PowerShell with encoded commands
  2. suspicious_network.yml: Detects connections to known malicious domains or suspicious IPs
  3. privilege_escalation.yml: Detects privilege escalation attempts
  4. modifiers_startswith.yml: Demonstrates startswith modifier usage
  5. modifiers_endswith.yml: Demonstrates endswith modifier for file extensions
  6. modifiers_regex.yml: Demonstrates regex pattern matching
  7. modifiers_all.yml: Demonstrates all modifier for multi-condition matching
  8. modifiers_base64.yml: Demonstrates base64 content detection
  9. modifiers_comparison.yml: Demonstrates numeric comparison operators

Example Log Files Included

The project includes 4 realistic security log files (170 total events):

  1. security_events.json (15 events) - Basic security events with mixed legitimate and suspicious activity
  2. critical_security_events.json (50 events) - Comprehensive attack lifecycle from initial compromise to ransomware
  3. apt_attack_chain.json (50 events) - Advanced Persistent Threat multi-stage attack campaign
  4. mixed_traffic.json (55 events) - Realistic mix of legitimate (70%) and malicious (30%) traffic for false positive testing

Attack Coverage: All 12 MITRE ATT&CK tactics represented
Use Cases: Development, testing, training, incident response simulation

Performance Considerations

Parallel Processing

The engine automatically uses all available CPU cores. You can control this with the -w flag:

# Use 16 workers for maximum throughput on a 16+ core system
sigma-zero -r ./rules -l ./huge-logs -w 16

Memory Efficiency

  • Logs are streamed line-by-line to minimize memory usage
  • Parsed logs are processed in batches
  • Results are collected incrementally

Optimization Tips

  1. Compile in release mode: Always use cargo build --release
  2. Adjust worker count: Match to your CPU core count for best results
  3. Use SSD storage: Faster disk I/O significantly improves performance
  4. Rule optimization: More specific rules (fewer wildcards) evaluate faster

Benchmarking

To benchmark performance on your system:

# Create a large test log file
seq 1 1000000 | while read i; do 
  echo "{\"id\": $i, \"process_name\": \"test.exe\", \"command_line\": \"test command $i\"}"
done > large_test.json

# Time the evaluation
time sigma-zero -r ./examples/rules -l large_test.json -w $(nproc)

Output Format

Matches are output in JSON format:

{
  "rule_id": "12345678-1234-1234-1234-123456789abc",
  "rule_title": "Suspicious Process Execution",
  "level": "high",
  "matched_log": {
    "timestamp": "2025-11-06T10:15:30Z",
    "process_name": "powershell.exe",
    "command_line": "powershell.exe -enc ...",
    "user": "john.doe"
  },
  "timestamp": "2025-11-06T12:30:45.123Z"
}
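
With -f jsonl, each match is emitted as a single JSON object per line, which pipes cleanly into standard tooling. For example, keeping only high-level matches with jq (assuming jq is installed):

sigma-zero -r ./rules -l ./logs -f jsonl | jq -c 'select(.level == "high")'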

Limitations

  • Condition complexity: Complex condition expressions with nested parentheses and NOT operators are simplified
  • Aggregation: Time-based aggregations and correlations not yet supported
  • Field modifiers: Most common modifiers implemented (startswith, endswith, contains, all, re, base64, comparisons). Advanced modifiers like utf16le/utf16be are planned for future releases


About

This project was largely generated with the help of Claude.ai during an Amsterdam trip for the startup, but it seems to work correctly and can handle many cases without the need for a full SIEM. You can think of it as a micro SIEM for evaluating your logs locally, or for checking specific logs against edge cases.
