policy-go

A high-performance policy evaluation library for OpenTelemetry telemetry data in Go. Built with Hyperscan for fast regex matching and designed for hot-reload support.

Features

  • High-Performance Matching: Uses Intel Hyperscan for vectorized regex evaluation
  • Multi-Telemetry Support: Evaluate logs, metrics, and traces with type-safe APIs
  • Hot Reload: File-based policy providers support automatic reloading on change
  • Thread-Safe: Immutable snapshots for concurrent evaluation
  • Extensible: Provider interface for custom policy sources (file, HTTP, gRPC)
  • Statistics: Per-policy hit/drop/sample counters with atomic operations
  • Rate Limiting: Lock-free per-policy rate limiting with configurable windows
  • AND Semantics: Multiple matchers in a policy are AND'd together

Installation

go get github.com/usetero/policy-go

Requirements

  • Go 1.21+
  • Hyperscan library (via gohs)

On macOS with Homebrew:

brew install hyperscan

On Ubuntu/Debian:

apt-get install libhyperscan-dev

Quick Start

package main

import (
    "fmt"
    "log"
    "time"

    "github.com/usetero/policy-go"
)

// Define your log record type
type LogRecord struct {
    Body               []byte
    SeverityText       []byte
    TraceID            []byte
    LogAttributes      map[string]any
    ResourceAttributes map[string]any
}

// Implement a match function to extract field values
func matchLog(r *LogRecord, ref policy.LogFieldRef) []byte {
    // Handle field lookups
    if ref.IsField() {
        switch ref.Field {
        case policy.LogFieldBody:
            return r.Body
        case policy.LogFieldSeverityText:
            return r.SeverityText
        case policy.LogFieldTraceID:
            return r.TraceID
        default:
            return nil
        }
    }

    // Handle attribute lookups
    var attrs map[string]any
    switch {
    case ref.IsResourceAttr():
        attrs = r.ResourceAttributes
    case ref.IsRecordAttr():
        attrs = r.LogAttributes
    default:
        return nil
    }
    return traversePath(attrs, ref.AttrPath)
}

func traversePath(m map[string]any, path []string) []byte {
    if len(path) == 0 || m == nil {
        return nil
    }
    val, ok := m[path[0]]
    if !ok {
        return nil
    }
    if len(path) == 1 {
        switch v := val.(type) {
        case string:
            return []byte(v)
        case []byte:
            return v
        }
        return nil
    }
    if nested, ok := val.(map[string]any); ok {
        return traversePath(nested, path[1:])
    }
    return nil
}

func main() {
    // Create a registry
    registry := policy.NewPolicyRegistry()

    // Create a file provider with hot reload
    provider := policy.NewFileProvider("policies.json",
        policy.WithPollInterval(30*time.Second),
        policy.WithOnError(func(err error) {
            log.Printf("Policy error: %v", err)
        }),
    )
    defer provider.Stop()

    // Register the provider
    handle, err := registry.Register(provider)
    if err != nil {
        log.Fatal(err)
    }
    defer handle.Unregister()

    // Create an engine
    engine := policy.NewPolicyEngine(registry)

    // Evaluate a log record
    record := &LogRecord{
        Body:         []byte("debug trace message"),
        SeverityText: []byte("INFO"),
        LogAttributes: map[string]any{
            "http": map[string]any{
                "method": "GET",
            },
        },
    }

    result := policy.EvaluateLog(engine, record, matchLog)
    fmt.Printf("Result: %s\n", result) // "drop" if matched by policy
}

Core Concepts

PolicyRegistry

The registry manages policies from multiple providers. When policies change, it automatically recompiles the Hyperscan database and produces a new immutable snapshot.

registry := policy.NewPolicyRegistry()

// Register providers
fileHandle, _ := registry.Register(fileProvider)
httpHandle, _ := registry.Register(httpProvider)

// Collect stats
stats := registry.CollectStats()

PolicyEngine

The engine evaluates telemetry against compiled policies. It holds a reference to the registry and automatically uses the latest snapshot for each evaluation.

engine := policy.NewPolicyEngine(registry)

// Evaluate logs
result := policy.EvaluateLog(engine, logRecord, matchLogFunc)

// Evaluate metrics
result = policy.EvaluateMetric(engine, metricRecord, matchMetricFunc)

// Evaluate traces/spans
result = policy.EvaluateTrace(engine, spanRecord, matchTraceFunc)

switch result {
case policy.ResultNoMatch:
    // No policy matched - pass through
case policy.ResultKeep:
    // Matched a keep policy (or under rate limit)
case policy.ResultDrop:
    // Matched a drop policy (or over rate limit)
case policy.ResultSample:
    // Sampled (for metrics without sample key)
}

Match Functions

Instead of implementing an interface, you provide a match function that extracts field values from your telemetry types. This allows maximum flexibility in how you represent your data.

// LogMatchFunc extracts values from log records
type LogMatchFunc[T any] func(record T, ref LogFieldRef) []byte

// MetricMatchFunc extracts values from metrics
type MetricMatchFunc[T any] func(record T, ref MetricFieldRef) []byte

// TraceMatchFunc extracts values from spans
type TraceMatchFunc[T any] func(record T, ref TraceFieldRef) []byte

Example match function for logs:

func matchLog(r *MyLogRecord, ref policy.LogFieldRef) []byte {
    // Handle field lookups
    if ref.IsField() {
        switch ref.Field {
        case policy.LogFieldBody:
            return r.Body
        case policy.LogFieldSeverityText:
            return r.SeverityText
        default:
            return nil
        }
    }

    // Handle attribute lookups
    var attrs map[string]any
    switch {
    case ref.IsResourceAttr():
        attrs = r.ResourceAttributes
    case ref.IsRecordAttr():
        attrs = r.LogAttributes
    case ref.IsScopeAttr():
        attrs = r.ScopeAttributes
    default:
        return nil
    }
    return traversePath(attrs, ref.AttrPath)
}
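
Metric (and trace) match functions follow the same shape. Below is a minimal sketch for metrics that reuses traversePath from the Quick Start; the MetricRecord type and the MetricFieldName / MetricFieldUnit constants are illustrative assumptions, not confirmed names from the package:

// A sketch of a metric match function. The MetricRecord type and the
// MetricFieldName / MetricFieldUnit constants are assumptions for
// illustration; check the package for the exact field enum names.
type MetricRecord struct {
    Name                []byte
    Unit                []byte
    DatapointAttributes map[string]any
    ResourceAttributes  map[string]any
}

func matchMetric(r *MetricRecord, ref policy.MetricFieldRef) []byte {
    if ref.IsField() {
        switch ref.Field {
        case policy.MetricFieldName: // assumed constant name
            return r.Name
        case policy.MetricFieldUnit: // assumed constant name
            return r.Unit
        default:
            return nil
        }
    }
    switch {
    case ref.IsResourceAttr():
        return traversePath(r.ResourceAttributes, ref.AttrPath)
    case ref.IsRecordAttr(): // datapoint attributes are record-scoped
        return traversePath(r.DatapointAttributes, ref.AttrPath)
    default:
        return nil
    }
}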

Field References

Field references (LogFieldRef, MetricFieldRef, TraceFieldRef) describe what value to extract. Use helper methods to determine the reference type:

ref.IsField()        // Is this a direct field (body, name, etc.)?
ref.IsResourceAttr() // Is this a resource attribute?
ref.IsRecordAttr()   // Is this a record/span/datapoint attribute?
ref.IsScopeAttr()    // Is this a scope attribute?
ref.IsEventAttr()    // Is this an event attribute? (traces only)
ref.IsLinkAttr()     // Is this a link attribute? (traces only)

ref.Field            // The field enum value
ref.AttrPath         // The attribute path (e.g., ["http", "method"])

Attribute Scopes

  • AttrScopeResource: Resource-level attributes (service.name, etc.)
  • AttrScopeScope: Instrumentation scope attributes
  • AttrScopeRecord: Record-level attributes (log attributes, span attributes, datapoint attributes)
  • AttrScopeEvent: Span event attributes (traces only)
  • AttrScopeLink: Span link attributes (traces only)

Configuration

Config File

Use a JSON configuration file to define providers:

{
  "policy_providers": [
    {
      "type": "file",
      "id": "local-policies",
      "path": "/etc/tero/policies.json",
      "poll_interval_secs": 30
    }
  ]
}

Loading Config

config, err := policy.LoadConfig("config.json")
if err != nil {
    log.Fatal(err)
}

loader := policy.NewConfigLoader(registry).
    WithOnError(func(err error) {
        log.Printf("Provider error: %v", err)
    })

providers, err := loader.Load(config)
if err != nil {
    log.Fatal(err)
}
defer policy.StopAll(providers)
defer policy.UnregisterAll(providers)

Policy Format

Policies are defined in JSON format following the Tero Policy Specification:

Log Policies

{
  "policies": [
    {
      "id": "drop-debug-logs",
      "name": "Drop debug logs containing trace",
      "log": {
        "match": [
          { "log_field": "body", "regex": "debug" },
          { "log_field": "body", "regex": "trace" }
        ],
        "keep": "none"
      }
    },
    {
      "id": "drop-nginx-logs",
      "name": "Drop nginx access logs",
      "log": {
        "match": [{ "log_attribute": "ddsource", "exact": "nginx" }],
        "keep": "none"
      }
    }
  ]
}

Metric Policies

{
  "policies": [
    {
      "id": "drop-internal-metrics",
      "name": "Drop internal metrics",
      "metric": {
        "match": [{ "metric_field": "name", "starts_with": "internal." }],
        "keep": false
      }
    },
    {
      "id": "drop-histogram-metrics",
      "name": "Drop histogram type metrics",
      "metric": {
        "match": [{ "metric_field": "type", "exact": "histogram" }],
        "keep": false
      }
    }
  ]
}

Trace Policies

{
  "policies": [
    {
      "id": "sample-traces",
      "name": "Sample 10% of traces",
      "trace": {
        "match": [{ "span_field": "kind", "exact": "server" }],
        "keep": { "percentage": 10 }
      }
    },
    {
      "id": "drop-health-checks",
      "name": "Drop health check spans",
      "trace": {
        "match": [{ "span_field": "name", "exact": "/health" }],
        "keep": { "percentage": 0 }
      }
    }
  ]
}

Trace sampling uses OTel-compliant consistent probability sampling. The same trace ID always produces the same sampling decision, ensuring all spans in a trace are kept or dropped together.
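
As a rough illustration of the idea (not the library's exact OTel-compliant algorithm): the decision is a pure function of the trace ID, so every span of a trace lands on the same side of the threshold. The keepSpan helper below is hypothetical.

package main

import (
    "encoding/binary"
    "fmt"
)

// keepSpan sketches consistent trace sampling: derive a deterministic value
// from the trace ID and compare it to a threshold, so the same trace ID
// always yields the same decision. Illustration only; the library follows
// the OTel consistent probability sampling spec.
func keepSpan(traceID [16]byte, keepPercentage float64) bool {
    // Use the low 56 bits of the trace ID as the randomness value.
    r := binary.BigEndian.Uint64(traceID[8:]) & ((1 << 56) - 1)
    threshold := uint64(keepPercentage / 100.0 * float64(uint64(1)<<56))
    return r < threshold
}

func main() {
    var id [16]byte
    copy(id[:], "0123456789abcdef")
    fmt.Println(keepSpan(id, 10)) // the same trace ID always prints the same result
}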

Matcher Types

Log Matchers

Field                 Description
log_field             Match on log fields: body, severity_text, trace_id, etc.
log_attribute         Match on log record attributes
resource_attribute    Match on resource attributes
scope_attribute       Match on scope attributes

Metric Matchers

Field                  Description
metric_field           Match on metric fields: name, type, unit, etc.
datapoint_attribute    Match on datapoint attributes
resource_attribute     Match on resource attributes
scope_attribute        Match on scope attributes

Trace Matchers

Field                 Description
span_field            Match on span fields: name, kind, status, etc.
span_attribute        Match on span attributes
resource_attribute    Match on resource attributes
scope_attribute       Match on scope attributes
event_name            Match on span event names
event_attribute       Match on span event attributes
link_trace_id         Match on span link trace IDs
link_attribute        Match on span link attributes

Nested Attribute Access

Attributes can be accessed using nested paths for structured data:

{
  "log_attribute": { "path": ["http", "request", "method"] },
  "exact": "POST"
}

Shorthand forms are also supported:

  • Array: "log_attribute": ["http", "request", "method"]
  • String (single key): "log_attribute": "user_id"

Match Conditions

Condition           Description
regex               Match if the field matches the regex pattern
exact               Match if the field equals the exact value
starts_with         Match if the field starts with the literal prefix
ends_with           Match if the field ends with the literal suffix
contains            Match if the field contains the literal substring
exists              Match if the field exists (true) or doesn't exist (false)
negated             Invert the match condition
case_insensitive    Make the match case-insensitive (works with all matchers)

The literal matchers (starts_with, ends_with, contains, exact) are optimized using Hyperscan and are more efficient than equivalent regex patterns.

Keep Actions

Action    Description
"all"     Keep all matching records
"none"    Drop all matching records
"N%"      Sample at N% (probabilistic)
"N/s"     Rate limit to N records per second
"N/m"     Rate limit to N records per minute

Sampling with Sample Key

For consistent sampling (the same key always produces the same decision), use sample_key:

{
  "id": "sample-by-trace",
  "name": "Sample 10% of logs by trace ID",
  "log": {
    "match": [{ "log_field": "body", "contains": "request" }],
    "keep": "10%",
    "sample_key": {
      "log_field": "trace_id"
    }
  }
}

The sample key can reference any field or attribute:

  • log_field: Use a log field (body, trace_id, span_id, etc.)
  • log_attribute: Use a log record attribute
  • resource_attribute: Use a resource attribute
  • scope_attribute: Use a scope attribute

When a sample key is configured:

  • Records with the same key value always get the same keep/drop decision
  • This ensures consistent sampling across distributed systems
  • If the sample key field is empty/missing, the record is kept by default
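
A rough sketch of the hash-based determinism described above (illustration only; keepByKey and the FNV hash choice are assumptions, not the library's exact scheme):

package main

import (
    "fmt"
    "hash/fnv"
)

// keepByKey sketches deterministic sampling with a sample key: the same key
// always hashes to the same bucket, so it always gets the same decision.
func keepByKey(key []byte, keepPercentage uint64) bool {
    if len(key) == 0 {
        return true // empty/missing sample key: keep by default, as noted above
    }
    h := fnv.New64a()
    h.Write(key)
    return h.Sum64()%100 < keepPercentage
}

func main() {
    fmt.Println(keepByKey([]byte("trace-abc-123"), 10)) // stable for this key
}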

Rate Limiting

Rate limiting allows you to cap the number of records kept per time window:

{
  "id": "rate-limit-noisy-service",
  "name": "Rate limit logs from noisy service to 100/s",
  "log": {
    "match": [
      { "resource_attribute": "service.name", "exact": "noisy-service" }
    ],
    "keep": "100/s"
  }
}

Rate limiting features:

  • Lock-free implementation: Uses atomic operations for thread-safe access without mutexes
  • Per-policy rate limiters: Each policy with rate limiting gets its own limiter
  • Automatic window reset: Windows reset inline on first request after expiry
  • Two time windows: Use /s for per-second or /m for per-minute limits

When the rate limit is exceeded, records are dropped (ResultDrop). When under the limit, records are kept (ResultKeep).
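
A minimal sketch of the mechanism, a fixed-window counter built on atomics (illustration only, not the library's code):

package main

import (
    "fmt"
    "sync/atomic"
    "time"
)

// rateLimiter sketches a lock-free fixed-window limiter: an atomic counter
// plus an atomic window start, reset inline on the first call after expiry.
type rateLimiter struct {
    limit       int64
    windowNanos int64
    count       atomic.Int64
    windowStart atomic.Int64 // unix nanos of the current window's start
}

func (rl *rateLimiter) Allow(now time.Time) bool {
    n := now.UnixNano()
    start := rl.windowStart.Load()
    if n-start >= rl.windowNanos {
        // Only one goroutine wins the CompareAndSwap and resets the counter.
        if rl.windowStart.CompareAndSwap(start, n) {
            rl.count.Store(0)
        }
    }
    return rl.count.Add(1) <= rl.limit
}

func main() {
    rl := &rateLimiter{limit: 100, windowNanos: int64(time.Second)}
    fmt.Println(rl.Allow(time.Now())) // true while under 100 in this window
}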

AND Semantics

All matchers within a single policy are AND'd together. A policy only matches when ALL of its matchers match:

{
  "match": [
    { "log_field": "body", "regex": "debug" },
    { "log_field": "body", "regex": "trace" }
  ]
}

This policy matches logs where the body contains BOTH "debug" AND "trace".

File Provider

The file provider loads policies from a JSON file and supports hot reload:

provider := policy.NewFileProvider("policies.json",
    policy.WithPollInterval(30*time.Second),  // Check every 30 seconds
    policy.WithOnReload(func() {
        log.Println("Policies reloaded")
    }),
    policy.WithOnError(func(err error) {
        log.Printf("Error: %v", err)
    }),
)
defer provider.Stop()

Statistics

The registry maintains per-policy statistics with atomic counters:

stats := registry.CollectStats()
for _, s := range stats {
    fmt.Printf("Policy %s: hits=%d drops=%d samples=%d\n",
        s.PolicyID, s.Hits, s.Drops, s.Samples)
}

TODO

Zero-Allocation Optimizations

The current implementation allocates ~16-25 objects per evaluation. To achieve zero-allocation evaluation (a generic pooling sketch follows this list):

  • Pool matchCounts slice in PolicyEngine.Evaluate()
  • Pool disqualified slice in PolicyEngine.Evaluate()
  • Pool Hyperscan scratch space per-database
  • Pool match result slices from db.Scan()
  • Use dense index arrays instead of maps for policy match tracking
  • Pre-allocate result slices in hot paths
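
A generic sketch of the slice-pooling approach with sync.Pool, pooling a pointer to avoid an extra allocation on Put; this is an illustration of the technique, not the library's planned implementation:

package main

import "sync"

// matchCountsPool reuses a per-evaluation counts slice across calls.
var matchCountsPool = sync.Pool{
    New: func() any {
        s := make([]int, 0, 64)
        return &s
    },
}

func pooledEvaluate() {
    countsPtr := matchCountsPool.Get().(*[]int)
    counts := (*countsPtr)[:0] // reuse the backing array, reset the length
    defer func() {
        *countsPtr = counts
        matchCountsPool.Put(countsPtr)
    }()

    // ... fill counts during evaluation ...
    counts = append(counts, 1)
}

func main() { pooledEvaluate() }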

Telemetry Type Support

  • Log policies (log field)
  • Metric policies (metric field)
  • Trace policies (trace field) with OTel-compliant consistent sampling

Provider Support

  • File provider with hot reload
  • HTTP provider (poll-based)
  • gRPC provider (streaming)

Additional Features

  • Nested attribute path access (e.g., http.request.method)
  • Optimized literal matchers (starts_with, ends_with, contains)
  • Case-insensitive matching via Hyperscan flags
  • Sampling with hash-based determinism via sample_key
  • Rate limiting support (N/s, N/m) with lock-free implementation
  • Transform actions (keep with modifications)
  • Policy validation CLI tool
  • Prometheus metrics exporter for stats

License

Apache 2.0 - See LICENSE for details.
