Add integration tests for agent with guardrails #378

apetraru-uipath · 2025-12-23T19:04:49Z

Integration Tests for Agent Guardrails

Overview

Adds comprehensive integration tests for agent guardrails, verifying proper invocation and enforcement at Agent, LLM, and Tool scopes.

Guardrails Tested

Built-in Validators

PII Detection (Agent, LLM & Tool scopes) - Detects Email, Address, Person with 0.5 threshold
Prompt Injection (LLM scope) - Detects malicious prompts with 0.5 threshold

Custom Deterministic Guardrails

Filter Action - Removes input_phrase field when containing "donkey"
Block Action - Blocks tool execution when input contains "forbidden"
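
To make the two custom deterministic rules above concrete, here is a minimal standalone sketch of the behaviour the tests are meant to pin down. It does not use the guardrails API from this PR: `filter_field_if_contains`, `block_if_contains`, and `GuardrailBlocked` are hypothetical names, and the case-sensitive substring match is an assumption.

```python
from typing import Any


class GuardrailBlocked(Exception):
    """Hypothetical error raised when a block rule rejects the input."""


def filter_field_if_contains(
    payload: dict[str, Any], field: str, needle: str
) -> dict[str, Any]:
    """Return a copy of payload without `field` if its value contains `needle`."""
    value = payload.get(field)
    if isinstance(value, str) and needle in value:
        return {k: v for k, v in payload.items() if k != field}
    return dict(payload)


def block_if_contains(payload: dict[str, Any], needle: str) -> None:
    """Raise GuardrailBlocked if any string value in payload contains `needle`."""
    for value in payload.values():
        if isinstance(value, str) and needle in value:
            raise GuardrailBlocked(f"input contains {needle!r}")
```

With these helpers, a payload whose `input_phrase` mentions "donkey" loses that field, and any payload containing "forbidden" raises before the tool would run.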

Actions Tested

Block Action - Stops execution at Agent, LLM, or Tool scope
Filter Action - Removes fields from outputs
Escalate Action (HITL) - Triggers human approval with both approval and rejection flows

valentinabojan · 2025-12-29T12:28:16Z

tests/cli/conftest.py

+def mock_guardrails_service():
+    """Mock the guardrails service to avoid HTTP errors in tests."""
+
+    class MockGuardrailValidationResult(BaseModel):


Why do we need this new class? Can't we use the GuardrailValidationResult class?

Agreed, I will delete this mock class.

valentinabojan · 2025-12-29T12:32:19Z

tests/cli/mocks/joke_agent_uipath.json

+        "enabledForEvals": true,
+        "selector": {
+          "scopes": ["Tool"],
+          "matchNames": ["Agent _ Sentence Analyzer"]


Is this the correct tool name, with spaces?

I've downloaded the uis archive and in agent.json the tool names are always with spaces:

We can double-check. I know I had to change the agent.json input files to use the tools' original names.

valentinabojan · 2025-12-29T12:34:39Z

tests/cli/mocks/joke_agent_with_guardrails.py

+)
+
+# Block Guardrail for Tool - blocks if input matches forbidden pattern
+# Note: Using CONTAINS operator as MATCHES_REGEX has issues with the current implementation


What is this note about? What exactly is not working with MATCHES_REGEX?

It was an issue with regex matching; it was probably still under development when I added the tests.
I will add a test specifically for regex to check that it works as expected.
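
A regex-specific test along the lines mentioned above might look like the sketch below. It is standalone and does not call the real guardrails code: `evaluate_rule` is a hypothetical stand-in, and treating MATCHES_REGEX as `re.search` (rather than a full match) is an assumption about the operator's semantics.

```python
import re


def evaluate_rule(operator: str, pattern: str, value: str) -> bool:
    """Hypothetical stand-in for a deterministic rule evaluator."""
    if operator == "CONTAINS":
        return pattern in value
    if operator == "MATCHES_REGEX":
        # Assumption: the operator matches anywhere in the value.
        return re.search(pattern, value) is not None
    raise ValueError(f"unknown operator: {operator}")


# (operator, pattern, input value, expected result)
CASES = [
    ("CONTAINS", "forbidden", "this is forbidden", True),
    ("CONTAINS", "forbidden", "this is fine", False),
    ("MATCHES_REGEX", r"\bforbidden\b", "this is forbidden", True),
    # Word boundaries make the regex stricter than a substring check.
    ("MATCHES_REGEX", r"\bforbidden\b", "unforbiddenly", False),
    ("MATCHES_REGEX", r"forb.dden", "this is forbidden", True),
]


def run_cases() -> list[bool]:
    """Return one pass/fail flag per case."""
    return [evaluate_rule(op, pat, val) == exp for op, pat, val, exp in CASES]
```

The `unforbiddenly` case is the interesting one: CONTAINS would match it, while the word-boundary regex does not, so it distinguishes the two operators.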

valentinabojan · 2025-12-29T12:36:20Z

tests/cli/mocks/joke_agent_with_guardrails.py

+guardrails = build_guardrails_with_actions(
+    [
+        custom_filter_guardrail,
+        pii_detection_guardrail,


Can we give them more explicit names that include the scope and action? Like block_agent_pii_detection_guardrail and so on...

I did this. Moreover, I changed how the agent is constructed for each test case, so that only one guardrail is configured for the agent per test.

valentinabojan · 2025-12-29T12:37:15Z

tests/cli/mocks/joke_agent_with_guardrails.py

+    ]
+
+
+# Define guardrails programmatically (matching the uipath.json configuration)


Do we need to pass guardrails in the joke_agent_uipath.json file? I see that they are not in sync anyway so it looks like it doesn't matter to have them there as well?

I've changed the approach: joke_agent_uipath.json is now a template, and each test selects the guardrails it needs.
I prefer to keep that agent template configuration in a file instead of inlining it into every test case.
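
As a rough illustration of the template approach described above, the sketch below loads a template and injects a single guardrail per test. The template content, the top-level `guardrails` key, the `action` field, and the `build_agent_config` helper are all assumptions made for this example; only `enabledForEvals`, `selector`, `scopes`, and `matchNames` appear in the diff.

```python
import copy
import json
from typing import Any

# Hypothetical stand-in for the contents of joke_agent_uipath.json.
TEMPLATE_JSON = """
{
  "name": "joke_agent",
  "guardrails": []
}
"""

# Hypothetical guardrail definitions, keyed by an explicit scope/action name.
GUARDRAILS: dict[str, dict[str, Any]] = {
    "block_tool_custom_guardrail": {
        "action": "block",
        "enabledForEvals": True,
        "selector": {"scopes": ["Tool"], "matchNames": ["Agent _ Sentence Analyzer"]},
    },
    "filter_tool_custom_guardrail": {
        "action": "filter",
        "enabledForEvals": True,
        "selector": {"scopes": ["Tool"], "matchNames": ["Agent _ Sentence Analyzer"]},
    },
}


def build_agent_config(template_text: str, guardrail_name: str) -> dict[str, Any]:
    """Parse the template and attach exactly one guardrail to it."""
    config = json.loads(template_text)
    config["guardrails"] = [copy.deepcopy(GUARDRAILS[guardrail_name])]
    return config
```

Each test would then call `build_agent_config(TEMPLATE_JSON, "block_tool_custom_guardrail")` (or another name), so a failure points at one guardrail only.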

valentinabojan · 2025-12-29T12:38:54Z

tests/cli/test_agent_with_guardrails.py

+                # Setup files
+                with open("joke_agent_with_guardrails.py", "w", encoding="utf-8") as f:
+                    f.write(joke_agent_script)
+                with open("uipath.json", "w", encoding="utf-8") as f:


Which file do we use exactly here? "uipath.json"?

Good catch, we only use joke_agent_with_guardrails.py and langgraph.json. I will update the setup.

valentinabojan · 2025-12-29T12:43:27Z

tests/cli/test_agent_with_guardrails.py

+
+                    if not has_tool_message:
+                        # First call: return tool call WITH "donkey" in the sentence
+                        return AIMessage(


I don't really understand the real value of these tests, since we mock the model responses. How will these tests protect us if the model response changes?

The scope of these tests (in this first version) is to protect us against internal changes (for example, someone updating the guardrails code). My initial intention was to mock all external interactions.

I can check (in another PR) whether I can use a real LLM, but I see those as tests that check whether external tools behave as we require.
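
For context, the mocking strategy discussed here boils down to a scripted model: the first call returns a tool call carrying the trigger word, and the call after the tool result returns a final answer. The sketch below shows that pattern with plain dataclasses instead of the real message types; `FakeMessage` and `scripted_model` are hypothetical names, not part of the PR.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class FakeMessage:
    """Hypothetical minimal message type used only in this sketch."""
    role: str
    content: str = ""
    tool_calls: list[dict[str, Any]] = field(default_factory=list)


def scripted_model(messages: list[FakeMessage]) -> FakeMessage:
    """Return a deterministic reply based on the conversation so far."""
    has_tool_message = any(m.role == "tool" for m in messages)
    if not has_tool_message:
        # First call: request the tool WITH "donkey" in the sentence,
        # so a filter guardrail on the tool input has something to remove.
        return FakeMessage(
            role="ai",
            tool_calls=[{
                "name": "Agent _ Sentence Analyzer",
                "args": {"input_phrase": "a donkey walks into a bar"},
            }],
        )
    # Second call: the tool has run, so produce the final answer.
    return FakeMessage(role="ai", content="done")
```

Because the model output is fixed, any change in the observed tool input or final result can only come from the guardrails code, which is the regression these tests target.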

Add comprehensive integration tests for guardrails at different scopes:
- Agent-level guardrails (PII detection)
- LLM-level guardrails (Prompt injection)
- Tool-level guardrails (Filter, Block, and PII detection)

Tests verify that guardrails are properly invoked and block/filter as expected.

apetraru-uipath force-pushed the chore/tests_for_guardrails branch 6 times, most recently from 3876794 to f260a25 Compare December 29, 2025 08:26

valentinabojan reviewed Dec 29, 2025

apetraru-uipath force-pushed the chore/tests_for_guardrails branch from f260a25 to 3ceef2c Compare December 29, 2025 14:39

apetraru-uipath force-pushed the chore/tests_for_guardrails branch from 3ceef2c to 052c5cf Compare December 29, 2025 14:47

2 participants
