test(suite): add smoke test for docs/templates/suite_template.py #40
base: main
Conversation
Mock a simple agent and verify the FAQAdapter template pattern records a response and passes a metric
Walkthrough: Adds a new pytest smoke test that mocks an agent and an adapter, runs one `DatasetCase` through a `Scenario` via `Runner`, and asserts a single successful run for case id `faq_1` with `answer_correctness == 1.0`.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~10 minutes
Pre-merge checks: ❌ 1 failed (warning) · ✅ 4 passed
Actionable comments posted: 0
🧹 Nitpick comments (1)
tests/test_suite_template_smoke.py (1)
34-63: Smoke test covers the core contract; consider a couple of small, optional assertions

The test nicely:

- Builds a single `DatasetCase`, runs it through `Scenario`/`Runner`, and asserts exactly one run was recorded for `faq_1`.
- Verifies the run is successful and that `answer_correctness` is `1.0`, which directly guards the FAQAdapter + metric wiring.
- Also implicitly smoke-tests the import path for `AnswerCorrectnessMetric`.

Optional hardening (no need to block on this):

- If the run object exposes the adapter output (e.g., `recorded_run.output`), you could assert it equals the canned answer for a slightly stronger check.
- If trace data is exposed on the run, adding a quick assertion that the expected trace events are present (agent response + knowledge base/tool event) would lock in the trace shape the template advertises.

As-is, the test already meets the stated smoke-test objective.
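For concreteness, a minimal sketch of what those optional assertions could look like, assuming the run object exposes `output` and an iterable `trace` of event dicts (both attribute names are assumptions, not confirmed API):

```python
# Hypothetical hardening assertions; adapt the attribute names to whatever
# the Runner's run object actually exposes.
assert recorded_run.output == "This is a canned FAQ answer."
event_types = {event["type"] for event in recorded_run.trace}
assert "agent_response" in event_types
assert "knowledge_base" in event_types
```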
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
tests/test_suite_template_smoke.py (1 hunks)
🔇 Additional comments (2)
tests/test_suite_template_smoke.py (2)
5-11: MockAgent stub is clean and sufficient for the smoke test

Minimal `connect()` and `answer()` behavior keeps the fixture easy to reason about and matches the FAQ-style use case. No changes needed here.
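A minimal sketch of the stub being described, assuming the `.connect()`/`.answer()` interface named in the review (the canned answer string comes from the PR description):

```python
class MockAgent:
    """Stands in for a real FAQ agent; no network or model calls."""

    def connect(self) -> None:
        # Connecting is a no-op for the smoke test.
        pass

    def answer(self, query: str) -> str:
        # Always return the same canned response, regardless of the query.
        return "This is a canned FAQ answer."
```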
16-33: MockAdapter mirrors the adapter pattern well and captures useful traces

The adapter:

- Uses a flexible `case.query`/`case.input` fallback, which is handy for keeping the test aligned if the template changes.
- Records both the agent response and a tool/knowledge-base event into `trace`.
- Computes `success` based on the presence and exact match of `expected_output`, which is appropriate for this smoke test.

No functional issues; this is a solid minimal adapter implementation for exercising the template.
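A sketch of the adapter pattern this comment describes; `BaseAdapter`, `AdapterOutcome`, and the trace-recording calls are taken from names in this PR, but their exact signatures are assumptions:

```python
class MockAdapter(BaseAdapter):
    """Test-only adapter wiring MockAgent into the scenario run."""

    def __init__(self, agent: MockAgent) -> None:
        self.agent = agent

    def execute(self, case, trace):
        # Flexible fallback: tolerate both `case.query` and `case.input`.
        query = case.query if hasattr(case, "query") else case.input
        self.agent.connect()
        resp_text = self.agent.answer(query)
        # Record the agent response and a knowledge-base/tool event.
        trace.record("agent_response", content=resp_text)
        trace.record("knowledge_base", query=query)
        # Success requires expected_output to exist and match exactly
        # (modulo surrounding whitespace).
        expected = getattr(case, "expected_output", None)
        success = expected is not None and resp_text.strip() == expected.strip()
        return AdapterOutcome(success=success, output=resp_text)
```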
Codecov Report: ✅ All modified and coverable lines are covered by tests.
aviralgarg05 left a comment
Fix the failing workflow, then I will merge
…test Address ruff lint failures: remove unused imports and place module imports at top of file
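Illustratively, the kind of change such ruff failures call for (hypothetical names; F401 flags unused imports, E402 flags module-level imports placed after module code):

```python
# Before: triggers F401 (unused import) and E402 (import below module code).
#
#     import os          # imported but never used -> F401
#     ANSWER = "This is a canned FAQ answer."
#     import textwrap    # module-level import after code -> E402
#
# After: drop the unused import and move the used one to the top.
import textwrap

ANSWER = "This is a canned FAQ answer."
print(textwrap.shorten(ANSWER, width=20))
```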
aviralgarg05 left a comment
Still failing
Actionable comments posted: 0
🧹 Nitpick comments (1)
tests/test_suite_template_smoke.py (1)
17-18: Optional: Explicit `return None` is redundant.

Python functions return `None` implicitly, so the explicit `return None` statements are unnecessary. However, they do make the intent clear and don't harm functionality.

Apply this diff if you prefer more concise code:

```diff
-    def prepare(self) -> None:  # pragma: no cover - trivial
-        return None
+    def prepare(self) -> None:  # pragma: no cover - trivial
+        pass

-    def cleanup(self) -> None:  # pragma: no cover - trivial
-        return None
+    def cleanup(self) -> None:  # pragma: no cover - trivial
+        pass
```

Also applies to: 28-29
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
tests/test_suite_template_smoke.py (1 hunks)
🔇 Additional comments (6)
tests/test_suite_template_smoke.py (6)
1-3: LGTM! Imports are appropriate for the smoke test.

The imports cover all necessary components for building and running the test scenario.
5-10: LGTM! MockAgent is appropriately simple for a smoke test.

The mock agent correctly implements the expected interface pattern with `.connect()` and `.answer()` methods.
20-26: LGTM! The execute logic is well-implemented.

The method correctly:

- Connects to the agent and retrieves the answer
- Records trace events for observability
- Handles edge cases (None expected_output, whitespace differences)
- Returns a properly structured AdapterOutcome

The defensive `hasattr` check on line 22 is sensible if the template supports both `query` and `input` attributes across different dataset schemas.
31-44: LGTM! Test setup is clean and well-structured.

The test case construction is appropriate with matching expected output, and the scenario components are properly initialized following the agentunit pattern.
46-58: LGTM! Test execution and assertions are thorough.

The test properly:

- Runs the scenario through the Runner
- Validates the result structure
- Locates the specific test case by ID
- Asserts the run succeeded
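A sketch of that flow; the names (`RunnerConfig`, `result.runs`, `run.case_id`, `run.success`) are inferred from this review, not verified against the library:

```python
# No explicit metrics -> the Runner's defaults apply (see the next comment).
config = RunnerConfig()
result = Runner(config).run(scenario)

# Locate the single run recorded for our case and check it succeeded.
faq_runs = [run for run in result.runs if run.case_id == "faq_1"]
assert len(faq_runs) == 1, "expected exactly one recorded run for faq_1"
recorded_run = faq_runs[0]
assert recorded_run.success
```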
59-60: The test assertion is correct and will pass.

The `answer_correctness` metric is automatically computed by the `Runner` when no explicit metrics configuration is provided. When `RunnerConfig` is created without a `metrics` parameter (as in this test), `resolve_metrics(None)` returns all metrics from `DEFAULT_METRICS`, which includes `AnswerCorrectnessMetric()`.

In `AnswerCorrectnessMetric.evaluate()`, the metric compares the expected output with the actual output. Since both `case.expected_output` and the `resp_text` from `MockAdapter` are identical strings ("This is a canned FAQ answer."), the exact match condition triggers and returns `1.0`. The metric is populated in `recorded_run.metrics` by the `Runner._run_scenario()` method during execution, so the assertion will pass.

Likely an incorrect or invalid review comment.
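For intuition, an illustrative exact-match metric in the spirit of what this comment describes; this is a sketch of the behavior, not the library's actual `AnswerCorrectnessMetric` implementation:

```python
class ExactMatchCorrectness:
    """Toy stand-in: full credit only for an exact expected/actual match."""

    name = "answer_correctness"

    def evaluate(self, case, outcome) -> float:
        expected = getattr(case, "expected_output", None)
        if expected is None:
            # No ground truth to compare against.
            return 0.0
        # Exact string match (modulo whitespace) yields 1.0, otherwise 0.0.
        return 1.0 if outcome.output.strip() == expected.strip() else 0.0
```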
aviralgarg05 left a comment
Still failing

Description
Adds a minimal smoke test for the sample FAQAdapter pattern in suite_template.py. The test mocks an agent with .connect() and .answer(), inlines a small dataset via DatasetSource.from_list, implements a MockAdapter (test-only) that conforms to BaseAdapter and returns an AdapterOutcome, and asserts that the scenario run is recorded and the answer_correctness metric is 1.0 for an exact match.
Fixes #7
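A condensed sketch of the wiring this description lays out, reusing the mock classes sketched in the review above; the import path, keyword arguments, and sample query are assumptions about the agentunit API, not verbatim from the test:

```python
from agentunit import DatasetSource, Runner, Scenario  # assumed import path

# Inline a one-case dataset, as the description says the test does.
dataset = DatasetSource.from_list([
    {
        "id": "faq_1",
        "query": "What is your refund policy?",  # hypothetical query
        "expected_output": "This is a canned FAQ answer.",
    }
])

scenario = Scenario(
    name="faq-template-smoke",
    adapter=MockAdapter(MockAgent()),
    dataset=dataset,
)

# Run once and assert the recorded run passed with a perfect metric score.
result = Runner().run(scenario)
run = next(r for r in result.runs if r.case_id == "faq_1")
assert run.success
assert run.metrics["answer_correctness"] == 1.0
```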
Type of Change
Please delete options that are not relevant:
Changes Made
Testing
Please describe the tests you ran to verify your changes:
Test Configuration
Test Results
Code Quality
- Ran `pre-commit run --all-files` and addressed any issues

Documentation
Breaking Changes
If this PR introduces breaking changes, please describe:
Dependencies
New dependencies:
- `package-name`: Reason for adding

Performance Impact
Describe any performance implications:
Additional Context
Add any other context about the PR here:
Checklist
Reviewer Notes
Please pay special attention to:
Post-Merge Tasks
Tasks to complete after merging (if any):