feat(scanner): remove automatic WFP file generation during scan #175

isasmendiagus · 2026-01-02T11:35:04Z

Summary

Scan command no longer generates scanner_output.wfp file
Removed --no-wfp-output flag (no longer needed)

Tested scenarios:

Scan folder produces no WFP file
Fingerprint command still works (-o for file output)
Scan existing WFP with -w still works
--no-wfp-output flag properly removed (gives error if used)

Summary by CodeRabbit

Changed
- Scan command no longer generates scanner output (WFP) files during operation
- Removed --no-wfp-output command-line flag (no longer needed)
Documentation
- Updated docs and changelog to reflect removal of the WFP option and to publish version 1.43.0 release notes

_{✏️ Tip: You can customize this high-level summary in your review settings.}

…uring scan

coderabbitai · 2026-01-02T11:35:13Z

📝 Walkthrough

Walkthrough

Version bumped to 1.43.0; the --no-wfp-output CLI flag and implicit WFP output behavior were removed. Scanner API and CLI flow were refactored so WFP handling requires explicit file path parameters.

Changes

Cohort / File(s)	Summary
Version & Docs `CHANGELOG.md`, `docs/source/index.rst`, `src/scanoss/__init__.py`	Bumped version to `1.43.0`; release notes added; documentation entry for `--no-wfp-output` removed.
CLI `src/scanoss/cli.py`	Removed `--no-wfp-output` argument and pre-scan directory writability check; Scanner call no longer receives `no_wfp_file`.
Scanner core `src/scanoss/scanner.py`	Removed constructor `wfp` parameter and internal `self.wfp`; renamed/updated methods to require explicit `wfp_file` args (`scan_wfp_with_options`, `scan_wfp_file_threaded`), removed `scan_wfp_file`, and eliminated implicit WFP generation in non-threaded paths.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant User
  participant CLI as CLI/parser
  participant Scanner
  participant FS as FileSystem

  rect rgb(240,248,255)
    Note right of CLI: New flow (explicit WFP)
  end

  User->>CLI: run `scan` (with optional --wfp-file)
  CLI->>Scanner: invoke scan(..., wfp_file?) 
  alt wfp_file provided
    Scanner->>FS: open/write/read specified wfp_file
    FS-->>Scanner: wfp_file content / write ack
    Scanner-->>CLI: scan results (WFP-included)
  else no wfp_file
    Scanner->>FS: perform scan without writing WFP
    Scanner-->>CLI: scan results (no WFP)
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

Feature/mscasso/sp 3430 add line headers filter #172: Modifies Scanner constructor usage and CLI plumbing—overlaps with this PR's Scanner/CLI changes.
chore:upgrades version to v1.39.0 #159: Updates __version__ in src/scanoss/__init__.py, overlapping version bump work.

Suggested reviewers

eeisegn
matiasdaloia

Poem

🐰 A hop and a nibble, the WFP is named,
No secret file left by default to be framed.
Scanner asks clearly, "Where should I write?"
Now paths are explicit, and everything's bright. 🥕

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: removing automatic WFP file generation during scan operations, which aligns with the comprehensive changes across CHANGELOG, documentation, CLI, and Scanner modules.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

✨ Finishing touches

📝 Generate docstrings

📜 Recent review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a1a8422 and 8d06bac.

📒 Files selected for processing (1)

src/scanoss/scanner.py

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build

🔇 Additional comments (2)

src/scanoss/scanner.py (2)

775-829: LGTM! Required parameter makes intent explicit.

The signature change from optional to required parameter eliminates the default value, making it clear that a WFP file path is mandatory. This aligns with the PR objective to remove implicit WFP handling. The parameter renaming to wfp_file improves consistency with scan_wfp_with_options, and the call site at line 768 correctly passes the required argument.

745-773: LGTM! Parameter renaming improves clarity.

The parameter rename from wfp to wfp_file makes the intent explicit and aligns with the PR objective to require explicit WFP file paths. The updated docstrings and error messages are consistent and clear.

All call sites already use positional arguments (e.g., src/scanoss/cli.py:1606), so the rename is fully compatible with existing code.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-01-02T11:35:50Z

SCANOSS SCAN Completed 🚀

Detected components: 1
Undeclared components: 0
Declared components: 1
Detected files: 103
Detected files undeclared: 0
Detected files declared: 103
Licenses detected: 1
Licenses detected with copyleft: 0
Policies: ✅ 1 pass (1 total)

View more details on SCANOSS Action Summary

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (2)

CHANGELOG.md (1)
13-14: Optional: Consider adjusting list indentation.

The linter suggests zero indentation for list items. While the current 2-space indentation under section headers is a common pattern in changelogs, adjusting to zero indentation would satisfy the markdownlint rule.
🔎 Proposed adjustment
 ## [1.43.0] - 2026-01-02
 ### Changed
-  - Scan command no longer generates `scanner_output.wfp` file
-  - Removed `--no-wfp-output` flag (no longer needed)
+- Scan command no longer generates `scanner_output.wfp` file
+- Removed `--no-wfp-output` flag (no longer needed)
src/scanoss/scanner.py (1)

775-829: Consider refactoring to reduce cyclomatic complexity.

The pipeline reports this method exceeds the complexity threshold (13 branches vs. 12 max). While the parameter rename in this PR doesn't introduce new complexity, consider extracting portions of the WFP parsing logic into helper methods to improve maintainability.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 1e0f941 and a1a8422.

📒 Files selected for processing (5)

CHANGELOG.md
docs/source/index.rst
src/scanoss/__init__.py
src/scanoss/cli.py
src/scanoss/scanner.py

💤 Files with no reviewable changes (2)

src/scanoss/cli.py
docs/source/index.rst

🧰 Additional context used

🪛 GitHub Actions: Lint

src/scanoss/scanner.py

[error] 775-775: PLR0912 Too many branches (13 > 12).

🪛 markdownlint-cli2 (0.18.1)

CHANGELOG.md

13-13: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)

14-14: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build

🔇 Additional comments (4)

CHANGELOG.md (1)

11-14: LGTM! Changelog entries are clear and accurate.

The changelog correctly documents the breaking changes for version 1.43.0, aligning with the PR objectives.

src/scanoss/__init__.py (1)

25-25: LGTM! Version bump correctly reflects the release.

The version update to 1.43.0 aligns with the changelog and breaking changes introduced in this PR.

src/scanoss/scanner.py (2)

745-773: LGTM! Parameter rename improves API clarity.

The rename from wfp to wfp_file makes it explicit that this parameter expects a file path rather than WFP content. Error messages have been consistently updated.

775-775: LGTM! Parameter changes enforce explicit WFP file specification.

The rename to wfp_file and removal of the optional default value improve API consistency and make the requirement for a WFP file path explicit.

Added # noqa: PLR0912 to bypass the linter's branch limit (13 > 12). The extra branches come from separate validation checks that provide clearer error messages to users. Combining them would reduce clarity.

github-actions · 2026-01-02T11:45:25Z

SCANOSS SCAN Completed 🚀

Detected components: 1
Undeclared components: 0
Declared components: 1
Detected files: 103
Detected files undeclared: 0
Detected files declared: 103
Licenses detected: 1
Licenses detected with copyleft: 0
Policies: ✅ 1 pass (1 total)

View more details on SCANOSS Action Summary

isasmendiagus · 2026-01-05T07:42:05Z

src/scanoss/cli.py

-    if not os.access(os.getcwd(), os.W_OK):  # Make sure the current directory is writable. If not disable saving WFP
-        print_stderr(f'Warning: Current directory is not writable: {os.getcwd()}')
-        args.no_wfp_output = True


Removed because the CLI no longer writes to the scan folder

isasmendiagus · 2026-01-05T07:42:50Z

src/scanoss/scanner.py

-    def scan_wfp_file(self, file: str = None) -> bool:  # noqa: PLR0912, PLR0915
-        """
-        Scan the contents of the specified WFP file (in the current process)
-        :param file: Scan the contents of the specified WFP file (in the current process)
-        :return: True if successful, False otherwise
-        """
-        success = True
-        wfp_file = file if file else self.wfp  # If a WFP file is specified, use it, otherwise us the default
-        if not os.path.exists(wfp_file) or not os.path.isfile(wfp_file):
-            raise Exception(f'ERROR: Specified WFP file does not exist or is not a file: {wfp_file}')
-        file_count = Scanner.__count_files_in_wfp_file(wfp_file)
-        cur_files = 0
-        cur_size = 0
-        batch_files = 0
-        wfp = ''
-        max_component = {'name': '', 'hits': 0}
-        components = {}
-        self.print_debug(f'Found {file_count} files to process.')
-        raw_output = '{\n'
-        file_print = ''
-        bar_ctx = Bar('Scanning', max=file_count) if (not self.quiet and self.isatty) else nullcontext()
-
-        with bar_ctx as bar:
-            if bar:
-                bar.next(0)
-            with open(wfp_file) as f:
-                for line in f:
-                    if line.startswith(WFP_FILE_START):
-                        if file_print:
-                            wfp += file_print  # Store the WFP for the current file
-                            cur_size = len(wfp.encode('utf-8'))
-                        file_print = line  # Start storing the next file
-                        cur_files += 1
-                        batch_files += 1
-                    else:
-                        file_print += line  # Store the rest of the WFP for this file
-                    l_size = cur_size + len(file_print.encode('utf-8'))
-                    # Hit the max post size, so sending the current batch and continue processing
-                    if l_size >= self.max_post_size and wfp:
-                        self.print_debug(
-                            f'Sending {batch_files} ({cur_files}) of'
-                            f' {file_count} ({len(wfp.encode("utf-8"))} bytes) files to the ScanOSS API.'
-                        )
-                        if self.debug and cur_size > self.max_post_size:
-                            Scanner.print_stderr(
-                                f'Warning: Post size {cur_size} greater than limit {self.max_post_size}'
-                            )
-                        scan_resp = self.scanoss_api.scan(wfp, max_component['name'])  # Scan current WFP and store
-                        if bar:
-                            bar.next(batch_files)
-                        if scan_resp is not None:
-                            for key, value in scan_resp.items():
-                                raw_output += '  "%s":%s,' % (key, json.dumps(value, indent=2))
-                                for v in value:
-                                    if hasattr(v, 'get'):
-                                        if v.get('id') != 'none':
-                                            vcv = '%s:%s:%s' % (v.get('vendor'), v.get('component'), v.get('version'))
-                                            components[vcv] = components[vcv] + 1 if vcv in components else 1
-                                            if max_component['hits'] < components[vcv]:
-                                                max_component['name'] = v.get('component')
-                                                max_component['hits'] = components[vcv]
-                                    else:
-                                        Scanner.print_stderr(f'Warning: Unknown value: {v}')
-                        else:
-                            success = False
-                        batch_files = 0
-                        wfp = ''
-            if file_print:
-                wfp += file_print  # Store the WFP for the current file
-            if wfp:
-                self.print_debug(
-                    f'Sending {batch_files} ({cur_files}) of'
-                    f' {file_count} ({len(wfp.encode("utf-8"))} bytes) files to the ScanOSS API.'
-                )
-                scan_resp = self.scanoss_api.scan(wfp, max_component['name'])  # Scan current WFP and store
-                if bar:
-                    bar.next(batch_files)
-                first = True
-                if scan_resp is not None:
-                    for key, value in scan_resp.items():
-                        if first:
-                            raw_output += '  "%s":%s' % (key, json.dumps(value, indent=2))
-                            first = False
-                        else:
-                            raw_output += ',\n  "%s":%s' % (key, json.dumps(value, indent=2))
-                else:
-                    success = False
-        raw_output += '\n}'
-        if self.output_format == 'plain':
-            self.__log_result(raw_output)
-        elif self.output_format == 'cyclonedx':
-            cdx = CycloneDx(self.debug, self.scan_output)
-            cdx.produce_from_str(raw_output)
-        elif self.output_format == 'spdxlite':
-            spdxlite = SpdxLite(self.debug, self.scan_output)
-            success = spdxlite.produce_from_str(raw_output)
-        elif self.output_format == 'csv':
-            csvo = CsvOutput(self.debug, self.scan_output)
-            csvo.produce_from_str(raw_output)
-        else:
-            self.print_stderr(f'ERROR: Unknown output format: {self.output_format}')
-            success = False
-
-        return success
-


Dead code, CLI always uses scan_wfp_file_threaded

matiasdaloia

LGTM

isasmendiagus added 4 commits January 2, 2026 11:56

refactor(scanner): remove dead code scan_wfp_file() never called

5076f24

feat(scanner)!: remove --no-wfp-output flag and auto WFP generation d…

b721ec3

…uring scan

docs(cli): remove --no-wfp-output flag from documentation

ca39a6a

chore(release): prepare for the 1.43.0 release

a1a8422

isasmendiagus requested review from agustingroh and eeisegn January 2, 2026 11:35

isasmendiagus changed the title ~~Feat/sp 3880/remove wfp output~~ feat(scanner): remove automatic WFP file generation during scan Jan 2, 2026

coderabbitai bot reviewed Jan 2, 2026

View reviewed changes

fix(lint): suppress "too many branches" in scan_wfp_file_threaded

8d06bac

Added # noqa: PLR0912 to bypass the linter's branch limit (13 > 12). The extra branches come from separate validation checks that provide clearer error messages to users. Combining them would reduce clarity.

isasmendiagus commented Jan 5, 2026

View reviewed changes

matiasdaloia approved these changes Jan 5, 2026

View reviewed changes

isasmendiagus merged commit f953395 into main Jan 5, 2026
6 checks passed

isasmendiagus deleted the feat/SP-3880/remove-wfp-output branch January 5, 2026 08:07

coderabbitai bot mentioned this pull request Jan 5, 2026

chore(cli): deprecate --no-wfp-output flag for backwards compatibility #176

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(scanner): remove automatic WFP file generation during scan #175

feat(scanner): remove automatic WFP file generation during scan #175

Uh oh!

isasmendiagus commented Jan 2, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jan 2, 2026 •

edited

Loading

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

github-actions bot commented Jan 2, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

github-actions bot commented Jan 2, 2026

Uh oh!

isasmendiagus Jan 5, 2026

Uh oh!

isasmendiagus Jan 5, 2026 •

edited

Loading

Uh oh!

matiasdaloia left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(scanner): remove automatic WFP file generation during scan #175

feat(scanner): remove automatic WFP file generation during scan #175

Uh oh!

Conversation

isasmendiagus commented Jan 2, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tested scenarios:

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

github-actions bot commented Jan 2, 2026

SCANOSS SCAN Completed 🚀

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jan 2, 2026

SCANOSS SCAN Completed 🚀

Uh oh!

isasmendiagus Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

isasmendiagus Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matiasdaloia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

isasmendiagus commented Jan 2, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 2, 2026 •

edited

Loading

isasmendiagus Jan 5, 2026 •

edited

Loading