[AURON #1746] Introduce NativeTakeOrderedAndProjectExec to fuse TakeOrdered + Project #1747

yew1eb · 2025-12-12T09:49:29Z

Which issue does this PR close?

Closes #1746

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

How was this patch tested?

… TakeOrdered + Project

Copilot

Pull request overview

This PR introduces NativeTakeOrderedAndProjectExec to fuse the TakeOrdered and Project operations into a single native operator, improving performance by eliminating an intermediate operator. Previously, TakeOrderedAndProjectExec was converted to separate NativeTakeOrderedExec and NativeProjectExec operators, but now they're combined into a single fused operator.

Key changes:

Renamed base classes from NativeTakeOrderedBase to NativeTakeOrderedAndProjectBase and added projectList parameter to support projection
Updated the converter to pass projectList directly to the native operator instead of wrapping with a separate ProjectExec
Modified both executeCollect() and doExecuteNative() methods to apply projection when needed

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
`NativeTakeOrderedAndProjectBase.scala`	Renamed base classes and added projection support; applies projection in both executeCollect and doExecuteNative paths
`Shims.scala`	Updated interface method signatures to include projectList parameter and return renamed types
`AuronConverters.scala`	Simplified conversion logic by passing projectList directly instead of creating separate ProjectExec
`NativeTakeOrderedAndProjectExec.scala`	Updated concrete implementation with new projectList parameter
`NativePartialTakeOrderedExec.scala`	Updated to extend renamed base class
`ShimsImpl.scala`	Updated implementation to match new interface signatures
`AuronExecSuite.scala`	Added test coverage for both executeCollect and doExecuteNative paths

Comments suppressed due to low confidence (3)

spark-extension/src/main/scala/org/apache/spark/sql/execution/auron/plan/NativeTakeOrderedAndProjectBase.scala:170

The class name NativePartialTakeOrderedAndProjectBase is misleading because this partial execution operator does not actually apply projection. The projection only happens in the final stage (in NativeTakeOrderedAndProjectBase). Consider keeping the original name NativePartialTakeOrderedBase or clarifying that this is just a partial step of the TakeOrderedAndProject operation.
spark-extension/src/main/scala/org/apache/spark/sql/execution/auron/plan/NativeTakeOrderedAndProjectBase.scala:218
The friendlyName "PartialTakeOrderedAndProject" is misleading because this partial execution step does not apply projection. The projection only happens in the final stage. Consider using "PartialTakeOrdered" to better reflect what this stage actually does.
spark-extension/src/main/scala/org/apache/spark/sql/execution/auron/plan/NativeTakeOrderedAndProjectBase.scala:131
The early return at line 131 does not apply the projection. When partial.outputPartitioning.numPartitions <= 1, the method returns the partial result directly without applying the project transformation. This means that if projectList != child.output, the returned data will have incorrect columns. The early return should also handle projection similar to how it's done later in the method (lines 158-164).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

spark-extension-shims-spark/src/test/scala/org.apache.auron/AuronExecSuite.scala

yew1eb · 2025-12-16T10:51:30Z

cc @richox

richox · 2025-12-26T07:31:36Z

...c/main/scala/org/apache/spark/sql/execution/auron/plan/NativeTakeOrderedAndProjectBase.scala

    // take top-K from the final partition
    new NativeRDD(
      sparkContext,
      metrics = SparkMetricNode(metrics, shuffledRDD.metrics :: Nil),


MetricNode tree needs to be consistent with native plans, in this case there are two nested native plans but only one MetricNode, all the metrics in this plan will be updated to the wrong place.

github-actions bot added the spark label Dec 12, 2025

[AURON apache#1746] Introduce NativeTakeOrderedAndProjectExec to fuse…

1b1eb04

… TakeOrdered + Project

yew1eb force-pushed the refactor_takeOrderedAndProject branch from 0589ddf to 1b1eb04 Compare December 12, 2025 10:01

cxzl25 requested a review from Copilot December 12, 2025 10:55

Copilot started reviewing on behalf of cxzl25 December 12, 2025 10:56 View session

Copilot AI reviewed Dec 12, 2025

View reviewed changes

spark-extension-shims-spark/src/test/scala/org.apache.auron/AuronExecSuite.scala Show resolved Hide resolved

richox reviewed Dec 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AURON #1746] Introduce NativeTakeOrderedAndProjectExec to fuse TakeOrdered + Project #1747

[AURON #1746] Introduce NativeTakeOrderedAndProjectExec to fuse TakeOrdered + Project #1747

yew1eb commented Dec 12, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

yew1eb commented Dec 16, 2025

Uh oh!

richox Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[AURON #1746] Introduce NativeTakeOrderedAndProjectExec to fuse TakeOrdered + Project #1747

Are you sure you want to change the base?

[AURON #1746] Introduce NativeTakeOrderedAndProjectExec to fuse TakeOrdered + Project #1747

Conversation

yew1eb commented Dec 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

How was this patch tested?

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

yew1eb commented Dec 16, 2025

Uh oh!

richox Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yew1eb commented Dec 12, 2025 •

edited

Loading