Open
Labels
P1: Should have, good first issue, helps: rapids, topic: performance, type: improvement
Description
The Programmatic Dependent Launch (PDL) mechanism allows for a dependent secondary kernel to launch before the primary kernel it depends on in the same CUDA stream has finished executing. Available starting with devices of compute capability 9.0, this technique can provide performance benefits when the secondary kernel can complete significant work that does not depend on the results of the primary kernel.
All cuco kernels should support PDL.
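For reference, here is a minimal sketch of what PDL support could look like, using the CUDA runtime's `cudaLaunchKernelEx` with the `cudaLaunchAttributeProgrammaticStreamSerialization` attribute on the host side, and `cudaTriggerProgrammaticLaunchCompletion()` / `cudaGridDependencySynchronize()` on the device side. The kernels `primary_kernel` and `secondary_kernel` are hypothetical stand-ins, not existing cuco kernels, and the runtime compute-capability check is an assumption about how a fallback for older devices might be handled.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical primary kernel (not a cuco kernel): writes its results,
// then signals that a dependent launch may begin.
__global__ void primary_kernel(int* out, int n)
{
  int const idx = blockIdx.x * blockDim.x + threadIdx.x;
  if (idx < n) { out[idx] = idx; }
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 900
  // Allows the secondary kernel to start launching early.
  cudaTriggerProgrammaticLaunchCompletion();
#endif
}

// Hypothetical secondary kernel: does independent setup first, then waits
// for the primary kernel's results before touching them.
__global__ void secondary_kernel(int const* in, int* out, int n)
{
  // Independent work: nothing here reads the primary kernel's output.
  int const idx = blockIdx.x * blockDim.x + threadIdx.x;
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 900
  // Blocks until the primary grid has completed and its writes are visible.
  cudaGridDependencySynchronize();
#endif
  // Dependent work.
  if (idx < n) { out[idx] = in[idx] * 2; }
}

int main()
{
  int const n = 1 << 20;
  int const block = 256;
  int const grid = (n + block - 1) / block;

  int *a = nullptr, *b = nullptr;
  cudaMalloc(&a, n * sizeof(int));
  cudaMalloc(&b, n * sizeof(int));
  cudaStream_t stream;
  cudaStreamCreate(&stream);

  // Assumption: only request PDL on devices of compute capability 9.0+.
  cudaDeviceProp prop{};
  cudaGetDeviceProperties(&prop, 0);
  bool const use_pdl = prop.major >= 9;

  cudaLaunchAttribute attr[1];
  attr[0].id = cudaLaunchAttributeProgrammaticStreamSerialization;
  attr[0].val.programmaticStreamSerializationAllowed = 1;

  cudaLaunchConfig_t config = {};
  config.gridDim          = dim3(grid);
  config.blockDim         = dim3(block);
  config.dynamicSmemBytes = 0;
  config.stream           = stream;
  config.attrs            = attr;
  config.numAttrs         = use_pdl ? 1 : 0;  // plain stream-ordered launch otherwise

  primary_kernel<<<grid, block, 0, stream>>>(a, n);
  cudaLaunchKernelEx(&config, secondary_kernel, static_cast<int const*>(a), b, n);

  std::vector<int> host(n);
  cudaMemcpyAsync(host.data(), b, n * sizeof(int), cudaMemcpyDeviceToHost, stream);
  cudaError_t const status = cudaStreamSynchronize(stream);

  bool ok = (status == cudaSuccess);
  for (int i = 0; ok && i < n; ++i) { ok = (host[i] == 2 * i); }
  std::printf("%s\n", ok ? "results match" : "mismatch or CUDA error");

  cudaStreamDestroy(stream);
  cudaFree(a);
  cudaFree(b);
  return ok ? 0 : 1;
}
```

To actually exercise the PDL path, the device-side calls need to be compiled in, for example with `nvcc -arch=sm_90`. In cuco, the equivalent change would likely involve adding the device-side synchronization to each kernel and routing launches through a helper that sets the launch attribute, though the exact design is left open here.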