ptx: Prevent Context use-after-free in finalizers #113
Description
See the Note in `accelerate-llvm-ptx/src/Data/Array/Accelerate/LLVM/PTX/Context.hs`.
How has this been tested?
This is somewhat tricky to test, as one needs to let GC run after the PTX context has no users any more. This was my test file:
```haskell
{-# LANGUAGE OverloadedStrings #-}
module Main where

import Control.Concurrent (threadDelay)
import Control.Monad
import System.IO (hFlush, stdout)

import qualified Data.Array.Accelerate as A
import qualified Data.Array.Accelerate.Debug.Internal as A
import qualified Data.Array.Accelerate.LLVM.PTX as GPU

main :: IO ()
main = do
  print $ GPU.run $ A.sum (A.generate (A.I1 10000) (\(A.I1 i) -> A.toFloating i :: A.Exp Float))
  forM_ [1..5] $ \_ -> do
    threadDelay 1000000
    putChar '*' >> hFlush stdout
  A.traceM A.verbose "done"
```

Furthermore, I added additional debug prints in the finalizers of arrays and modules — as far as I can tell, these are the only places where a finalizer uses a CUDA context. These manual prints were necessary because simply passing `+ACC -ddump-gc` made the problem disappear, seemingly because more things were retained somehow.

The program above reliably fails on my machine (CUDA 12) and on Jizo (CUDA 13) before this PR, and reliably succeeds after it. Furthermore, my debug prints indicate that finalization order is indeed nondeterministic between the 1 module, 2 arrays and 1 context allocated in the above program — I've observed every possible order (apart from the two arrays, which I didn't bother to distinguish in the output). The STM-based synchronisation introduced in this PR seems to properly ensure that resources are explicitly freed only if the `Context` isn't already destroyed.

No automated test was added because this is tricky to do in an automated setting where the context is retained over invocations; creating a new context for the test would be possible. Do we want that?
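For context, here is a minimal sketch of the kind of STM guard this synchronisation relies on. This is an illustration only, not the code added in this PR, and all names (`ContextGuard`, `finalizeResource`, `finalizeContext`, ...) are hypothetical:

```haskell
import Control.Concurrent.STM
import Control.Exception (finally)
import Control.Monad (when)

-- Shared state coordinating the Context finalizer with the finalizers of
-- resources (device arrays, loaded modules) that live inside that context.
data ContextGuard = ContextGuard
  { alive    :: TVar Bool  -- ^ is the underlying CUDA context still alive?
  , inFlight :: TVar Int   -- ^ number of resource finalizers currently freeing
  }

newContextGuard :: IO ContextGuard
newContextGuard = ContextGuard <$> newTVarIO True <*> newTVarIO 0

-- Resource finalizer: only free explicitly if the context is still alive,
-- and register the free so the context cannot be destroyed underneath it.
finalizeResource :: ContextGuard -> IO () -> IO ()
finalizeResource g freeResource = do
  ok <- atomically $ do
    a <- readTVar (alive g)
    when a $ modifyTVar' (inFlight g) (+ 1)
    pure a
  when ok $
    freeResource `finally` atomically (modifyTVar' (inFlight g) (subtract 1))

-- Context finalizer: mark the context dead, wait for any in-flight frees to
-- drain, then destroy the CUDA context (destruction itself elided here).
finalizeContext :: ContextGuard -> IO () -> IO ()
finalizeContext g destroyContext = do
  atomically $ do
    writeTVar (alive g) False
    n <- readTVar (inFlight g)
    when (n /= 0) retry
  destroyContext
```

With a guard like this, a resource finalizer that happens to run after the context has been finalized simply becomes a no-op, which matches the behaviour described above: explicit frees only happen while the `Context` still exists.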
Types of changes
Checklist: