perf: avoid S3 HEAD requests when downloading job outputs #924
crowecawcaw wants to merge 4 commits into aws-deadline:mainline
Conversation
```python
# Provide a dummy etag to skip HEAD request if the method exists (added in s3transfer 0.6.0).
# For downloads from CAS, we don't need etag validation since files are content-addressed.
# Older s3transfer versions don't have this method, so we check before calling.
if hasattr(future.meta, "provide_object_etag"):
```
Is there a way to check this once and cache it (or do a version check of some sort), just to avoid looking it up on every on_queued call when it shouldn't be changing?
Each future is a distinct instance, though surely if one has this attribute, they all would. Do you think this check would be slow enough to warrant optimization though? There will be 1 future per file.
Yeah, I was wondering that as well. It might not be terribly noticeable, tbh. It seems a bit inefficient when every future will be the same, but maybe not enough to really care about a hash lookup per file.
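For illustration, a minimal sketch of the caching idea being discussed (the subscriber name here is hypothetical, and this assumes `provide_object_etag` lives on `TransferMeta`, as the diff's hasattr check suggests):

```python
from s3transfer.futures import TransferMeta
from s3transfer.subscribers import BaseSubscriber

# Checked once at import time rather than per future; the method was
# added in s3transfer 0.6.0, so older versions won't have it.
_SUPPORTS_PROVIDE_OBJECT_ETAG = hasattr(TransferMeta, "provide_object_etag")


class _EtagSkippingSubscriber(BaseSubscriber):  # hypothetical name for this sketch
    def on_queued(self, future, **kwargs):
        if _SUPPORTS_PROVIDE_OBJECT_ETAG:
            # Content-addressed files don't need etag validation.
            future.meta.provide_object_etag("dummy-etag")
```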
```python
# For downloads from CAS, we don't need etag validation since files are content-addressed.
# Older s3transfer versions don't have this method, so we check before calling.
if hasattr(future.meta, "provide_object_etag"):
    future.meta.provide_object_etag("dummy-etag")
```
Setting this value skips HeadObject requests? Can we link to docs for reference in the code comment above?
The s3transfer library actually does not have documentation: https://github.com/boto/s3transfer/
This behavior was discovered in the Python version matrix testing. In lieu of documentation, this change has unit tests which verify the behavior we want (i.e. not calling HEAD).
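As a sketch of that style of test (the download helper and stub parameters are placeholders, not the PR's actual test code): botocore's `Stubber` raises if the next client call doesn't match the queued response, so stubbing only `get_object` implicitly asserts that no `head_object` call is made.

```python
import io

import boto3
from botocore.response import StreamingBody
from botocore.stub import Stubber

s3_client = boto3.client("s3", region_name="us-west-2")
stubber = Stubber(s3_client)

body = b"file-contents"
# Queue only a GetObject response. If the code under test issued a
# HeadObject first, the Stubber would raise instead of answering it.
stubber.add_response(
    "get_object",
    {"Body": StreamingBody(io.BytesIO(body), len(body))},
    {"Bucket": "test-bucket", "Key": "Data/abc123"},
)

with stubber:
    # Hypothetical helper standing in for the job attachments download path.
    download_one_file(s3_client, "test-bucket", "Data/abc123", size=len(body))
stubber.assert_no_pending_responses()
```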
One concern from the team is that this seems to be an internal implementation detail of s3transfer. I appreciate the improvements, but we also need to be careful end to end (JA, worker running jobs) to avoid another regression.
JA is a bit fragile, as you have noticed, so we want to check everything first.
Can we do what was implemented for Incremental downloads here: https://github.com/aws-deadline/deadline-cloud/blob/mainline/src/deadline/job_attachments/_incremental_downloads/_manifest_s3_downloads.py#L533-L537
It will not depend on an internal API.
Sure, that would work. It adds some complexity by having two download paths and two download managers (s3transfer and an internal thread pool), but it does avoid touching s3transfer details. If the JA team prefers that, I can close this PR.
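For reference, the rough shape of that alternative as a sketch (function names and chunk size are assumptions, not the incremental-downloads code): plain GetObject calls on an internal thread pool, which never triggers a HEAD request and doesn't touch s3transfer internals.

```python
from concurrent.futures import ThreadPoolExecutor


def _download_one(s3_client, bucket: str, key: str, dest_path: str) -> None:
    # A single GetObject per file; the manifest already supplies the size,
    # so there is no need for a HEAD-based size probe.
    response = s3_client.get_object(Bucket=bucket, Key=key)
    with open(dest_path, "wb") as f:
        for chunk in response["Body"].iter_chunks(chunk_size=8 * 1024 * 1024):
            f.write(chunk)


def download_all(s3_client, bucket: str, key_to_path: dict, max_workers: int = 10) -> None:
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(_download_one, s3_client, bucket, key, path)
            for key, path in key_to_path.items()
        ]
        for future in futures:
            future.result()  # re-raise any download error
```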
```diff
  stubber = Stubber(s3_client)
  stubber.add_client_error(
-     "head_object",
+     "get_object",
```
Nit - is this due to the small size, like at line 1797?
s3transfer calls HEAD to get the file size, which it uses to decide whether the file is small enough to fetch in a single request or whether it should use its multi-request approach. Since we provide the size, it never makes a HEAD request. The first request that fails when permissions are missing is the GET.
```python
download_logger = getLogger("deadline.job_attachments.download")


class _FileSizeSubscriber(_BaseSubscriber):
```
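For readers without the full diff, here is the gist of the subscriber approach as a sketch, assuming s3transfer's public `BaseSubscriber` hook and the `provide_transfer_size` method on `TransferMeta` (the body below is illustrative, not the exact PR code):

```python
from s3transfer.subscribers import BaseSubscriber


class _FileSizeSubscriber(BaseSubscriber):
    def __init__(self, file_size: int):
        self._file_size = file_size

    def on_queued(self, future, **kwargs):
        # Supplying the size up front means s3transfer never needs a
        # HeadObject call to choose between single and multipart download.
        future.meta.provide_transfer_size(self._file_size)
```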
Can we please run the integration tests? This code path changes both deadline job downloads and incremental downloads.
I need to check how this impacts the worker agent too; if it's in the call path, we need to run the integration tests there.
I ran the integration tests; they passed.
Closing this PR in favor of one on s3transfer itself: boto/s3transfer#363
Pull request was closed



What was the problem/requirement? (What/Why)
Job attachments uses the s3transfer library to download files from S3. When s3transfer decides whether to use a multipart download or a single download, it makes a HEAD request to get the file size. Job attachments already knows the file size from the manifest, though. For small files, the HEAD request takes about half the total download time.
What was the solution? (How)
Provide the file size to s3transfer so that it never needs to look the size up with a HEAD request.
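Concretely, the size from the manifest can be attached per download through a subscriber. A hedged sketch (bucket, key, paths, and manifest_size are placeholders, and `_FileSizeSubscriber` is the subscriber shown in the review thread above):

```python
import boto3
from s3transfer.manager import TransferManager

s3_client = boto3.client("s3")
manifest_size = 1234  # placeholder: the real size comes from the job attachments manifest

with TransferManager(s3_client) as manager:
    future = manager.download(
        bucket="example-bucket",
        key="Data/abc123",
        fileobj="/tmp/outputs/render.png",
        subscribers=[_FileSizeSubscriber(manifest_size)],
    )
    future.result()
```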
What is the impact of this change?
Roughly halves the transfer time for small files.
How was this change tested?
I created a job that generates 10k small files. It took 3m52s to download before this change and 2m04s after.
Was this change documented?
n/a
Does this PR introduce new dependencies?
No
Is this a breaking change?
No
Does this change impact security?
No
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.