Add quantization and tuning ops as part of model compile hash #180

Open
TedThemistokleous wants to merge 3 commits into rocm7.1_internal_testing from add_quant_and_tune_flags_to_hash

Conversation

@TedThemistokleous (Collaborator)

Description

Add the quantization, tuning, and memory-limit settings as inputs to the final hashed output name for a model. This ensures we're not reusing a differently quantized or tuned model from a previous session.
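The fix described above amounts to folding the compile options into the cache key, not just the model path. A minimal sketch of the idea, with hypothetical parameter names (the actual option names and hash scheme in the codebase may differ):

```python
import hashlib

def model_cache_key(model_path: str, quantization: str,
                    exhaustive_tune: bool, mem_limit: int) -> str:
    """Hypothetical sketch: derive the compiled-model cache name from
    the model path AND the compile options, so a model compiled with
    different quantization/tuning/memory settings gets a distinct name."""
    h = hashlib.sha256()
    # Separate fields with a delimiter so adjacent values can't collide.
    for field in (model_path, quantization, str(exhaustive_tune), str(mem_limit)):
        h.update(field.encode())
        h.update(b"\x00")
    return h.hexdigest()
```

With only the model path hashed, a session compiled with `exhaustive_tune=True` would silently reuse a cache entry produced without tuning; including the options makes the two keys distinct.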

Motivation and Context

@TedThemistokleous TedThemistokleous self-assigned this Oct 3, 2025
@TedThemistokleous TedThemistokleous added the Bugfix Fix to a bug or reported issue label Oct 3, 2025
Turns out we weren't passing the exhaustive-tune flags in for the recompile, along with some other flags like mem_limit.
