change resampler attention to scale_dot_product_attention #446

Open

Benkovichnikita wants to merge 1 commit into tencent-ailab:main from Benkovichnikita:feat/attention_optimization
Conversation

@Benkovichnikita Benkovichnikita commented Nov 19, 2024

I noticed two issues:

  1. PerceiverAttention consumes a huge amount of memory while materializing the full attention map.
  2. The scale is computed with two math.sqrt calls: scale = 1 / math.sqrt(math.sqrt(self.dim_head))

Both can be fixed by switching to the more efficient fused implementation: F.scaled_dot_product_attention(q, k, v).

PS
If you need the double-sqrt scale, it can be passed via the scale argument of F.scaled_dot_product_attention.
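To illustrate the change, here is a minimal sketch (not the repository's actual code; shapes and names are hypothetical) showing that the fused call matches the manual pattern. Note that applying the double-sqrt scale to both q and k before the matmul is equivalent to the default 1/sqrt(dim_head) scaling, since (1/d^(1/4)) * (1/d^(1/4)) = 1/sqrt(d):

```python
import math
import torch
import torch.nn.functional as F

def manual_attention(q, k, v, dim_head):
    # Original pattern: two sqrt calls, and the full (B, H, Lq, Lk)
    # attention map is materialized in memory.
    scale = 1 / math.sqrt(math.sqrt(dim_head))
    weight = (q * scale) @ (k * scale).transpose(-2, -1)
    weight = weight.softmax(dim=-1)
    return weight @ v

def fused_attention(q, k, v):
    # Proposed pattern: fused kernel, no explicit attention map.
    # Default scale is 1/sqrt(dim_head), matching the math above.
    return F.scaled_dot_product_attention(q, k, v)

# Quick numerical check on random tensors (batch, heads, seq, dim_head).
q = torch.randn(2, 4, 16, 32)
k = torch.randn(2, 4, 16, 32)
v = torch.randn(2, 4, 16, 32)
out_manual = manual_attention(q, k, v, dim_head=32)
out_fused = fused_attention(q, k, v)
print(torch.allclose(out_manual, out_fused, atol=1e-5))
```

The fused path also lets PyTorch dispatch to memory-efficient or FlashAttention kernels where available, which is where the memory savings come from.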
