Regarding the method for computing the Hessian matrix. #51

My question concerns line 61 of `gptq.py`: `inp = math.sqrt(2 / self.nsamples) * inp.float()`. Based on the paper, I believe it should instead be `inp = math.sqrt(tmp / self.nsamples) * inp.float()`. After making this change, I observed a lower quantization error. Could you confirm whether my understanding is correct, or point out what I have misunderstood?
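To make the comparison concrete, here is a minimal NumPy sketch of the two scalings. It is not the repository's code. It assumes that `tmp` is the number of samples in the current batch and that, before line 61, the accumulated matrix is rescaled by `nsamples / (nsamples + tmp)` and `nsamples` is then increased by `tmp`. The helper `accumulate` and its `scale` argument are hypothetical names used only for illustration, and each row of a batch is treated as one sample.

```python
import math
import numpy as np

def accumulate(batches, scale):
    """Accumulate H batch by batch (hypothetical helper, not from gptq.py).

    batches: list of arrays shaped (tmp, features); tmp = samples in the batch.
    scale:   function tmp -> numerator used in sqrt(numerator / nsamples).
    """
    features = batches[0].shape[1]
    H = np.zeros((features, features))
    nsamples = 0
    for x in batches:
        tmp = x.shape[0]
        # Assumed surrounding update: shrink the old average, then count the new samples.
        H *= nsamples / (nsamples + tmp)
        nsamples += tmp
        # The line in question, with a configurable numerator.
        inp = math.sqrt(scale(tmp) / nsamples) * x.T.astype(np.float64)
        H += inp @ inp.T
    return H

rng = np.random.default_rng(0)
batches = [rng.standard_normal((b, 4)) for b in (1, 3, 2)]
X = np.concatenate(batches)            # all samples, shape (6, 4)
N = X.shape[0]
reference = 2.0 / N * X.T @ X          # direct 2/N * sum_i x_i x_i^T

H_two = accumulate(batches, lambda tmp: 2)    # sqrt(2 / nsamples)
H_tmp = accumulate(batches, lambda tmp: tmp)  # sqrt(tmp / nsamples)

print("sqrt(2/nsamples) matches 2/N * X^T X:  ", np.allclose(H_two, reference))
print("sqrt(tmp/nsamples) matches 2/N * X^T X:", np.allclose(H_tmp, reference))
```

Under these assumptions, the `2 / nsamples` form reproduces the running value of 2/N · Σ xᵢxᵢᵀ exactly, whereas the `tmp / nsamples` form weights each batch by its size a second time. When every batch holds a single sample, the two results differ only by a constant factor of 2, so any change in quantization error would then have to come from elsewhere, for example numerical effects.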
