Add inference script and support for different input image shapes #8

Edmund-a7 · 2025-09-07T18:02:38Z

Brief:

Add H and W parameters in vggt.py、eval_utils.py、aggregator.py、attention.py，enabling the model to automatically handle image sizes.
Add inference.py

Hi! I've made a couple of improvements to the project.

Problem:

The repository was missing an inference.py script to run the model on custom datasets.
In the attention mechanism, the token height and width were hardcoded. This isn't ideal, as they should adapt to the processed image dimensions.

Solution:

I've added a new inference.py script.
I've modified the attention code to automatically calculate the tokens based on the image size (specifically, W / 14 and H / 14 in attention.py:161).

I have tested the code successfully. This makes the model more robust and easier to use for inference. Please let me know if you have any feedback!

- Add H and W in vggt.py、eval_utils.py、aggregator.py、attention.py，enabling the model to automatically handle image sizes. - Add inference.py

mystorm16 · 2025-09-08T06:53:38Z

Hi @Edmund-a7, thank you for your work on FastVGGT. We’ve updated the evaluation for the custom dataset, including the related point cloud and pose visualizations. Following your suggestion, we also modified the hardcoded height and width. Please give it a try!

mystorm16 and others added 4 commits September 5, 2025 14:25

Create LICENSE.txt

22915b8

Update README.md

7ba0b42

Update README.md

819aa3a

refactor to support for different input image shapes.

ab78594

- Add H and W in vggt.py、eval_utils.py、aggregator.py、attention.py，enabling the model to automatically handle image sizes. - Add inference.py

mystorm16 force-pushed the main branch from 819aa3a to f2bae42 Compare September 8, 2025 06:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add inference script and support for different input image shapes #8

Add inference script and support for different input image shapes #8

Uh oh!

Edmund-a7 commented Sep 7, 2025

Uh oh!

mystorm16 commented Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add inference script and support for different input image shapes #8

Are you sure you want to change the base?

Add inference script and support for different input image shapes #8

Uh oh!

Conversation

Edmund-a7 commented Sep 7, 2025

Uh oh!

mystorm16 commented Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants