Refactor & Enhance: Inference Pipeline, ONNX Export, and Core Modules #19

dnth · 2025-04-02T05:55:36Z

This PR introduces significant enhancements and refactoring across the codebase, focusing on improving the inference pipeline, streamlining ONNX export, enhancing core module efficiency, and updating documentation and setup.

Key Changes:

Inference & Visualization:
- Refactored scripts/live_inference.py for improved performance, readability, and added support for video file input alongside existing webcam and image options.
- Enhanced the draw_boxes function for clearer object detection visualization using OpenCV, including accurate scaling/padding adjustments for bounding boxes.
- Improved the Gradio demo (gradio_demo.py) visualization logic.
- Added FPS, provider, and resolution display overlays to live inference outputs.
ONNX Export:
- Introduced a PreprocessingModule to optionally embed image preprocessing steps (resizing, color conversion, normalization) directly into the exported ONNX graph.
- Updated the export.ipynb notebook with detailed instructions and options for ONNX model export.
Core Module Refactoring & Fixes:
- Optimized tensor concatenation in box_ops.py (box_xyxy_to_cxcywh) and dfine_utils.py (distance2bbox) using torch.cat.
- Improved tensor flattening in hybrid_encoder.py using reshape.
- Updated the PostProcessor in postprocess.py to handle orig_target_sizes as an optional input.
- Refactored components within the DFINE Transformer (dfine_transformer.py).
- Corrected label format handling in coco_eval.py (CocoEvaluator).
Training & Configuration:
- Updated dataset paths and configurations in train.py for the Rock Paper Scissors dataset.
- Adjusted the flat_epoch value in deim_hgnetv2_n_coco.yml.

…istent formatting and clarifying input parameters for video and image inference.

dnth added 30 commits March 31, 2025 05:29

train successfully

3dd5820

update exporter

82ff375

add sample inference for image

1dd1191

working inference

03819c2

working video det

46496a7

add cuda and trt ep

1967b29

working webcam inference

ebb4123

update

cb9b321

use cuda and trt for image inference

7b93a27

add video inference

085abcc

add webcam specs

141c142

update readme

aba0d1c

update gradio demo

1da7f63

update readme

a1a605d

update command

f3c6ded

update readme

298868f

update

33c480c

update

3e5f9df

update

75bf332

update

94184c8

Refine README instructions for live inference commands, ensuring cons…

2453ee5

…istent formatting and clarifying input parameters for video and image inference.

update

34be469

udpate

1bd403d

update

f1e3cdd

Update README.md

2708b88

update

50727ee

update

4926ec9

Update README.md

a3951b6

Update README.md

bda06dd

update quickstart

688b4d5

dnth added 11 commits April 2, 2025 17:23

add credit

158b793

update

fc1ace4

update

5578640

update

083c802

update

eaf1bf4

update

66b6ea9

update readme

3f09ecc

add dash

cd17ba8

update key features

09f87ed

updte

7180e6d

add openvino export

a47979a

dnth merged commit 6317d66 into main Apr 2, 2025
3 checks passed

dnth deleted the pinto branch April 2, 2025 14:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor & Enhance: Inference Pipeline, ONNX Export, and Core Modules #19

Refactor & Enhance: Inference Pipeline, ONNX Export, and Core Modules #19

Uh oh!

dnth commented Apr 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Refactor & Enhance: Inference Pipeline, ONNX Export, and Core Modules #19

Refactor & Enhance: Inference Pipeline, ONNX Export, and Core Modules #19

Uh oh!

Conversation

dnth commented Apr 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants