Added multimodal query support for VLM Embed by smasurekar · Pull Request #362 · NVIDIA-AI-Blueprints/rag

smasurekar · 2026-02-17T07:36:24Z

Concatenate multimodal content for VLM Embed

Multimodal retriever queries (text + image) are now concatenated into a single string for VLM embedding instead of returning only the image URL, so the embed model receives both text and image in one query. Text is joined with \n\n, then the image URL is appended (one image supported). Unit tests updated for the new concatenated format.

src/nvidia_rag/rag_server/main.py

Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com>

smasurekar requested a review from nv-pranjald February 17, 2026 07:36

smasurekar added the enhancement New feature or request label Feb 17, 2026

smasurekar closed this Feb 17, 2026

smasurekar reopened this Feb 17, 2026

smasurekar force-pushed the dev/smasurekar/vlm-embed-multimodal-query branch from 5bda440 to 4564039 Compare February 17, 2026 09:19

smasurekar requested a review from nv-nikkulkarni February 17, 2026 10:38

nv-pranjald reviewed Feb 17, 2026

View reviewed changes

src/nvidia_rag/rag_server/main.py Show resolved Hide resolved

nv-pranjald approved these changes Feb 17, 2026

View reviewed changes

smasurekar force-pushed the dev/smasurekar/vlm-embed-multimodal-query branch from 4564039 to 7487c01 Compare February 19, 2026 05:20

shubhadeepd added the release-26.03 label Feb 26, 2026

smasurekar force-pushed the dev/smasurekar/vlm-embed-multimodal-query branch from 7487c01 to 95f7dcd Compare February 27, 2026 06:33

Concatenate multimodal content for VLM Embed

4a78d0d

Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com>

smasurekar force-pushed the dev/smasurekar/vlm-embed-multimodal-query branch from 95f7dcd to 4a78d0d Compare February 27, 2026 12:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added multimodal query support for VLM Embed#362

Added multimodal query support for VLM Embed#362
smasurekar wants to merge 1 commit intodevelopfrom
dev/smasurekar/vlm-embed-multimodal-query

smasurekar commented Feb 17, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

smasurekar commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Concatenate multimodal content for VLM Embed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

smasurekar commented Feb 17, 2026 •

edited

Loading