
Conversation

@carll99
Copy link
Member

@carll99 carll99 commented Dec 9, 2025

Here is the first pass at what we would like to see enabled for performance tracking.

Posting this as requested in the earlier meeting. We will discuss the PR in our next meeting.

@mkumatag mkumatag added this to the .Next milestone Dec 10, 2025
@iv1111 iv1111 marked this pull request as draft December 10, 2025 09:09
Member

@dharaneeshvrd dharaneeshvrd left a comment

It would be great if you could collect these timings along with the results and display them in a tabular format instead of printing them on separate lines.
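
(As an illustration of that suggestion, a minimal sketch of collecting the stage timings into one dict and logging them as a single table; the stage names and the log_perf_table helper are hypothetical, not part of this PR.)

import time
import logging

logger = logging.getLogger(__name__)

def log_perf_table(perf):
    # Emit all collected stage timings as one aligned table instead of separate log lines.
    width = max(len(name) for name in perf)
    rows = [f"{name.ljust(width)}  {seconds:8.3f} s" for name, seconds in perf.items()]
    logger.info("Perf data:\n%s", "\n".join(rows))

perf = {}
start = time.time()
time.sleep(0.01)                      # stand-in for document retrieval
perf["retrieval"] = time.time() - start
start = time.time()
time.sleep(0.02)                      # stand-in for LLM inference
perf["llm inference"] = time.time() - start
log_perf_table(perf)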

max_tokens = data.get("max_tokens", 512)
temperature = data.get("temperature", 0.0)
stop_words = data.get("stop")
stream = data.get("stream")
Member

Don't you want to start a timer here to measure the overall execution of the query?

start_time = time.time()
vllm_stream = query_vllm_stream(prompt, docs, llm_endpoint, llm_model, stop_words, max_tokens, temperature, stream, dynamic_chunk_truncation=TRUNCATION)
request_time = time.time() - start_time
logger.info(f"Perf data: rag answer time = {request_time}")
Member

What is the difference between the rag answer time and the llm inferencing time in llm_utils? It looks like they would produce the same timings.
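
(The two numbers would only diverge if the outer timer also covered retrieval and prompt assembly rather than just the model call. Purely as an illustration, a minimal sketch with stub functions standing in for the real calls; none of the names below come from the project.)

import time

def retrieve_documents(question):
    time.sleep(0.05)                  # simulated retrieval latency
    return ["doc1", "doc2"]

def build_prompt(question, docs):
    return question + "\n" + "\n".join(docs)

def query_llm(prompt):
    time.sleep(0.10)                  # simulated model latency
    return "answer"

def answer_query(question):
    request_start = time.time()
    docs = retrieve_documents(question)
    prompt = build_prompt(question, docs)

    llm_start = time.time()
    answer = query_llm(prompt)
    llm_inference_time = time.time() - llm_start    # the model call only

    rag_answer_time = time.time() - request_start   # retrieval + prompt build + model call
    return answer, rag_answer_time, llm_inference_time

_, rag_t, llm_t = answer_query("example question")
print(f"rag answer time = {rag_t:.3f}s, llm inference time = {llm_t:.3f}s")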

Collaborator

@iv1111 iv1111 left a comment

I think the amount of logging is acceptable.


try:
    start_time = time.time()
    vllm_stream = query_vllm_stream(prompt, docs, llm_endpoint, llm_model, stop_words, max_tokens, temperature, stream, dynamic_chunk_truncation=TRUNCATION)
Collaborator

@iv1111 iv1111 Dec 10, 2025

This call to query_vllm_stream in streaming mode returns a generator, not a final answer. The generator (like a C++ iterator) is then read chunk by chunk, with each piece being sent to the Web UI even after this Python backend function returns. So if you want to measure very precisely, you should probably test the non-streaming case only.
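
(If timing the streaming path is still wanted, one possible approach is a small generator wrapper that stops the clock only once the consumer finishes reading the stream; the wrapper name and log format below are made up for illustration.)

import time
import logging

logger = logging.getLogger(__name__)

def timed_stream(chunks, label="rag answer"):
    # Yield chunks unchanged while recording time to first chunk and total stream time.
    start = time.time()
    first = None
    for chunk in chunks:
        if first is None:
            first = time.time()
            logger.info("Perf data: %s time to first chunk = %.3f", label, first - start)
        yield chunk
    logger.info("Perf data: %s total stream time = %.3f", label, time.time() - start)

# Hypothetical usage with the call from this PR:
# vllm_stream = timed_stream(query_vllm_stream(prompt, docs, llm_endpoint, llm_model,
#                                              stop_words, max_tokens, temperature, stream,
#                                              dynamic_chunk_truncation=TRUNCATION))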

Collaborator

@Niharika0306, please chime in to confirm.

Contributor

@Niharika0306 Niharika0306 Dec 11, 2025

@iv1111, true.
I have addressed it in the updated PR.

@iv1111
Collaborator

iv1111 commented Dec 10, 2025

@carll99 You could try this out for request id retrieval:

from flask import request
.....
req_id = id(request)
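
(For illustration only, a minimal Flask sketch of attaching such a request id to the perf log lines so timings from concurrent requests can be told apart; the route name and log format are made up, and since flask.request is a proxy object, id(request._get_current_object()) may be needed to get a value that actually differs per request.)

import time
import logging
from flask import Flask, request

app = Flask(__name__)
logger = logging.getLogger(__name__)

@app.route("/query", methods=["POST"])
def query():
    req_id = id(request._get_current_object())   # id of the underlying request object
    start_time = time.time()
    # ... run retrieval and the LLM call here ...
    logger.info("Perf data: req_id=%s rag answer time = %.3f", req_id, time.time() - start_time)
    return {"status": "ok"}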

Signed-off-by: Carl Love <cel@linux.ibm.com>
@carll99
Member Author

carll99 commented Dec 10, 2025

The above comments are all really good. The concern was to keep this as simple as possible given the pending release. The goal is to implement thread tracking and additional timings in a future release.

The blocking issue with the current patch is that there is no sign-off. I will close this PR and submit a new one with the sign-off.
