Draft
Conversation
…sure time spent in each stage of the RAG pipeline. (#317) * adding oberservablility * Update docs/debugging.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/observability.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Add query-to-answer-pipeline doc and observability/debugging updates * Trigger CI * getting build to kick in for observability file * Fix typos in query-to-answer-pipeline.md and ensure file in PR for link check * get rid of PULL_REQUEST_SUMMARY --------- Co-authored-by: nkmcalli <nkmcalli@yahoo.com>
…odal-query-integration-test Add multimodal query integration tests to CI pipeline
* updated files per bug 5880717 * Update CONTRIBUTING.md * Update README.md * Update python-client.md * Update readme.md * Update readme.md
* updated helm instructions * Update deploy-helm.md
… Pro is not supported in this release.heiss/5863956a (#335) * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release. * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release.
* Hotfix doc release v2.4.0.rc3 (#323) * docs: fix typos, grammar, and broken links in documentation - README: remove duplicate 'with', fix 'e.g.' punctuation, fix link spacing - ci/README: GitLab CI -> GitHub Actions CI pipeline - docs/support-matrix: Bluprint -> Blueprint, fix link spacing - docs/deploy-docker-self-hosted: add 'are' before deployed, NIMS -> NIMs - docs/troubleshooting: fix stray markdown, subsequent deployments section - docs/release-notes: DRA -> MIG, Nvidia -> NVIDIA, fix punctuation and its/it's - docs/python-client: add missing closing quote in install command - docs/text_only_ingest: remove duplicate 'the' - docs/multi-collection-retrieval: its -> it's (it is enabled) - docs/query_decomposition: add note for 1997/Naples example - docs/user-interface: 750 px -> 750px - deploy/workbench: fix hardware-requirements link to support-matrix, model v1.5 - tests/integration/README: fix test_cases formatting Co-authored-by: Cursor <cursoragent@cursor.com> * fix: documents words --------- Co-authored-by: Cursor <cursoragent@cursor.com> * Update tech diagram (#329) * Fixing mcp server bug (#325) Signed-off-by: Niyati Singal <nsingal@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com> * changes to docs per bug 5767861 (#328) * Updated launchable with v2.4.0 tag (#318) * updated support matrix (#321) * Document the end‑to‑end flow from query to answer and show how to measure time spent in each stage of the RAG pipeline. (#317) * adding oberservablility * Update docs/debugging.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/observability.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Add query-to-answer-pipeline doc and observability/debugging updates * Trigger CI * getting build to kick in for observability file * Fix typos in query-to-answer-pipeline.md and ensure file in PR for link check * get rid of PULL_REQUEST_SUMMARY --------- Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * fixed files associated with build (#322) * Add multimodal query integration tests to CI pipeline * changes to docs per bug 5767861 * updated files per bug 5880717 (#327) * updated files per bug 5880717 * Update CONTRIBUTING.md * Update README.md * Update python-client.md * Update readme.md * Update readme.md * Update docs/deploy-helm.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/deploy-helm.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> --------- Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> Co-authored-by: nkmcalli <nkmcalli@yahoo.com> Co-authored-by: Pranjal Doshi <pranjald@nvidia.com> Co-authored-by: nv-pranjald <150428320+nv-pranjald@users.noreply.github.com> * Fix workflow rule and doc bugs (#331) * Revert back milvus version in conf.md to v2.6.5 * Modify workflow to run on any branch * Fix workflow push rule to run on protected branches * Add files via upload (#326) Found an error in the Q&A section where images in the citation were not being printed. * Update transformers version to 5.1.0 (#332) * Updated launchable with v2.4.0 tag (#318) * Fix image links --------- Signed-off-by: Niyati Singal <nsingal@nvidia.com> Co-authored-by: Johnny J <johnnyj@nvidia.com> Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: Shubhadeep Das <149712532+shubhadeepd@users.noreply.github.com> Co-authored-by: niyatisingal <nsingal@nvidia.com> Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> Co-authored-by: nkmcalli <nkmcalli@yahoo.com> Co-authored-by: Pranjal Doshi <pranjald@nvidia.com> Co-authored-by: nv-pranjald <150428320+nv-pranjald@users.noreply.github.com>
* updated change-model.md per 5878193 * Update change-model.md * Update change-model.md * Update change-model.md * Update docs/change-model.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/change-model.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> --------- Co-authored-by: nkmcalli <nkmcalli@yahoo.com>
* changes to docs per bug 5767861 (#328) * Updated launchable with v2.4.0 tag (#318) * updated support matrix (#321) * Document the end‑to‑end flow from query to answer and show how to measure time spent in each stage of the RAG pipeline. (#317) * adding oberservablility * Update docs/debugging.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/observability.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Add query-to-answer-pipeline doc and observability/debugging updates * Trigger CI * getting build to kick in for observability file * Fix typos in query-to-answer-pipeline.md and ensure file in PR for link check * get rid of PULL_REQUEST_SUMMARY --------- Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * fixed files associated with build (#322) * Add multimodal query integration tests to CI pipeline * changes to docs per bug 5767861 * updated files per bug 5880717 (#327) * updated files per bug 5880717 * Update CONTRIBUTING.md * Update README.md * Update python-client.md * Update readme.md * Update readme.md * Update docs/deploy-helm.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/deploy-helm.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> --------- Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> Co-authored-by: nkmcalli <nkmcalli@yahoo.com> Co-authored-by: Pranjal Doshi <pranjald@nvidia.com> Co-authored-by: nv-pranjald <150428320+nv-pranjald@users.noreply.github.com> * Fix workflow rule and doc bugs (#331) * Revert back milvus version in conf.md to v2.6.5 * Modify workflow to run on any branch * Fix workflow push rule to run on protected branches * Add files via upload (#326) Found an error in the Q&A section where images in the citation were not being printed. * Doc bug fixes (#339) * updated helm instructions (#333) * updated helm instructions * Update deploy-helm.md * fix broken image link (#334) * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release.heiss/5863956a (#335) * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release. * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release. * Fix broken image link in observability file * Fix CPU seach with GPU index doc * Fix VLLM profile instruction for nemotron-3-nano --------- Co-authored-by: Kurt Heiss <kheiss@nvidia.com> * Updated troubleshoot documentation for Elasticsearch connection timeout (#341) Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com> * Changes for final Release readiness (#349) * fixed files for build purposes (#343) * Fix status 500 on unknown task and summary status for plain Redis and tokenizer encode_plus attribute error (#342) * Remove rc tag from containers and helm chart * added missing parentheses (#347) * fix doc link defect per Z. Huang review spreadsheet (#346) --------- Co-authored-by: Kurt Heiss <kheiss@nvidia.com> Co-authored-by: kumar-punit <punitk@nvidia.com> * Fix Elasticsearch auth helm steps in doc (#350) * Added ingestor server crash due to OOM issue incase of large files ingestion as known limitation in troubleshooting.md doc (#353) * Update pillow and crytography version (#352) * Update pillow and crytography version * Enable job continuation on failure * Remove hard dependency of pillow and crytography from pyproject --------- Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com> Co-authored-by: Kurt Heiss <kheiss@nvidia.com> Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> Co-authored-by: nkmcalli <nkmcalli@yahoo.com> Co-authored-by: Pranjal Doshi <pranjald@nvidia.com> Co-authored-by: nv-pranjald <150428320+nv-pranjald@users.noreply.github.com> Co-authored-by: Swapnil Masurekar <smasurekar@nvidia.com> Co-authored-by: kumar-punit <punitk@nvidia.com> Co-authored-by: Nikhil Kulkarni <nikkulkarni@nvidia.com>
Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com>
Updated branch name State name changed from "FAILURE"->"FAILED"
…recated SpanAttributes (#377) Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com>
Signed-off-by: Niyati Singal <nsingal@nvidia.com>
* Added MIG Slice support for RTX 6000 pro Signed-off-by: Punit Kumar <punitk@nvidia.com> * Changed to default config in MIG slicing in rtx6000pro config --------- Signed-off-by: Punit Kumar <punitk@nvidia.com> Co-authored-by: niyatisingal <nsingal@nvidia.com>
…385) * changes to docs per bug 5767861 (#328) * Updated launchable with v2.4.0 tag (#318) * updated support matrix (#321) * Document the end‑to‑end flow from query to answer and show how to measure time spent in each stage of the RAG pipeline. (#317) * adding oberservablility * Update docs/debugging.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/observability.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Add query-to-answer-pipeline doc and observability/debugging updates * Trigger CI * getting build to kick in for observability file * Fix typos in query-to-answer-pipeline.md and ensure file in PR for link check * get rid of PULL_REQUEST_SUMMARY --------- Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * fixed files associated with build (#322) * Add multimodal query integration tests to CI pipeline * changes to docs per bug 5767861 * updated files per bug 5880717 (#327) * updated files per bug 5880717 * Update CONTRIBUTING.md * Update README.md * Update python-client.md * Update readme.md * Update readme.md * Update docs/deploy-helm.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> * Update docs/deploy-helm.md Co-authored-by: nkmcalli <nkmcalli@yahoo.com> --------- Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> Co-authored-by: nkmcalli <nkmcalli@yahoo.com> Co-authored-by: Pranjal Doshi <pranjald@nvidia.com> Co-authored-by: nv-pranjald <150428320+nv-pranjald@users.noreply.github.com> * Fix workflow rule and doc bugs (#331) * Revert back milvus version in conf.md to v2.6.5 * Modify workflow to run on any branch * Fix workflow push rule to run on protected branches * Add files via upload (#326) Found an error in the Q&A section where images in the citation were not being printed. * Doc bug fixes (#339) * updated helm instructions (#333) * updated helm instructions * Update deploy-helm.md * fix broken image link (#334) * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release.heiss/5863956a (#335) * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release. * Add release note for Audio model deployment on Kubernetes on RTX‑6000 Pro is not supported in this release. * Fix broken image link in observability file * Fix CPU seach with GPU index doc * Fix VLLM profile instruction for nemotron-3-nano --------- Co-authored-by: Kurt Heiss <kheiss@nvidia.com> * Updated troubleshoot documentation for Elasticsearch connection timeout (#341) Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com> * updated path to image files so that html output is rendered correctly (#363) * Updated helm instructions for mig-deployment prerequisites (#364) * Updated helm instructions for mig-deployment * Update mig-deployment.md * Doc enhancement for noteboook (#361) * Doc enhancement for noteboook * Update release notes * Update launchable.ipynb (#365) Updated branch name State name changed from "FAILURE"->"FAILED" * Fix typo in release notes --------- Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> * fixed links in deploy-helm and mig-deploymnent (#367) * update artifacts to GA version for v2.4.0 release (#359) * updated files according to style guide (#369) * Revert deploy-helm and mig-deployment to pre-11a31a4 versions (#372) * Fix release date in changelog (#373) * Bump up version to 2.5.0 --------- Signed-off-by: Swapnil Masurekar <smasurekar@nvidia.com> Co-authored-by: Kurt Heiss <kheiss@nvidia.com> Co-authored-by: rkharwar-nv <rkharwar@nvidia.com> Co-authored-by: nkmcalli <nkmcalli@yahoo.com> Co-authored-by: Pranjal Doshi <pranjald@nvidia.com> Co-authored-by: nv-pranjald <150428320+nv-pranjald@users.noreply.github.com> Co-authored-by: Swapnil Masurekar <smasurekar@nvidia.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Updated conf.py file to include switcher text: "switcher": {"json_url": "../versions1.json", "version_match": release},