Opensearch Vectordb changes by manju956 · Pull Request #281 · IBM/project-ai-services

manju956 · 2026-02-03T12:31:09Z

Introduce OpenSearch vector DB as a replacement for Milvus
Contains manifest changes that replace references to Milvus with OpenSearch. Additionally, OpenSearch uses user authentication for DB interactions.
OpenSearch-relevant changes in the Python backend code

spyre-rag/src/common/db_utils.py

ai-services/assets/applications/rag/templates/chat-bot.yaml.tmpl

ai-services/assets/applications/rag-dev/templates/chat-bot.yaml.tmpl

spyre-rag/src/common/db_utils.py

ai-services/assets/applications/rag/templates/opensearch.yaml.tmpl

ai-services/assets/applications/rag/values.yaml

images/rag-base/requirements.txt

spyre-rag/src/common/db_utils.py

dharaneeshvrd · 2026-02-04T10:12:27Z

spyre-rag/src/common/db_utils.py

+            "description": "Post-processor for hybrid search using RRF",
+            "phase_results_processors": [
+                {
+                    "normalization-processor": {


It seems normalization-processor and RRF are different techniques.
You have used normalization-processor, but the id refers to it as rrf.
normalization-processor seems better suited to our use case,
but adding weights is critical.

Can you please revisit this block?

Exploring semantic-heavy weights; will run tests to verify accuracy.
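
As an illustration of the weights being discussed, here is a minimal sketch of a search-pipeline body that uses normalization-processor with explicit weights. The helper name `build_hybrid_pipeline_body`, the `min_max` and `arithmetic_mean` choices, and the 0.3/0.7 split are assumptions for the example, not values taken from this PR. The weights are expected to follow the order of the sub-queries in the hybrid query.

```python
from typing import Any, Dict, List


def build_hybrid_pipeline_body(weights: List[float]) -> Dict[str, Any]:
    """Build a search-pipeline body that normalizes and combines scores.

    `weights` follow the order of the sub-queries in the hybrid query,
    for example [keyword_weight, semantic_weight]. The values used below
    are placeholders that would need tuning against accuracy tests.
    """
    if not weights or any(w < 0 for w in weights):
        raise ValueError("weights must be a non-empty list of non-negative numbers")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1.0")
    return {
        "description": "Post-processor for hybrid search using score normalization",
        "phase_results_processors": [
            {
                "normalization-processor": {
                    # Rescale each sub-query's scores to a comparable range.
                    "normalization": {"technique": "min_max"},
                    # Combine the rescaled scores as a weighted mean.
                    "combination": {
                        "technique": "arithmetic_mean",
                        "parameters": {"weights": weights},
                    },
                }
            }
        ],
    }


# Semantic-heavy example: 30% keyword score, 70% vector score (illustrative only).
body = build_hybrid_pipeline_body([0.3, 0.7])
```

Keeping the weights as a parameter makes it straightforward to try several splits during accuracy testing without editing the pipeline definition itself.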

ai-services/assets/applications/rag-dev/templates/chat-bot.yaml.tmpl

ai-services/assets/applications/rag-dev/values.yaml

images/rag-base/requirements.txt

Niharika0306 · 2026-02-06T04:59:21Z

A small comment: please handle the db-status call on OpenSearch. Without this, the UI fails to display a response.

curl http://localhost:5001/db-status
{"message":"Empty value passed for a required argument 'index'.","ready":false}

manju956 · 2026-02-06T09:48:14Z

> A small comment: please handle the db-status call on OpenSearch. Without this, the UI fails to display a response.
> curl http://localhost:5001/db-status
> {"message":"Empty value passed for a required argument 'index'.","ready":false}

With the latest commit in the PR, the issue is fixed.

yussufsh

Follow the OpenSearch trademark.

spyre-rag/src/ingest/cli.py

spyre-rag/README.md

dharaneeshvrd

Can you bump the rag version as well?

ai-services/assets/applications/rag-dev/templates/opensearch.yaml.tmpl

ai-services/assets/applications/rag/templates/opensearch.yaml.tmpl

spyre-rag/src/common/db_utils.py

dharaneeshvrd · 2026-02-06T14:07:46Z

spyre-rag/src/common/opensearch.py

+        }
+
+        try:
+            self.client.search_pipeline.delete(id="hybrid_rrf_pipeline")


Why are we recreating the pipeline here?
Why can't we just check whether it exists?
Also, don't use rrf in the id.

I will check this
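
The check-before-create approach suggested above could be sketched roughly as follows. The helper `ensure_search_pipeline`, the id `hybrid_search_pipeline`, and the fake client are hypothetical. The sketch assumes the client exposes `search_pipeline.get` and `search_pipeline.put` alongside the `delete` call in the diff, and that a missing pipeline surfaces as an exception carrying `status_code == 404`.

```python
from typing import Any, Dict


def ensure_search_pipeline(client: Any, pipeline_id: str, body: Dict[str, Any]) -> bool:
    """Create the search pipeline only if it is missing.

    Returns True when the pipeline was created, False when it already existed.
    """
    try:
        client.search_pipeline.get(id=pipeline_id)
        return False
    except Exception as exc:
        # Assumption: "not found" is reported with status_code 404.
        # Anything else is a real failure and is re-raised.
        if getattr(exc, "status_code", None) != 404:
            raise
    client.search_pipeline.put(id=pipeline_id, body=body)
    return True


class _NotFound(Exception):
    """Stand-in for the client's not-found error, for demonstration only."""

    status_code = 404


class _FakePipelines:
    def __init__(self) -> None:
        self.store: Dict[str, Dict[str, Any]] = {}

    def get(self, id: str) -> Dict[str, Any]:
        if id not in self.store:
            raise _NotFound(id)
        return {id: self.store[id]}

    def put(self, id: str, body: Dict[str, Any]) -> None:
        self.store[id] = body


class _FakeClient:
    def __init__(self) -> None:
        self.search_pipeline = _FakePipelines()


client = _FakeClient()
created_first = ensure_search_pipeline(client, "hybrid_search_pipeline", {"description": "demo"})
created_again = ensure_search_pipeline(client, "hybrid_search_pipeline", {"description": "demo"})
```

A trade-off of this approach is that a changed pipeline body is not picked up automatically; an existing pipeline stays as it is until it is updated explicitly.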

mkumatag · 2026-02-07T05:59:00Z

spyre-rag/src/common/vector_db.py

+from abc import ABC, abstractmethod
+from typing import List, Dict, Optional
+
+class VectorStore(ABC):


Please add as much detail to these methods as possible.

mkumatag · 2026-02-07T06:04:35Z

spyre-rag/src/common/vector_db.py

+
+class VectorStore(ABC):
+    @abstractmethod
+    def insert_chunks(self, emb_model: str, emb_endpoint: str, max_tokens: int, chunks: List[Dict], batch_size: int = 10):


Can we take the embedding out of this class so that we can use it more efficiently? Maybe pass in an embedder class that has a method containing all the embedding logic. I'd like this class to support two ways of searching/inserting: 1. pure embeddings, 2. text chunks (with an embedder class).
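
A rough sketch of the separation proposed in this comment is shown below. Every name other than `VectorStore` and `insert_chunks` is hypothetical (`Embedder`, `insert_embeddings`, `InMemoryStore`, `LengthEmbedder`, and the `"text"` chunk key), and the in-memory backend exists only to exercise the interface. It is not the design adopted in the PR.

```python
from abc import ABC, abstractmethod
from typing import Dict, List, Protocol


class Embedder(Protocol):
    """Anything that can turn texts into vectors (hypothetical interface)."""

    def embed(self, texts: List[str]) -> List[List[float]]: ...


class VectorStore(ABC):
    @abstractmethod
    def insert_embeddings(self, chunks: List[Dict], vectors: List[List[float]]) -> int:
        """Store chunks with precomputed vectors and return the number stored."""

    def insert_chunks(self, chunks: List[Dict], embedder: Embedder, batch_size: int = 10) -> int:
        """Embed chunk text in batches with `embedder`, then store the results."""
        stored = 0
        for start in range(0, len(chunks), batch_size):
            batch = chunks[start:start + batch_size]
            vectors = embedder.embed([c["text"] for c in batch])
            stored += self.insert_embeddings(batch, vectors)
        return stored


class InMemoryStore(VectorStore):
    """Toy backend used only to exercise the interface."""

    def __init__(self) -> None:
        self.rows: List[Dict] = []

    def insert_embeddings(self, chunks: List[Dict], vectors: List[List[float]]) -> int:
        if len(chunks) != len(vectors):
            raise ValueError("chunks and vectors must have the same length")
        self.rows.extend({**c, "vector": v} for c, v in zip(chunks, vectors))
        return len(chunks)


class LengthEmbedder:
    """Stand-in embedder: a one-dimensional vector holding the text length."""

    def embed(self, texts: List[str]) -> List[List[float]]:
        return [[float(len(t))] for t in texts]


store = InMemoryStore()
count = store.insert_chunks(
    [{"text": "ab"}, {"text": "abc"}, {"text": "a"}], LengthEmbedder(), batch_size=2
)
```

With this split, callers that already hold vectors use `insert_embeddings` directly, while callers with raw text pass an embedder to `insert_chunks`; the store itself no longer needs to know the embedding model or endpoint.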

mkumatag · 2026-02-07T06:04:49Z

spyre-rag/src/common/vector_db.py

+
+class VectorStoreNotReadyError():
+    """Raised when the database is unreachable or initializing."""
+    pass


Add a newline at the end of the file.
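
Separately from the newline, the class in the hunk above is declared without a base class. In Python 3 only subclasses of `BaseException` can be raised, so a class meant to be raised normally derives from `Exception`. A minimal sketch follows; the helper `require_ready` is hypothetical and only there to show the exception in use.

```python
class VectorStoreNotReadyError(Exception):
    """Raised when the database is unreachable or initializing."""


def require_ready(is_ready: bool) -> None:
    """Hypothetical guard that raises when the store is not ready."""
    if not is_ready:
        raise VectorStoreNotReadyError("vector store is not ready")


try:
    require_ready(False)
    caught_message = None
except VectorStoreNotReadyError as exc:
    caught_message = str(exc)
```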

manju956 self-assigned this Feb 3, 2026

manju956 added the enhancement New feature or request label Feb 3, 2026

manju956 changed the title ~~Opensearch~~ Opensearch Vectordb changes Feb 3, 2026

manju956 requested review from dharaneeshvrd, iv1111, mkumatag and yussufsh and removed request for dharaneeshvrd, iv1111 and mkumatag February 3, 2026 12:45

mkumatag requested changes Feb 3, 2026

spyre-rag/src/common/db_utils.py (outdated, resolved)

mkumatag reviewed Feb 3, 2026

ai-services/assets/applications/rag/templates/chat-bot.yaml.tmpl (outdated, resolved)

dharaneeshvrd requested changes Feb 4, 2026

dharaneeshvrd reviewed Feb 4, 2026

manju956 force-pushed the opensearch branch from 871fdf9 to e5c4aa6 Compare February 4, 2026 14:59

mkumatag reviewed Feb 4, 2026

ai-services/assets/applications/rag-dev/values.yaml (resolved)

dharaneeshvrd reviewed Feb 6, 2026

images/rag-base/requirements.txt (resolved)

manju956 added 10 commits February 6, 2026 14:25

Migrate Milvus db to Opensearch

d716543

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Rectify manifests and Opensearch DB search logic

1859992

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Rectify ingest-docs template file

1f84df5

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Use DB password as per recommended guidelines

aff7207

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Modify livenessprobe for opensearch

73a6e7d

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Fix opensearch collection name field

df6ad75

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

remove sparse save and load from db_utils

84913f7

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Opensearch changes to rag-dev manifests

83c47be

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Use opensearch image pushed to ai-services-private namespace

4dc29d4

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Use Index inplace of Collection references

d47e40d

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

manju956 added 2 commits February 6, 2026 14:27

Fix ingestion cleanup issue due to garbled index name

ce01e96

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Store Opensearch credentials in values.yaml

3b81b4f

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

manju956 force-pushed the opensearch branch from f7a1306 to 3b81b4f Compare February 6, 2026 09:00

Fix opensearch container volume path

af12e49

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

yussufsh reviewed Feb 6, 2026

spyre-rag/src/ingest/cli.py (outdated, resolved)

spyre-rag/src/ingest/cli.py (outdated, resolved)

spyre-rag/README.md (outdated, resolved)

spyre-rag/README.md (outdated, resolved)

Introduce VectorStore abstract class and implement Opensearch

35af6ea

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

dharaneeshvrd reviewed Feb 6, 2026

manju956 and others added 2 commits February 6, 2026 21:20

fix vectorstore refactoring issues

d695073

Signed-off-by: manju956 <manjunath.ac956@gmail.com>

Merge branch 'main' into opensearch

a0162e4

mkumatag requested changes Feb 7, 2026

6 participants