Commit c29bd2f

nuwangeek and erangi-ar authored
Bug fixes in Deployment environments (buerokratt#164)
* partially completes prompt refiner
* integrate prompt refiner with llm_config_module
* fixed ruff lint issues
* complete prompt refiner, chunk retriever and reranker
* remove unnecessary comments
* updated .gitignore
* Remove data_sets from tracking
* update .gitignore file
* complete vault setup and response generator
* remove ignore comment
* removed old modules
* fixed merge conflicts
* Vault Authentication token handling (buerokratt#154) (#70)
* partially completes prompt refiner
* integrate prompt refiner with llm_config_module
* fixed ruff lint issues
* complete prompt refiner, chunk retriever and reranker
* remove unnecessary comments
* updated .gitignore
* Remove data_sets from tracking
* update .gitignore file
* complete vault setup and response generator
* remove ignore comment
* removed old modules
* fixed merge conflicts
* added initial setup for the vector indexer
* initial llm orchestration service update with context generation
* added new endpoints
* vector indexer with contextual retrieval
* fixed requested changes
* fixed issue
* initial diff identifier setup
* uncomment docker compose file
* added test endpoint for orchestrate service
* fixed ruff linting issue
* Rag 103 budget related schema changes (#41)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* rename commonUtils
* Rag 93 update connection status (#47)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* rename commonUtils
* Implement LLM connection status update functionality with API integration and UI enhancements
* Rag 99 production llm connections logic (#46)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* rename commonUtils
* Add production connection retrieval and update related components
* Implement LLM connection environment update and enhance connection management logic
* Rag 119 endpoint to update used budget (#42)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* Add functionality to update used budget for LLM connections with validation and response handling
* Implement budget threshold checks and connection deactivation logic in update process
* resolve pr comments
* Rag 113 warning and termination banners (#43)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* Add budget status check and update BudgetBanner component
* rename commonUtils
* resolve pr comments
* rag-105-reset-used-budget-cron-job (#44)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* Add cron job to reset used budget
* rename commonUtils
* resolve pr comments
* Remove trailing slash from vault/agent-out in .gitignore
* Rag 101 budget check functionality (#45)
* Refactor llm_connections table: update budget tracking fields and reorder columns
* Add budget threshold fields and logic to LLM connection management
* Enhance budget management: update budget status logic, adjust thresholds, and improve form handling for LLM connections
* resolve pr comments & refactoring
* rename commonUtils
* budget check functionality
* gui running on 3003 issue fixed
* gui running on 3003 issue fixed (#50)
* added get-configuration.sqpl and updated llmconnections.ts
* Add SQL query to retrieve configuration values
* Hashicorp key saving (#51)
* gui running on 3003 issue fixed
* Add SQL query to retrieve configuration values
* Remove REACT_APP_NOTIFICATION_NODE_URL variable: removed the REACT_APP_NOTIFICATION_NODE_URL environment variable
* added initial diff identifier functionality
* test phase1
* Refactor inference and connection handling in YAML and TypeScript files
* fixes (#52)
* gui running on 3003 issue fixed
* Add SQL query to retrieve configuration values
* Refactor inference and connection handling in YAML and TypeScript files
* Add entry point script for Vector Indexer with command line interface
* fix (#53)
* gui running on 3003 issue fixed
* Add SQL query to retrieve configuration values
* Refactor inference and connection handling in YAML and TypeScript files
* Add entry point script for Vector Indexer with command line interface
* diff fixes
* uncomment llm orchestration service in docker compose file
* complete vector indexer
* Add YAML configurations and scripts for managing vault secrets
* Add vault secret management functions and endpoints for LLM connections
* Add Test Production LLM page with messaging functionality and styles
* fixed issue
* fixed merge conflicts
* fixed issue
* fixed issue
* updated with requested changes
* fixed test ui endpoint request responses schema issue
* fixed dvc path issue
* added dspy optimization
* filters fixed
* refactor: restructure llm_connections table for improved configuration and tracking
* feat: enhance LLM connection handling with AWS and Azure embedding credentials
* fixed issues
* refactor: remove redundant Azure and AWS credential assignments in vault secret functions
* fixed issue
* initial vault setup script
* complete vault authentication handling
* review requested change fix
* fixed issues according to the pr review
* fixed issues in docker compose file relevant to pr review

Co-authored-by: Charith Nuwan Bimsara <59943919+nuwangeek@users.noreply.github.com>
Co-authored-by: erangi-ar <erangika.ariyasena@rootcode.io>

* initial streaming updates
* fixed requested changes
* fixed issues
* complete stream handling in python end
* remove unnecessary files
* fix test environment issue
* fixed constant issue

Co-authored-by: erangi-ar <111747955+erangi-ar@users.noreply.github.com>
Co-authored-by: erangi-ar <erangika.ariyasena@rootcode.io>
1 parent 67f7c05 commit c29bd2f

File tree

5 files changed: +17 additions, -15 deletions


DSL/CronManager/script/store_secrets_in_vault.sh

Lines changed: 1 addition & 1 deletion
@@ -68,7 +68,7 @@ build_vault_path() {
     model=$(get_model_name)
   fi

-  if [ "$deploymentEnvironment" = "test" ]; then
+  if [ "$deploymentEnvironment" = "testing" ]; then
     echo "secret/$secret_type/connections/$platform/$deploymentEnvironment/$connectionId"
   else
     echo "secret/$secret_type/connections/$platform/$deploymentEnvironment/$model"
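The branch above keys "testing" connection secrets by connection id and all other environments by model name. A Python sketch of the same path logic (illustrative only, not code from the repo; the argument names are assumptions):

```python
def build_vault_path(secret_type, platform, environment, connection_id, model):
    # Mirrors the shell branch: "testing" connections are keyed by
    # connection id, every other environment by model name.
    leaf = connection_id if environment == "testing" else model
    return f"secret/{secret_type}/connections/{platform}/{environment}/{leaf}"

print(build_vault_path("llm", "openai", "testing", "42", "gpt-4o"))
# secret/llm/connections/openai/testing/42
```

Note that the fix matters precisely here: a caller sending "testing" against the old `"test"` comparison would silently fall into the model-keyed branch.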

DSL/Ruuter.private/rag-search/POST/inference/test.yml

Lines changed: 1 addition & 1 deletion
@@ -62,7 +62,7 @@ call_orchestrate_endpoint:
     body:
       connectionId: ${connectionId}
       message: ${message}
-      environment: "test"
+      environment: "testing"
     headers:
       Content-Type: "application/json"
     result: orchestrate_result
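The request body that this Ruuter step posts can be sketched as a plain dict (a hypothetical helper; field names follow the YAML `body:` mapping above):

```python
def build_orchestrate_body(connection_id: str, message: str) -> dict:
    # The environment value must be the renamed "testing", not the old
    # "test", or the orchestration service's Literal validation rejects it.
    return {
        "connectionId": connection_id,
        "message": message,
        "environment": "testing",
    }

print(build_orchestrate_body("17", "Hello"))
```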

src/llm_orchestration_service.py

Lines changed: 10 additions & 9 deletions
@@ -27,6 +27,7 @@
     INPUT_GUARDRAIL_VIOLATION_MESSAGE,
     OUTPUT_GUARDRAIL_VIOLATION_MESSAGE,
     GUARDRAILS_BLOCKED_PHRASES,
+    TEST_DEPLOYMENT_ENVIRONMENT,
 )
 from src.utils.cost_utils import calculate_total_costs, get_lm_usage_since
 from src.guardrails import NeMoRailsAdapter, GuardrailCheckResult

@@ -770,7 +771,7 @@ def handle_input_guardrails(

     if not input_check_result.allowed:
         logger.warning(f"Input blocked by guardrails: {input_check_result.reason}")
-        if request.environment == "test":
+        if request.environment == TEST_DEPLOYMENT_ENVIRONMENT:
             logger.info(
                 "Test environment detected – returning input guardrail violation message."
             )

@@ -941,7 +942,7 @@ def _initialize_guardrails(
     Initialize NeMo Guardrails adapter.

     Args:
-        environment: Environment context (production/test/development)
+        environment: Environment context (production/testing/development)
         connection_id: Optional connection identifier

     Returns:

@@ -1257,7 +1258,7 @@ def _initialize_llm_manager(
     Initialize LLM Manager with proper configuration.

     Args:
-        environment: Environment context (production/test/development)
+        environment: Environment context (production/testing/development)
         connection_id: Optional connection identifier

     Returns:

@@ -1480,7 +1481,7 @@ def _generate_rag_response(
         logger.warning(
             "Response generator unavailable – returning technical issue message."
         )
-        if request.environment == "test":
+        if request.environment == TEST_DEPLOYMENT_ENVIRONMENT:
             logger.info(
                 "Test environment detected – returning technical issue message."
             )

@@ -1547,7 +1548,7 @@ def _generate_rag_response(
         )
         if question_out_of_scope:
             logger.info("Question determined out-of-scope – sending fixed message.")
-            if request.environment == "test":
+            if request.environment == TEST_DEPLOYMENT_ENVIRONMENT:
                 logger.info(
                     "Test environment detected – returning out-of-scope message."
                 )

@@ -1568,7 +1569,7 @@ def _generate_rag_response(

         # In-scope: return the answer as-is (NO citations)
         logger.info("Returning in-scope answer without citations.")
-        if request.environment == "test":
+        if request.environment == TEST_DEPLOYMENT_ENVIRONMENT:
             logger.info("Test environment detected – returning generated answer.")
             return TestOrchestrationResponse(
                 llmServiceActive=True,

@@ -1598,7 +1599,7 @@ def _generate_rag_response(
         }
     )
     # Standardized technical issue; no second LLM call, no citations
-    if request.environment == "test":
+    if request.environment == TEST_DEPLOYMENT_ENVIRONMENT:
         logger.info(
             "Test environment detected – returning technical issue message."
         )

@@ -1635,7 +1636,7 @@ def create_embeddings_for_indexer(

     Args:
         texts: List of texts to embed
-        environment: Environment (production, development, test)
+        environment: Environment (production, development, testing)
         connection_id: Optional connection ID for dev/test environments
         batch_size: Batch size for processing

@@ -1691,7 +1692,7 @@ def get_available_embedding_models_for_indexer(
     """Get available embedding models for vector indexer.

     Args:
-        environment: Environment (production, development, test)
+        environment: Environment (production, development, testing)

     Returns:
         Dictionary with available models and default model info
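Every hunk above applies the same pattern: gate verbose, test-only responses on a shared constant instead of a hard-coded `"test"` string. A simplified stand-in for that gate (not the service's actual handler; what non-test environments return is not shown in the diff, so the generic message here is a placeholder):

```python
TEST_DEPLOYMENT_ENVIRONMENT = "testing"  # value from llm_cochestrator_constants.py

def guardrail_reply(environment: str, reason: str) -> str:
    # The testing environment surfaces the detailed violation message;
    # other environments get a generic fallback (placeholder text).
    if environment == TEST_DEPLOYMENT_ENVIRONMENT:
        return f"Input blocked by guardrails: {reason}"
    return "A technical issue occurred. Please try again later."

print(guardrail_reply("testing", "blocked phrase"))
```

The bug being fixed is visible in this shape: before the constant existed, each `== "test"` comparison had to be updated by hand when the environment name changed, and several were missed.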

src/llm_orchestrator_config/llm_cochestrator_constants.py

Lines changed: 1 addition & 0 deletions
@@ -24,3 +24,4 @@

 # Streaming configuration
 STREAMING_ALLOWED_ENVS = {"production"}
+TEST_DEPLOYMENT_ENVIRONMENT = "testing"
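With both constants in one module, environment checks reduce to trivial helpers (hypothetical helpers for illustration; only the two constants come from the file above):

```python
STREAMING_ALLOWED_ENVS = {"production"}
TEST_DEPLOYMENT_ENVIRONMENT = "testing"

def is_test_env(environment: str) -> bool:
    # Single point of truth: renaming the environment now needs one edit,
    # not a sweep over every scattered `== "test"` comparison.
    return environment == TEST_DEPLOYMENT_ENVIRONMENT

def streaming_allowed(environment: str) -> bool:
    # Streaming is restricted to production per STREAMING_ALLOWED_ENVS.
    return environment in STREAMING_ALLOWED_ENVS

print(is_test_env("testing"), is_test_env("test"), streaming_allowed("testing"))
# True False False
```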

src/models/request_models.py

Lines changed: 4 additions & 4 deletions
@@ -33,7 +33,7 @@ class OrchestrationRequest(BaseModel):
         ..., description="Previous conversation history"
     )
     url: str = Field(..., description="Source URL context")
-    environment: Literal["production", "test", "development"] = Field(
+    environment: Literal["production", "testing", "development"] = Field(
         ..., description="Environment context"
     )
     connection_id: Optional[str] = Field(

@@ -66,7 +66,7 @@ class EmbeddingRequest(BaseModel):
     """

     texts: List[str] = Field(..., description="List of texts to embed", max_length=1000)
-    environment: Literal["production", "development", "test"] = Field(
+    environment: Literal["production", "development", "testing"] = Field(
         ..., description="Environment for model resolution"
     )
     batch_size: Optional[int] = Field(

@@ -97,7 +97,7 @@ class ContextGenerationRequest(BaseModel):
         ..., description="Document content for caching", max_length=100000
     )
     chunk_prompt: str = Field(..., description="Chunk-specific prompt", max_length=5000)
-    environment: Literal["production", "development", "test"] = Field(
+    environment: Literal["production", "development", "testing"] = Field(
         ..., description="Environment for model resolution"
     )
     use_cache: bool = Field(default=True, description="Enable prompt caching")

@@ -138,7 +138,7 @@ class TestOrchestrationRequest(BaseModel):
     """Model for simplified test orchestration request."""

     message: str = Field(..., description="User's message/query")
-    environment: Literal["production", "test", "development"] = Field(
+    environment: Literal["production", "testing", "development"] = Field(
         ..., description="Environment context"
     )
     connectionId: Optional[int] = Field(
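Because these are Pydantic `Literal` fields, the rename is enforced at the API boundary: a request still carrying the old `"test"` value now fails validation. A stdlib-only stand-in for that check (the Pydantic models above do this automatically):

```python
from typing import Literal, get_args

Environment = Literal["production", "testing", "development"]

def validate_environment(value: str) -> str:
    # A Literal field rejects anything outside its listed values,
    # so the stale "test" string is refused after this commit.
    if value not in get_args(Environment):
        raise ValueError(f"environment must be one of {get_args(Environment)}")
    return value

print(validate_environment("testing"))
# testing
```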
