@kamanavishnu
Collaborator

@kamanavishnu kamanavishnu commented Jan 15, 2026

Problem & Summary

Maintenance jobs currently have no entity-level metrics. All existing metrics are aggregates, so they do not point directly to an action item. For example, if 10 maintenance jobs failed, nothing indicates which entities those jobs ran against; someone has to parse the logs to find the tables affected by the failures.

This change adds granular, task-level maintenance job metrics to address such cases.

Added metrics:

  • maintenance_job_triggered
    • Counter -- tracks the number of maintenance jobs triggered per entity
  • maintenance_job_skipped
    • Counter -- tracks the number of maintenance jobs skipped per entity
  • maintenance_job_completed
    • Counter -- tracks the number of maintenance jobs completed per entity, along with the status of each job
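
To illustrate how the three counters could be keyed per entity, here is a minimal, self-contained sketch. It is not the code added in this PR: the `EntityMetricsSketch` class, the in-memory `Registry`, the tag names (`database`, `table`, `job_type`, `status`), and the sample values are all assumptions made for illustration. Only the three metric names come from the list above.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

/** Hypothetical sketch of entity-level counters; not the classes from this PR. */
public class EntityMetricsSketch {

  /** Minimal in-memory stand-in for a metrics backend (an assumption). */
  public static class Registry {
    private final Map<String, LongAdder> counters = new ConcurrentHashMap<>();

    /** Adds one to the counter identified by the metric name and tag set. */
    public void increment(String metric, Map<String, String> tags) {
      counters.computeIfAbsent(key(metric, tags), k -> new LongAdder()).increment();
    }

    /** Returns the current count, or 0 if the counter was never incremented. */
    public long count(String metric, Map<String, String> tags) {
      LongAdder adder = counters.get(key(metric, tags));
      return adder == null ? 0L : adder.sum();
    }

    private static String key(String metric, Map<String, String> tags) {
      // Copying into a TreeMap sorts tag names, so equal tag sets give equal keys.
      return metric + new TreeMap<String, String>(tags);
    }
  }

  /** Builds the per-entity tags; the tag names are assumptions. */
  public static Map<String, String> entityTags(String database, String table, String jobType) {
    Map<String, String> tags = new TreeMap<>();
    tags.put("database", database);
    tags.put("table", table);
    tags.put("job_type", jobType);
    return tags;
  }

  public static void recordTriggered(Registry registry, Map<String, String> entity) {
    registry.increment("maintenance_job_triggered", entity);
  }

  public static void recordSkipped(Registry registry, Map<String, String> entity) {
    registry.increment("maintenance_job_skipped", entity);
  }

  /** Completed jobs carry an extra status tag alongside the entity tags. */
  public static void recordCompleted(
      Registry registry, Map<String, String> entity, String status) {
    Map<String, String> tags = new TreeMap<>(entity);
    tags.put("status", status);
    registry.increment("maintenance_job_completed", tags);
  }

  public static void main(String[] args) {
    Registry registry = new Registry();
    // Sample entity and job type are illustrative values only.
    Map<String, String> orders = entityTags("db1", "orders", "snapshot_expiration");

    recordTriggered(registry, orders);
    recordCompleted(registry, orders, "FAILED");

    Map<String, String> failedOrders = new TreeMap<>(orders);
    failedOrders.put("status", "FAILED");
    System.out.println(
        "failed jobs for db1.orders: "
            + registry.count("maintenance_job_completed", failedOrders));
  }
}
```

With counters keyed this way, a failure count can be filtered by database and table instead of being recovered from logs. One trade-off of per-entity tags is higher metric cardinality, which grows with the number of tables.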

Changes

  • Client-facing API Changes
  • Internal API Changes
  • Bug Fixes
  • New Features
  • Performance Improvements
  • Code Style
  • Refactoring
  • Documentation
  • Observability
    • Add new metrics to track maintenance job updates at an entity level
  • Tests

Testing Done

This change only adds new metrics; no additional tests were added.

…enance job has been triggered for a particular table/ database
@kamanavishnu kamanavishnu marked this pull request as ready for review January 15, 2026 22:15
@shanthoosh shanthoosh merged commit 1b9bbe7 into linkedin:main Jan 16, 2026
1 check passed
@dushyantk1509
Contributor

@kamanavishnu This PR seems to be missing maintenance_job_triggered metrics for parallel execution paths - buildOperationTaskListInParallel. Is this intentional?

@kamanavishnu
Collaborator Author

> @kamanavishnu This PR seems to be missing maintenance_job_triggered metrics for parallel execution paths - buildOperationTaskListInParallel. Is this intentional?

Good catch, updated it here
