extend self-test log processing by aritas1 · Pull Request #151 · prometheus-community/smartctl_exporter

aritas1 · 2023-08-20T15:08:32Z

this adds metrics for monitoring the latest self-tests execution time.

also fix the missing smartctl_device_self_test_log_count metric due to missing --log=selftest argument.

Signed-off-by: Aritas1 <mail@aritas.de>

robbat2 · 2023-10-17T04:59:35Z

smartctl.go

+	// assume the table will always be in descending order
+	processedTypes := make(map[string]bool)
+
+	for _, logEntry := range smart.json.Get("ata_smart_self_test_log.standard.table").Array() {


This should accept either standard or extended. Some args & device combinations only have one of them. The layout of the json struct is the same.

robbat2 · 2023-10-17T05:01:34Z

smartctl.go

+			logTestType = "unknown"
+		}
+
+		if !processedTypes[logTestType] {


this is implicitly trusting that the tests appear in newest to oldest order. I don't know if I trust drives enough for that.

robbat2 · 2023-10-17T05:08:13Z

smartctl.go

+		testTime = testTime * 60 * 60
+
+		// skip running tests
+		if testRunningIndicator != 0 {


this is not correct, from one of my systems:

"status": { "value": 41, "string": "Interrupted (host reset)", "remaining_percent": 90 }

status.passeed is NOT present in this case.

I don't have any SATA drives w/ failing checks to compare presentlyy, but I worry they are also non-zero.

Ok, it's definetly in need of work; also in the smartctl sources:

std::string msgstat; switch (test_status >> 4) { case 0x0: msgstat = "Completed without error"; break; case 0x1: msgstat = "Aborted by host"; break; case 0x2: msgstat = "Interrupted (host reset)"; break; case 0x3: msgstat = "Fatal or unknown error"; break; case 0x4: msgstat = "Completed: unknown failure"; break; case 0x5: msgstat = "Completed: electrical failure"; break; case 0x6: msgstat = "Completed: servo/seek failure"; break; case 0x7: msgstat = "Completed: read failure"; break; case 0x8: msgstat = "Completed: handling damage??"; break; case 0xf: msgstat = "Self-test routine in progress"; break; default: msgstat = strprintf("Unknown status (0x%x)", test_status >> 4); }

So if it's 0xF then skip it as running; otherwise map the error.

robbat2 · 2023-10-17T05:10:46Z

smartctl.go

 }

+func (smart *SMARTctl) mineDeviceSelfTest() {
+	validTypes := map[int]string{


from smartctl sources:

switch (test_type) { case 0x00: msgtest = "Offline"; break; case 0x01: msgtest = "Short offline"; break; case 0x02: msgtest = "Extended offline"; break; case 0x03: msgtest = "Conveyance offline"; break; case 0x04: msgtest = "Selective offline"; break; case 0x7f: msgtest = "Abort offline test"; break; case 0x81: msgtest = "Short captive"; break; case 0x82: msgtest = "Extended captive"; break; case 0x83: msgtest = "Conveyance captive"; break; case 0x84: msgtest = "Selective captive"; break; default: if ((0x40 <= test_type && test_type <= 0x7e) || 0x90 <= test_type) msgtest = strprintf("Vendor (0x%02x)", test_type); else msgtest = strprintf("Reserved (0x%02x)", test_type); }

extend self-test log processing

9e14bc2

Signed-off-by: Aritas1 <mail@aritas.de>

aritas1 force-pushed the master branch from 528396b to 9e14bc2 Compare August 20, 2023 15:16

robbat2 suggested changes Oct 17, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extend self-test log processing#151

extend self-test log processing#151
aritas1 wants to merge 1 commit intoprometheus-community:masterfrom
aritas1:master

aritas1 commented Aug 20, 2023

Uh oh!

robbat2 Oct 17, 2023

Uh oh!

robbat2 Oct 17, 2023

Uh oh!

robbat2 Oct 17, 2023

Uh oh!

robbat2 Oct 17, 2023

Uh oh!

robbat2 Oct 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aritas1 commented Aug 20, 2023

Uh oh!

robbat2 Oct 17, 2023

Choose a reason for hiding this comment

Uh oh!

robbat2 Oct 17, 2023

Choose a reason for hiding this comment

Uh oh!

robbat2 Oct 17, 2023

Choose a reason for hiding this comment

Uh oh!

robbat2 Oct 17, 2023

Choose a reason for hiding this comment

Uh oh!

robbat2 Oct 17, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants