SMART tests do not start

Description

None of the SMART tests set up at installation or in the web GUI ever run, except for the short tests.

Even though the "Tasks" tab displays three types of scheduled SMART tests (conveyance, long and short), only the short SMART tests are actually run on the disks.

I have seen many issues mentioning this or a similar problem on previous releases of TrueNAS Core, but my installation, which goes back to around 11.x, still exhibits this behavior in TrueNAS-12.0-U8.1.

Problem/Justification

None

Impact

None

Activity

Bug Clerk June 23, 2022 at 10:40 AM

Bug Clerk June 9, 2022 at 9:00 PM

Julien-Benjamin RUIZ May 23, 2022 at 4:02 AM
Edited

I was not aware smartmontools behaved like this.

I'll try that and report back.

Edit: Sorry, I said that I did not know how smartmontools behaved, but I actually do; of course, you cannot run two tests on the same drive at the same time. The only layer in the stack whose behavior I do not really know is your middleware.

Moreover, since the web GUI allows overlapping tests and does not warn about them, I did not know whether your middleware was “queuing” SMART tests or not. BTW, the settings you saw were the defaults set by the installer; I did not think changes were needed, especially since, again, I do not know how your middleware works.

Either way, there is no feedback, either from smartmontools (obviously, but this cannot be helped) or from your GUI/middleware. I know that test lengths differ across test types and drive models, but I feel there should be some warning, at the very least, when two tests are scheduled to run at the same time. Or maybe the SMART tests could be queued, since scheduling is handled by your middleware. Especially since, as said above, these tests were set up like this by default by the installer (IIRC, it was TrueNAS-12.0-U3 for this machine).

Anyway, just to be sure, I will report back after the next long SMART test runs successfully (or not).

Vladimir Vinogradenko May 19, 2022 at 9:47 PM

Now I see what the issue is: both long and short tests are scheduled to run at midnight. smartmontools behaves the following way (from man smartd.conf):

"As soon as a match is found, the test will be started and no additional matches will be sought for that device and that polling cycle."

So the short test is started every day, and on Sunday that prevents the long test from being started. You can reschedule the short test to run on every day of the week except Sunday, and that'll fix the issue.
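The overlap described above can be checked with a small script. This is only a rough sketch under stated assumptions: the schedule model (a dict of test type to weekdays and hour), the helper name `find_collisions`, and the weekday numbering (1 = Monday to 7 = Sunday) are hypothetical and are not taken from smartd or the TrueNAS middleware. It only reports time slots shared by two test types; it does not model which test smartd would actually start.

```python
from itertools import combinations

# Hypothetical schedule model (an assumption for illustration only):
# test type -> (set of weekdays, hour), with 1 = Monday ... 7 = Sunday.


def find_collisions(schedules):
    """Return (type_a, type_b, weekday, hour) for every slot two tests share."""
    collisions = []
    pairs = combinations(schedules.items(), 2)
    for (type_a, (days_a, hour_a)), (type_b, (days_b, hour_b)) in pairs:
        if hour_a != hour_b:
            continue  # different hours can never land in the same slot
        for day in sorted(days_a & days_b):
            collisions.append((type_a, type_b, day, hour_a))
    return collisions


# Schedule as described in this ticket: short test daily, long test on Sunday,
# both at midnight.
reported = {"short": (set(range(1, 8)), 0), "long": ({7}, 0)}

# Suggested fix: short test on every day except Sunday.
fixed = {"short": (set(range(1, 7)), 0), "long": ({7}, 0)}

print(find_collisions(reported))  # [('short', 'long', 7, 0)]
print(find_collisions(fixed))     # []
```

With the reported schedule, the only shared slot is Sunday at midnight; with the short test moved off Sunday, no slots are shared.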

Julien-Benjamin RUIZ May 19, 2022 at 6:59 PM
Edited

Since I had multiple HDD and HDD cable issues, there were indeed lots of reboots while troubleshooting and fixing them (as you can probably tell from the log on the main pool). It is still resilvering.

FYI, since I guess you will see it in the logs: I've also moved the location of /var/log multiple times, and there were other unplanned reboots to address other problems and maintenance tasks.

I've uploaded a new debug archive.

Edit: In case you need the SMART logs from the disks, the only other tests run on them were done manually, as the timestamps should show.

Complete

Details

Assignee

Reporter

Impact

Low

Time remaining

0m

Components

Affects versions

Priority

Created May 14, 2022 at 6:23 PM
Updated July 14, 2022 at 2:03 PM
Resolved July 14, 2022 at 2:03 PM