UEFI boot fails, possibly due to large number of devices

Description

I did an update from 12.0-BETA2.1 to RC1 and after the initial reboot the system did not come back up.

I made a config backup during the upgrade, so I downloaded the RC1 ISO to try a fresh install, but the RC1 ISO won't boot either. The "\" cursor spins for a few seconds, then stops.

The attached screenshot shows the installed version trying to boot. I never see any errors during either the installed or the ISO boot.

System is Supermicro SSG-6048R-E1CR36H, Intel E5-2670 v3. If there is a way to get more verbose output or any kind of debug logs I'll be happy to grab them if you can point me in the right direction.

Problem/Justification

None

Impact

None

Activity

Ryan Moeller, October 13, 2020 at 2:53 PM

This is a limitation of the system firmware. The workaround is to disable devices in firmware as you have done so that the boot disks are not left out of the device list.

Ryan Morris, October 12, 2020 at 8:25 PM
Edited

I wasn't sure which PCI slot was which, so I went the trial-and-error route. (These and the SFP+ card are the only ones attached.)

Disabling all but the slot 1 OPROM: showed 73 devices, got past that screen, but errored out at the kernel selection after the autoboot timer finished, with "invalid command".

Disabling all but the slot 2 OPROM: showed 48 devices, booted fine.
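
The two trial-and-error results above can be written down as a small table to show where the failure point sits. This is only a sketch of the bookkeeping: the device counts and outcomes are copied from the comment, the `bracket` helper is a hypothetical name, and nothing here measures or explains the firmware's actual limit.

```python
# Observations copied from the comment above; this script measures nothing itself.
# "booted" is False for slot 1 because it failed at kernel selection ("invalid command").
observations = [
    {"oprom_enabled": "slot 1 only", "devices": 73, "booted": False},
    {"oprom_enabled": "slot 2 only", "devices": 48, "booted": True},
]


def bracket(results):
    """Return (largest device count that booted, smallest count that failed)."""
    ok = [r["devices"] for r in results if r["booted"]]
    bad = [r["devices"] for r in results if not r["booted"]]
    return (max(ok) if ok else None, min(bad) if bad else None)


low, high = bracket(observations)
print(f"boots with {low} devices reported, fails with {high}")
```

With only two data points this just brackets the problem between the two counts; more trials with other OPROM combinations would be needed to narrow it further.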

Alexander Motin, October 10, 2020 at 4:52 PM

The fact that the system boots with fewer devices reported is a good sign in itself, since it gives us a direction to move in.

Alexander Motin, October 10, 2020 at 4:51 PM

What I would check is whether the motherboard BIOS can completely disable the OPROM for the PCIe slots holding the MegaRAIDs, since you should not need them. If that is not available, then disable BIOS boot support in the MegaRAIDs' OPROMs, if possible.

Ryan Morris, October 9, 2020 at 8:00 PM
Edited

This system, when going into the Ctrl+R MegaRAID configuration, lists 3 LSI 3108 MegaRAID adapters. #1 has 2 drives, #2 has 12 drives, and #3 has 24 drives, for a total of 38 actual drives. They are all set to JBOD mode. The BIOS itself has 2 sections, SATA Configuration and sSATA Configuration; both are already disabled. When enabled, the two of them together show 10 additional SATA ports as available. I tried enabling them and rebooting, with the same result, and enabling them did not change the number of block devices shown while booting. Even though the configuration utility shows all 3 adapters as the same model, during boot they appear as shown in the attached image. Firmware is the latest available from Supermicro.

 

Again, after checking all this I would get stuck at the screen in the original post. My only option was to pull every drive except the 2 that TrueNAS CORE is installed on. Then it let me boot. I reinserted all the drives, did a zpool import, and I'm back up, but all the replication tasks were stuck on "HOLD" because pool BACKUPS was reported offline, even though I had imported it and it shows online under Storage. Editing one of the tasks and hitting save (no actual changes made) brings them all back into a runnable state.

 

 

User Configuration Error

Created September 17, 2020 at 4:29 PM
Updated July 1, 2022 at 4:55 PM
Resolved October 13, 2020 at 2:53 PM