11.2U7 Warden Jails Lose Networking a few hours after reboot
Description
Problem/Justification
Impact
Activity

Waqar Ahmed February 3, 2020 at 4:10 AM
Thank you for your co-operation.

Brad Lhotsky January 31, 2020 at 10:36 PM
I'm going to upgrade to 11.3, but I have a few straggler warden jails. Since those are no longer supported on 11.3, I think you can close this issue. If I experience the same problem with iocage jails on 11.3, it will warrant its own ticket.
Thanks for your support.

Waqar Ahmed January 29, 2020 at 11:35 PM
Hi, can you please let us know if there are any updates on the issue? Thank you.

William Gryzbowski January 6, 2020 at 9:02 PM (Edited)
I am sorry you feel us helping you troubleshoot a problem in a free and open source product, free of charge, is a ridiculous ask.
Unfortunately, you're the first user ever to report this specific problem, and we have been unable to reproduce the issue in our internal systems.
If you can think of anything that would help us troubleshoot it or reproduce the problem without causing any downtime, please let us know.
Thanks for your help.

Brad Lhotsky January 6, 2020 at 7:48 PM
The TL;DR: Just shutting off the warden jails for an extended time is a ridiculous ask. I use this server and those jails every day, and the time between failures is 4-48h. So, I've started the process of migrating all the warden jails to iocage, as that's the only way for me to accomplish what's been asked.
However, I have rebooted back into U6 and haven't experienced any problems with my warden jails. It's been over a week without any networking issues. When I have all the warden jails migrated to iocage, I'll reapply U7 and see if the issue persists.
I am a bit disappointed by the troubleshooting requiring me to disable 100% of the user facing value in the system to debug the issue.
I created a number of plugin jails on FreeNAS 9. Those plugins were still working fine as of 11.2U6. Since upgrading to 11.2U7, my jails lose their networking after every 20-40 hours of uptime. There are no logs in dmesg, messages, or daemon.log to indicate what's going on.
I cannot ping the host IP from the jail, and I cannot ping the jail IP from the host. I'm not as familiar with FreeBSD jails as I should be to debug this further, but I've tried:
1. Restarting the jail via the GUI, via jailctl, via service jail restart
2. Restarting the netif (suspected a bridge issue)
3. Manually bringing the interfaces up and down on both the host and jail
None of those restore connectivity. The only thing that restores networking is rebooting the FreeNAS server. This is not ideal.
I'm open to further debugging, but I have no idea what else I could google. I didn't change any Tunables, or install anything beyond the U7 update. This is seriously frustrating.
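Since nothing shows up in the logs and the failure interval is so variable, one low-impact way to narrow this down might be to record exactly when a jail stops answering pings, so the timestamp can be lined up against cron jobs, periodic tasks, or other host activity. The following is only a minimal sketch, assuming Python 3 is available on the host; the function names, the log file name, and the IP address in the usage note are hypothetical placeholders and not part of FreeNAS or warden.

```python
import subprocess
import time
from datetime import datetime


def ping_once(host, timeout_s=5):
    """Return True if a single ICMP echo request to `host` is answered."""
    try:
        # "-c 1" sends one packet; the overall deadline is enforced by
        # subprocess rather than a platform-specific ping flag.
        result = subprocess.run(
            ["ping", "-c", "1", host],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,
        )
    except (subprocess.TimeoutExpired, OSError):
        # No reply within the deadline, or ping could not be started.
        return False
    return result.returncode == 0


def first_failures(samples):
    """Return timestamps where reachability flips from up to down.

    `samples` is an iterable of (timestamp, ok) pairs in time order.
    The link is assumed to be up before the first sample.
    """
    failures = []
    previous_ok = True
    for stamp, ok in samples:
        if previous_ok and not ok:
            failures.append(stamp)
        previous_ok = ok
    return failures


def watch(host, interval_s=60, log_path="jail_ping.log"):
    """Ping `host` forever and append a line whenever its state changes."""
    previous_ok = True
    while True:
        ok = ping_once(host)
        if ok != previous_ok:
            state = "UP" if ok else "DOWN"
            with open(log_path, "a") as log:
                log.write(f"{datetime.now().isoformat()} {host} {state}\n")
        previous_ok = ok
        time.sleep(interval_s)
```

Calling `watch("192.0.2.10")` from the host (with the jail's real IP in place of that documentation address) would leave a `DOWN` line in `jail_ping.log` within about a minute of the jail losing connectivity. `first_failures` is the same edge-detection logic in a form that can be applied to samples collected some other way.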