Complete
Pinned fields
Click on the next to a field label to start pinning.
Details
Assignee
Vladimir VinogradenkoVladimir VinogradenkoReporter
Josh WiselyJosh WiselyLabels
Components
Fix versions
Affects versions
Priority
Medium
Details
Details
Assignee
Vladimir Vinogradenko
Vladimir VinogradenkoReporter
Josh Wisely
Josh WiselyLabels
Components
Fix versions
Affects versions
Priority
More fields
More fields
More fields
Katalon Platform
Katalon Platform
Katalon Platform
Created March 5, 2021 at 10:45 PM
Updated July 1, 2022 at 5:13 PM
Resolved March 9, 2021 at 7:23 PM
There are 6 replication tasks that run every hour, but 10m offset from each other due to yet other issues running more than 2 replication jobs at once causing the system to panic.
At some point a job will encounter the following exception:
[2021/03/05 21:20:16] WARNING [retention] [zettarepl.zettarepl] Remote retention failed on : error listing snapshots: SSHException('Timeout opening channel.')
After that point ALL replication jobs will be stuck in WAITING status claiming the job is already running.
The only way to clear this state is to reboot the system.
Again, I've checked the box to attach a debug, but I suspect the bug about running the debug still exists and thus it won't be automatically attached.