Hello,
I set up a software watchdog with the command:
Code:
$ sudo modprobe softdog soft_margin=60
The kernel log then showed this message:
Code:
[ 3.757002] softdog: Software Watchdog Timer: 0.08 initialized. soft_noboot=0 soft_margin=60 sec soft_panic=0 (nowayout=0)
When I trigger a node reboot with the command below, the node reboots, but nothing in the log indicates that softdog triggered the reboot:
Code:
echo a | sudo tee /dev/watchdog
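For context, a minimal sketch of what this command appears to do, assuming the usual watchdog "magic close" convention: opening the device starts the timer, and closing it without first writing the character V leaves the timer armed, so it should fire after soft_margin seconds. The function names wd_arm and wd_disarm are hypothetical, and the device path is a parameter so the functions can be tried on an ordinary file first. Writing to the real device needs root.

```shell
# Sketch only: wd_arm and wd_disarm are hypothetical helper names.
# The device path defaults to /dev/watchdog but can be any file for a dry run.

# Open the device, write one byte, and close it. Without the magic
# character the timer stays armed and should fire after soft_margin seconds.
wd_arm() {
    dev="${1:-/dev/watchdog}"
    printf 'a' > "$dev"
}

# Write the magic character 'V' just before close. With nowayout=0 this is
# expected to stop the timer, so no reboot should follow.
wd_disarm() {
    dev="${1:-/dev/watchdog}"
    printf 'V' > "$dev"
}
```

Calling wd_disarm after loading the module with nowayout=0 should leave the node running, which gives a baseline to compare against the reboot case.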
If the software watchdog is loaded with the soft_noboot=1 option, to verify softdog without an actual reboot, the event is logged:
Code:
softdog: Triggered - Reboot ignored
Based on the softdog implementation, a message should also be logged before a real reboot:
https://github.com/spacex/kernel-cen...hdog/softdog.c
Code:
static void watchdog_fire(unsigned long data)
{
	if (soft_noboot)
		pr_crit("Triggered - Reboot ignored\n");
	else if (soft_panic) {
		pr_crit("Initiating panic\n");
		panic("Software Watchdog Timer expired");
	} else {
		pr_crit("Initiating system reboot\n");
		emergency_restart();
		pr_crit("Reboot didn't ?????\n");
	}
}
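A minimal sketch of scanning a saved log for the messages watchdog_fire() can emit. The "softdog: " prefix is taken from the soft_noboot=1 output above. The helper name softdog_events and the default path /var/log/messages are assumptions. A scan like this can only find messages that reached the disk before the restart.

```shell
# Hypothetical helper: print softdog trigger messages found in a log file.
# The default path /var/log/messages is an assumption about the syslog setup.
softdog_events() {
    log_file="${1:-/var/log/messages}"
    # Match the three messages that watchdog_fire() logs before acting.
    grep -E 'softdog: (Triggered - Reboot ignored|Initiating panic|Initiating system reboot)' "$log_file"
}
```

grep exits with a non-zero status when no line matches, so the function can be used directly in an if condition.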
OS: CentOS Linux release 7.9.2009 (Core)
Linux test1 3.10.0-1160.62.1.el7.x86_64 #1 SMP Tue Apr 5 16:57:59 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
It looks like the node was rebooted before the log message was written to disk.
Is there a way to verify that the node was rebooted by the software watchdog?
Thank you