Outcome of node or network failures

  • Kernel panic
    Reboot after 30 s
  • uhu not responding in operation
    Node waits indefinitely for resume of nfs operation

    dyn-2e-f4-6d:~# date
    Fri May 14 12:10:35 CEST 2010
    dyn-2e-f4-6d:~# dir
    [  959.792370] nfs: server 192.168.16.10 not responding, still trying
    [ 1003.793368] nfs: server 192.168.16.10 not responding, still trying
    [ 1917.894851] nfs: server 192.168.16.10 OK
    [ 1917.896113] nfs: server 192.168.16.10 OK

    this could be improved by userland watchdog deamon which reads from nfs

  • uhu not responding during restart
    Node constantly tries to reboot