Pacemaker : Set Fence Device2025/02/26 |
Set Fence Device on Cluster. (see about Fencing on the site below) https://access.redhat.com/documentation/ja-jp/red_hat_enterprise_linux/8/html/configuring_and_managing_high_availability_clusters/s1-fencing-haao
It's possible to use many kinds of devices for fencing, APC or IPMI and so on. +--------------------+ | [ ISCSI Target ] | | dlp.srv.world | +---------+----------+ 10.0.0.30| | +----------------------+ | +----------------------+ | [ Cluster Node#1 ] |10.0.0.51 | 10.0.0.52| [ Cluster Node#2 ] | | node01.srv.world +----------+----------+ node02.srv.world | | | | | +----------------------+ +----------------------+ |
[1] |
Configure ISCSI Target and Create a storage for fence device, refer to here. |
[2] | |
[3] | On all Cluster Nodes, Install SCSI Fence Agent. |
[root@node01 ~]#
[root@node01 ~]# dnf --enablerepo=highavailability -y install fence-agents-scsi watchdog pcp-zeroconf cp /usr/share/cluster/fence_scsi_check /etc/watchdog.d/ [root@node01 ~]# systemctl enable --now watchdog |
[4] | Configure Fencing on a Node. [sda] of the example below is the storage from ISCSI target. |
# confirm disk ID [root@node01 ~]# ll /dev/disk/by-id | grep sda | grep wwn lrwxrwxrwx. 1 root root 9 Feb 26 08:58 wwn-0x60014057008c2012b3b4ce2ac6d7e0f7 -> ../../sda # set fencing # [scsi-shooter] : any name # [pcmk_host_list=***] : specify cluster nodes # [devices=***] : disk ID [root@node01 ~]# pcs stonith create scsi-shooter fence_scsi pcmk_host_list="node01.srv.world node02.srv.world" devices=/dev/disk/by-id/wwn-0x60014057008c2012b3b4ce2ac6d7e0f7 meta provides=unfencing
# show config [root@node01 ~]# pcs stonith config scsi-shooter Resource: scsi-shooter (class=stonith type=fence_scsi) Attributes: scsi-shooter-instance_attributes devices=/dev/disk/by-id/wwn-0x60014057008c2012b3b4ce2ac6d7e0f7 pcmk_host_list="node01.srv.world node02.srv.world" Meta Attributes: scsi-shooter-meta_attributes provides=unfencing Operations: monitor: scsi-shooter-monitor-interval-60s interval=60s # show status # OK if the status of fence device is [Started] [root@node01 ~]# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node02.srv.world (version 3.0.0-5.el10-1353d31) - partition with quorum * Last updated: Wed Feb 26 09:06:57 2025 on node01.srv.world * Last change: Wed Feb 26 09:05:25 2025 by root via root on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled |
[5] | Try to test fencing. |
[root@node02 ~]# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node02.srv.world (version 3.0.0-5.el10-1353d31) - partition with quorum * Last updated: Wed Feb 26 09:06:57 2025 on node01.srv.world * Last change: Wed Feb 26 09:05:25 2025 by root via root on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled # fencing [root@node02 ~]# pcs stonith fence node01.srv.world Node: node01.srv.world fenced # target node turns to [OFFLINE] and it will be restarted # after rebooting, status wull be back to [Online] [root@node02 ~]# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node02.srv.world (version 3.0.0-5.el10-1353d31) - partition with quorum * Last updated: Wed Feb 26 09:08:00 2025 on node02.srv.world * Last change: Wed Feb 26 09:05:25 2025 by root via root on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node02.srv.world ] * OFFLINE: [ node01.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node02.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled |
Sponsored Link |
|