SQL Server 2022 : Failover Cluster Instance2023/12/05 |
Configure SQL Server Failover Cluster Instance.
This example is based on the environment like follows.
+-------------------+ | [dlp.srv.world] | | ISCSI Target | +---------+---------+ 10.0.0.30| | +----------------------+ | +----------------------+ | [ node01.srv.world ] |10.0.0.51 | 10.0.0.52| [ node02.srv.world ] | | CLVM +----------+----------+ CLVM | | SQL Server | | | SQL Server | +----------------------+ | +----------------------+ VIP:10.0.0.50 |
[1] |
Configure basic Cluster setting on all Nodes of Failover Cluster Instance, refer to here. |
[2] |
Configure Fence Device on all Nodes of Failover Cluster Instance, refer to here. |
[3] |
Configure LVM shared storage on all Nodes of Failover Cluster Instance, refer to here. |
[4] | Disable SQL Server on all Nodes of Failover Cluster Instance and also install SQL Server for HA package. |
root@node01:~#
vi /etc/hosts # add cluster nodes # [MSSQL_HA] ⇒ any name that is used as SQL server resource name 10.0.0.50 MSSQL_HA 10.0.0.51 node01 node01.srv.world 10.0.0.52 node02 node02.srv.world |
[5] | Add filesystem resource and copy SQL Server Data to it on a Node that shared storage is active. |
root@node01:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync * Current DC: node01.srv.world (version 2.1.2-ada5c3b36e2) - partition with quorum * Last updated: Tue Dec 5 10:27:14 2023 * Last change: Tue Dec 5 10:25:44 2023 by root via cibadmin on node01.srv.world * 2 nodes configured * 2 resource instances configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world * Resource Group: ha_group: * lvm_ha (ocf:heartbeat:LVM-activate): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled
root@node01:~#
cp -pR /var/opt/mssql/data /var/opt/mssql/data.bk
# create filesystem resource root@node01:~# pcs resource create mssql_fs ocf:heartbeat:Filesystem device=/dev/vg_ha/lv_ha directory=/var/opt/mssql/data fstype=ext4 --group ha_group
pcs resource status * Resource Group: ha_group: * lvm_ha (ocf:heartbeat:LVM-activate): Started node01.srv.world * mssql_fs (ocf:heartbeat:Filesystem): Started node01.srv.world
root@node01:~#
root@node01:~# pcs constraint order start lvm_ha then mssql_fs Adding lvm_ha mssql_fs (kind: Mandatory) (Options: first-action=start then-action=start) root@node01:~# pcs constraint colocation add mssql_fs with lvm_ha
df -hT /var/opt/mssql/data Filesystem Type Size Used Avail Use% Mounted on /dev/mapper/vg_ha-lv_ha ext4 9.8G 24K 9.3G 1% /var/opt/mssql/dataroot@node01:~# cp -pR /var/opt/mssql/data.bk/* /var/opt/mssql/data/ root@node01:~# chown mssql:mssql /var/opt/mssql/data root@node01:~# ll /var/opt/mssql/data total 136732 drwxr-xr-x 3 mssql mssql 4096 Dec 5 10:28 ./ drwxrwx--- 7 mssql mssql 4096 Dec 5 10:28 ../ -rw-rw---- 1 mssql mssql 256 Dec 4 13:39 Entropy.bin drwx------ 2 root root 16384 Dec 5 10:24 lost+found/ -rw-rw---- 1 mssql mssql 4915200 Dec 4 15:14 master.mdf -rw-rw---- 1 mssql mssql 1835008 Dec 4 15:14 mastlog.ldf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 modellog.ldf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 model.mdf -rw-rw---- 1 mssql mssql 16056320 Dec 4 15:14 model_msdbdata.mdf -rw-rw---- 1 mssql mssql 1048576 Dec 4 15:14 model_msdblog.ldf -rw-rw---- 1 mssql mssql 1835008 Dec 4 15:14 model_replicatedmaster.ldf -rw-rw---- 1 mssql mssql 4915200 Dec 4 15:14 model_replicatedmaster.mdf -rw-rw---- 1 mssql mssql 16056320 Dec 4 15:14 msdbdata.mdf -rw-rw---- 1 mssql mssql 1048576 Dec 4 15:14 msdblog.ldf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb2.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb3.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb4.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb5.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb6.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb7.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb8.ndf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 tempdb.mdf -rw-rw---- 1 mssql mssql 8388608 Dec 4 15:14 templog.ldf |
[6] | Start SQL Server and add an account for Pacemaker on a Node that shared storage is active. |
root@node01:~# systemctl start mssql-server root@node01:~# sqlcmd -S localhost -U SA Password: # comfirm the servername and change it to cluster resource name 1> select @@servername 2> go -------------------------------------------------------------------------------------------------------------------------------- localhost (1 rows affected) 1> exec sp_dropserver 'localhost' 2> exec sp_addserver 'MSSQL_HA', 'local' 3> go 1> use master; 2> go # add [hacluster] user 1> create login hacluster with PASSWORD= N'password'; 2> go # add [sysadmin] role 1> alter server role [sysadmin] add member hacluster; 2> go 1> exitroot@node01:~# systemctl stop mssql-server
|
[7] | On all Nodes, add SQL Server account info that you created above. |
root@node01:~# echo 'hacluster' > /var/opt/mssql/secrets/passwd root@node01:~# echo 'password' >> /var/opt/mssql/secrets/passwd root@node01:~# chown root:root /var/opt/mssql/secrets/passwd root@node01:~# chmod 600 /var/opt/mssql/secrets/passwd |
[8] | Add SQL Server and Virtual IP address to Cluster Resource. |
root@node01:~# pcs resource create MSSQL_HA ocf:mssql:fci op start interval=60s --group ha_group root@node01:~# pcs resource create VIP ocf:heartbeat:IPaddr2 ip=10.0.0.50 cidr_netmask=24 op monitor interval=30s --group ha_group root@node01:~# pcs resource status * Resource Group: ha_group: * lvm_ha (ocf:heartbeat:LVM-activate): Started node01.srv.world * mssql_fs (ocf:heartbeat:Filesystem): Started node01.srv.world * MSSQL_HA (ocf:mssql:fci): Started node01.srv.world * VIP (ocf:heartbeat:IPaddr2): Started node01.srv.worldroot@node01:~# pcs constraint order start mssql_fs then MSSQL_HA Adding mssql_fs MSSQL_HA (kind: Mandatory) (Options: first-action=start then-action=start) root@node01:~# pcs constraint colocation add MSSQL_HA with mssql_fs root@node01:~# pcs constraint config Location Constraints: Ordering Constraints: start lvm_ha then start mssql_fs (kind:Mandatory) start mssql_fs then start MSSQL_HA (kind:Mandatory) Colocation Constraints: lvm_ha with scsi-shooter (score:INFINITY) mssql_fs with lvm_ha (score:INFINITY) MSSQL_HA with mssql_fs (score:INFINITY) Ticket Constraints:root@node01:~# pcs resource status * Resource Group: ha_group: * lvm_ha (ocf:heartbeat:LVM-activate): Started node01.srv.world * mssql_fs (ocf:heartbeat:Filesystem): Started node01.srv.world * MSSQL_HA (ocf:mssql:fci): Started node01.srv.world * VIP (ocf:heartbeat:IPaddr2): Started node01.srv.world |
[9] | It's OK. Stop active node manually and verify failover normally. |
root@node01:~# pcs resource status * Resource Group: ha_group: * lvm_ha (ocf:heartbeat:LVM-activate): Started node01.srv.world * mssql_fs (ocf:heartbeat:Filesystem): Started node01.srv.world * MSSQL_HA (ocf:mssql:fci): Started node01.srv.world * VIP (ocf:heartbeat:IPaddr2): Started node01.srv.worldroot@node01:~# sqlcmd -S 10.0.0.50 -U SA -Q 'select @@servername' Password: ---------------------------------------------- MSSQL_HA (1 rows affected) # stop active node root@node01:~# pcs cluster stop node01.srv.world
# on another node root@node02:~# pcs resource status * Resource Group: ha_group: * lvm_ha (ocf:heartbeat:LVM-activate): Started node02.srv.world * mssql_fs (ocf:heartbeat:Filesystem): Started node02.srv.world * MSSQL_HA (ocf:mssql:fci): Started node02.srv.world * VIP (ocf:heartbeat:IPaddr2): Started node02.srv.worldroot@node02:~# sqlcmd -S 10.0.0.50 -U SA -Q 'select @@servername' Password: ---------------------------------------- MSSQL_HA (1 rows affected) |
Sponsored Link |