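RAC Node Eviction Caused by Lost Heartbeat
Amid a wave of database migrations, the host of one of our production business databases that was still being built rebooted. What was the cause?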
[root@nqzeyddb2 ~]# uptime
15:42:46 up 5 days, 22:29,  4 users,  load average: 0.08, 0.09, 0.10
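The alert log showed no obvious warnings, but the CRS log showed clear timeout warnings. We suspected that losing the heartbeat NIC had caused a split-brain, which led to the node eviction.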
2021-10-09 00:47:55.743: [    CSSD][580302592]clssnmPollingThread: node nqzeyddb1 (1) at 50% heartbeat fatal, removal in 14.910 seconds
2021-10-09 00:47:55.743: [    CSSD][580302592]clssnmPollingThread: node nqzeyddb1 (1) is impending reconfig, flag 2491406, misstime 15090
2021-10-09 00:47:55.743: [    CSSD][580302592]clssnmPollingThread: local diskTimeout set to 27000 ms, remote disk timeout set to 27000, impending reconfig status(1)
2021-10-09 00:47:55.743: [    CSSD][586610432]clssnmvDHBValidateNcopy: node 1, nqzeyddb1, has a disk HB, but no network HB, DHB has rcfg 528920094, wrtcnt, 880951, LATS 286157624, lastSeqNo 880924, uniqueness 1633579205, timestamp 1633711660/286147104
2021-10-09 00:47:55.863: [    CSSD][589764352]clssnmvDiskPing: Writing with status 0x3, timestamp 1633711675/286157744
2021-10-09 00:47:56.144: [    CSSD][594495232]clssnmvDiskPing: Writing with status 0x3, timestamp 1633711676/286158024
2021-10-09 00:47:56.306: [    CSSD][943818496]clssgmpcBuildNodeList: nodename for node 0 is NULL
2021-10-09 00:48:07.745: [    CSSD][580302592]clssnmPollingThread: node nqzeyddb1 (1) at 90% heartbeat fatal, removal in 2.910 seconds,
2021-10-09 00:48:10.656: [    CSSD][580302592]clssnmMarkNodeForRemoval: node 1, nqzeyddb1 marked for removal
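The timing in these lines can be cross-checked with a little arithmetic. The sketch below is not an Oracle tool. It is a hypothetical helper that parses the two clssnmPollingThread warnings, adds the "removal in" countdown to each log timestamp, and sums the misstime value (15090 ms) with the remaining time from the 50% warning. The function names and the regular expression are illustrative assumptions. Reading the resulting 30000 ms as the CSS misscount (network heartbeat timeout) is also an assumption, since the log itself does not name the parameter.

```python
import re
from datetime import datetime, timedelta

# The two clssnmPollingThread warnings copied from the CSSD log above.
LOG_LINES = [
    "2021-10-09 00:47:55.743: [    CSSD][580302592]clssnmPollingThread: "
    "node nqzeyddb1 (1) at 50% heartbeat fatal, removal in 14.910 seconds",
    "2021-10-09 00:48:07.745: [    CSSD][580302592]clssnmPollingThread: "
    "node nqzeyddb1 (1) at 90% heartbeat fatal, removal in 2.910 seconds,",
]

# Illustrative pattern (an assumption, not an official log grammar).
PATTERN = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}): .*"
    r"clssnmPollingThread: node (?P<node>\S+) \(\d+\) "
    r"at (?P<pct>\d+)% heartbeat fatal, "
    r"removal in (?P<left>\d+\.\d+) seconds"
)


def parse_warning(line):
    """Return the fields of a 'heartbeat fatal' warning, or None if no match."""
    m = PATTERN.search(line)
    if m is None:
        return None
    logged_at = datetime.strptime(m["ts"], "%Y-%m-%d %H:%M:%S.%f")
    seconds_left = float(m["left"])
    return {
        "node": m["node"],
        "pct": int(m["pct"]),
        "logged_at": logged_at,
        "seconds_left": seconds_left,
        # Log time plus the countdown gives the expected removal time.
        "predicted_removal": logged_at + timedelta(seconds=seconds_left),
    }


def implied_timeout_ms(misstime_ms, seconds_left):
    """Time already missed plus time remaining, in milliseconds."""
    return misstime_ms + round(seconds_left * 1000)


if __name__ == "__main__":
    for line in LOG_LINES:
        w = parse_warning(line)
        print(w["node"], f'{w["pct"]}%', "predicted removal:", w["predicted_removal"])
    first = parse_warning(LOG_LINES[0])
    # 15090 is the misstime value logged at 00:47:55.743.
    print("implied timeout (ms):", implied_timeout_ms(15090, first["seconds_left"]))
```

Both warnings predict removal at about 00:48:10.65, which is within a few milliseconds of the clssnmMarkNodeForRemoval line at 00:48:10.656. The implied timeout of 30000 ms fits the picture of a network heartbeat that stayed silent for the whole window while the disk heartbeat was still being written.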
Afterwards we asked our network colleagues to check whether the switch showed any anomalies.
The result was plain to see.