Nagios监控lvs服务
?1在lvs?server上安裝nrpe客戶端:
1.1,rpm方式安裝nrpe客戶端
下載地址:http://download.csdn.net/detail/mchdba/7493875
[root@localhost nagios]# ll總計 768 -rw-r--r-- 1 root root 713389 12-16 12:08 nagios-plugins-1.4.11-1.x86_64.rpm -rw-r--r-- 1 root root 32706 12-16 12:09 nrpe-2.12-1.x86_64.rpm -rw-r--r-- 1 root root 18997 12-16 12:08 nrpe-plugin-2.12-1.x86_64.rpm [root@localhost nagios]# rpm -ivh *.rpm --nodeps --force?
1.2?在配置文件最末尾,加入配置信息以及監控主機serverip地址
[root@ localhost nagios]# vim /etc/nagios/nrpe.cfg # add by tim on 2014-06-11 command[check_users]=/usr/local/nagios/libexec/check_users -w 8 -c 15 command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20 command[check_sda1]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /dev/sda command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z #command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 50 -c 80 command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 750 -c 800 command[check-host-alive]=/usr/local/nagios/libexec/check_ping -H localhost -w 3000.0,80% -c 5000.0,100% -p 5 allowed_hosts = 127.0.0.1, 10.2xx.3.xxcheck下命令是否生效:
[root@web-9 nrpe-2.15]# /usr/local/nagios/libexec/check_users -w 8 -c 15 USERS OK - 2 users currently logged in |users=2;8;15;0 [root@web-9 nrpe-2.15]#看到已經USERS OK -….命令已經生效。
?
1.3?啟動nrpe報錯例如以下:
[root@web-9 ~]# service nrpe restart Shutting down nrpe: [失敗] Starting nrpe: /usr/sbin/nrpe: error while loading shared libraries: libssl.so.6: cannot open shared object file: No such file or directory[失敗] [root@web-9 ~]# [root@db-m2-slave-1 nagios_client]# service nrpe start Starting nrpe: /usr/sbin/nrpe: error while loading shared libraries: libssl.so.6: cannot open shared object file: No such file or directory[失敗] [root@db-m2-slave-1 nagios_client]#建立連接
[root@db-m2-slave-1 nagios_client]# ln -s /usr/lib64/libssl.so /usr/lib64/libssl.so.6(假設沒有libssl.so,就採用別的libssl.so.10來做軟連接,ln -s /usr/lib64/libssl.so.10 /usr/lib64/libssl.so.6) [root@db-m2-slave-1 nagios_client]#再又一次啟動例如以下:
[root@db-m2-slave-1 nagios_client]# service nrpe start Starting nrpe: /usr/sbin/nrpe: error while loading shared libraries: libcrypto.so.6: cannot open shared object file: No such file or directory[失敗] [root@web-10 ~]# ll /usr/lib64/libcrypto.so lrwxrwxrwx. 1 root root 18 10月 13 2013 /usr/lib64/libcrypto.so -> libcrypto.so.1.0.0 [root@db-m2-slave-1 nagios_client]#再建鏈接:
?
1.4?檢測下nrpe是否正常執行:
去nagiosserver端check下
[root@cache-2 ~]# /usr/local/nagios/libexec/check_nrpe -H xx.xx3.xx NRPE v2.12 [root@cache-2 ~]#[root@cache-2 ~]#? /usr/local/nagios/libexec/check_nrpe -H xx.xx3.xx
NRPE v2.12
[root@cache-2 ~]#
看到返回NRPE v2.15表示已經連接成功。
?
2?編寫shell腳本實現lvs監控
2.1?監控腳本
Nagios里面沒有現成的監控lvs的狀態腳本,所以須要去網上找一個簡單的監控腳本check_lvs.sh,copy到/usr/lib/nagios/plugins/文件夾,賦予nagios權限,腳本內容例如以下:
2.2 nrpe.cfg里面配置例如以下
Vim /etc/nagios/nrpe.cfg,在里面加入一行check_lvs命令:
command[check_lvs]=/usr/lib/nagios/plugins/check_lvs -w 300 -c 600 之后重新啟動nrpe[root@/root/nagios/check_lvs ~]# service nrpe restart; Shutting down nrpe: [確定] Starting nrpe: [確定] [root@/root/nagios/check_lvs ~]#service nrpe restart;2.3?去nagios服務端check一下
[root@cache-2 ~]# /usr/local/nagios/libexec/check_nrpe -H 1x.xx4.x.x5 -c check_lvslvs critical,lvs is down now. [root@cache-2 ~]#看到check出來lvs服務已經處于down模式。
說明:因為check_lvs是要調用ipvsadm命令來獲取LVS狀態的,而ipvsadm命令是僅僅能以root用戶來執行的, 所以須要將nagios用戶設置成能夠無需password直接su成root,這樣就能以nagios用戶執行命令sudo /usr/lib/nagios/plugins/check_lvs?。在centos系統中,無法直接調用sudo命令,須要改動/etc/sudoers,?找到 #Defaults requiretty 并取消凝視,另外新增一行。表示nagios用戶不須要登陸終端就能夠調用命令,例如以下所看到的:
Defaults requiretty Defaults:nagios !requiretty #加入nagios 請求sudo,同意特定指令時(可跟參數),不須要password(如)。 nagios ALL=(ALL) NOPASSWD: ALL?再去naigosserver上面check下,已經生效,例如以下所看到的:
[root@cache-2 etc]# /usr/local/nagios/libexec/check_nrpe -H 10.xx.xx.xx -c check_lvs LVS OK - LVS is running (conn: 16 active, 77 inactive)|active=16;69999;99999;0; inactive=77;69999;99999;0; [root@cache-2 etc]#?2.4?在nagiosserver上加入配置
vim services.cfg define service{host_name lvs-lanservice_description Check lvscheck_command check_nrpe!check_lvsmax_check_attempts 5normal_check_interval 3retry_check_interval 2check_period 24x7notification_interval 10notification_period 24x7notification_options w,c,rcontact_groups opsweb} vim objects/commands.cfg define command{command_name check_lvscommand_line $USER1$/check_lvs -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$}之后又一次載入nagios既完畢了對lvs的監控服務。
[root@cache-2 etc]# service nagios reload Running configuration check... Reloading nagios configuration... done [root@cache-2 etc]#?至此,nagios以下對lvs服務的監控已經完畢。
?
參考資料:http://c20031776.blog.163.com/blog/static/684716252013627506890/
轉載于:https://www.cnblogs.com/mengfanrong/p/4316431.html
與50位技術專家面對面20年技術見證,附贈技術全景圖總結
以上是生活随笔為你收集整理的Nagios监控lvs服务的全部內容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: python 数据结构-字典
- 下一篇: hdu 3047 Zjnu Stadiu