天天看點

關于nagios監控遠端伺服器對伺服器性能影響的測試

1. Nagios監視遠端伺服器時,是通過在控制端執行以下指令進行資料收集的:

/usr/local/nagios/libexec/check_http -I 192.168.16.11

/usr/local/nagios/libexec/check_ssh -H 192.168.16.11

/usr/local/nagios/libexec/check_ping -H 192.168.16.11 -w 100.0,20% -c 500.0,60% -p 5

/usr/local/nagios/libexec/check_tcp -H 192.168.16.11 -p 22

/usr/local/nagios/libexec/check_snmp_load.pl -H 192.168.16.11 -C public -w 70% -c 90% -T netsc -f

/usr/local/nagios/libexec/check_traffic.sh -V 2c -C public -H 192.168.16.11 -I 0 -w 400,200 -c 600,400 -M -b

/usr/local/nagios/libexec/check_snmp_storage.pl -H 192.168.16.11 -C public -m "^/" -w 80% -c 90% -f

/usr/local/nagios/libexec/check_snmp_storage.pl -H 192.168.16.11 -C public -m "memory|Memory" -w 80% -c 90% -f

/usr/local/nagios/libexec/check_snmp -H '192.168.16.11' -C public -o .1.3.6.1.4.1.2021.11.9.0,.1.3.6.1.4.1.2021.11.10.0,.1.3.6.1.4.1.2021.11.11.0 -l 'CPU usage (user system idle)' -u '%'

/usr/local/nagios/libexec/check_snmp_load.pl -H 192.168.16.11 -C public -w 3,3,2 -c 4,4,3 -T netsl -f

2. 測試方法:

1)通過nagios運作時,監視收集遠端伺服器資源使用資訊;

2)通過在控制端每s執行監控指令,監視收集遠端伺服器資源使用資訊;

3. 測試結果:

方法一:nagios運作時,在遠端伺服器分别執行top -b -d 2 -p21472 >/tmp/top_snmp.log、top -b -d 2 >/tmp/top_all.log,執行時間約為3小時,分析收集到的日志得知,監視時系統資源使用情況正常。

top_snmp.log部分資訊

top - 15:17:25 up 192 days, 22:16, 4 users, load average: 0.00, 0.00, 0.00

Tasks: 1 total, 0 running, 1 sleeping, 0 stopped, 0 zombie

Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st

Mem: 32933092k total, 21012132k used, 11920960k free, 520748k buffers

Swap: 16779884k total, 0k used, 16779884k free, 16376668k cached

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND

root 15 0 157m 8616 4424 S 0.0 0.0 0:12.07 snmpd

top_all.log部分資訊

top - 09:51:22 up 192 days, 16:50, 5 users, load average: 0.00, 0.00, 0.00

Tasks: 258 total, 1 running, 257 sleeping, 0 stopped, 0 zombie

Cpu(s): 0.2%us, 0.1%sy, 0.0%ni, 99.7%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st

Mem: 32933092k total, 20603272k used, 12329820k free, 520164k buffers

Swap: 16779884k total, 0k used, 16779884k free, 15974748k cached

root 15 0 10372 692 584 S 0.0 0.0 0:02.33 init

root RT -5 0 0 0 S 0.0 0.0 0:00.23 migration/0

root 34 19 0 0 0 S 0.0 0.0 0:00.04 ksoftirqd/0

方法二:在控制端執行腳本plugins_test.sh,執行指令為:watch –d –n 1 sh plugins_test.sh,同時在遠端伺服器上執行top -b -d 1 -p21472 >top_snmp_time.log、watch -d -n 1 vmstatg、watch -d -n 1 free -m,5min後檢視日志,分析收集到的日志得知,監視時系統資源使用情況正常。

Free指令10s之間的變化數

total used free shared buffers cached

Mem: 32161 20527 11633 0 508 15983

-/+ buffers/cache: 4035 28125

Swap: 16386 0 16386

Mem: 32161 20528 11632 0 508 15983

vmstat指令10s之間的變化數

procs -----------memory---------- ---swap-- -----io---- --system-- -----cpu------

r b swpd free buff cache si so bi bo in cs us sy id wa st

0 0 11912152 520788 16367288 0 0 0 3 0 0 0 0 100 0 0

0 0 11912484 520788 16367332 0 0 0 3 0 0 0 0 100 0 0

top_snmp_time.log部分資訊

top - 16:22:15 up 192 days, 23:21, 9 users, load average: 0.00, 0.00, 0.00

Mem: 32933092k total, 21018792k used, 11914300k free, 520780k buffers

Swap: 16779884k total, 0k used, 16779884k free, 16374824k cached

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND

root 15 0 157m 8616 4424 S 0.0 0.0 0:13.08 snmpd

top - 16:22:16 up 192 days, 23:21, 9 users, load average: 0.00, 0.00, 0.00

top - 16:22:17 up 192 days, 23:21, 9 users, load average: 0.00, 0.00, 0.00

Mem: 32933092k total, 21018680k used, 11914412k free, 520780k buffers

root 15 0 157m 8616 4424 S 0.0 0.0 0:13.08 snmpd

4. 測試結果:

通過以上資料分析後可知:在利用nagios的自身插件及snmp協定(共對遠端伺服器的10個服務時行監控)監控遠端伺服器時,對遠端伺服器資源使用情況無影響。當大于10個服務時,需要另行評測。

5. 遠端伺服器配置情況:

[root@xxx-xxx ~]# cat /proc/cpuinfo | grep name | cut -f2 -d: | uniq -c &cat /proc/cpuinfo | grep physical | uniq -c

[1] 13558

physical id : 0

address sizes : 40 bits physical, 48 bits virtual

physical id : 1

Intel(R) Xeon(R) CPU E5620 @ 2.40GHz

[root@xxx-xxx ~]# cat /etc/issue&free -g

[1] 13753

CentOS release 5.6 (Final)

Kernel \r on an \m

total used free shared buffers cached

Mem: 31 20 11 0 0 15

-/+ buffers/cache: 3 27

繼續閱讀