Hey all, I'm fairly new to managing servers and I have a problem whose cause I can't work out.
I keep receiving "alarm level changed" emails, and the status of the server is regularly "red".
I've doubled the CPU allocation and increased the RAM on the server, but the alarms keep coming.
I have read a few posts, and below I have pasted the alarm alert email along with the output of various commands I ran on the server.
I'm now completely stuck as to what I need to do next.
Any help would be appreciated.
I'm happy to post more information as needed.
Thank you in advance.
www.serveraddress.com: alarm level changed.
Server health parameter "CPU > Total usage" changed its status from "green" to "red".
top - 13:55:17 up 1 day, 23:28, 0 users, load average: 2.05, 2.04, 1.54
Tasks: 114 total, 2 running, 112 sleeping, 0 stopped, 0 zombie
Cpu(s): 27.8%us, 7.5%sy, 0.0%ni, 64.5%id, 0.2%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 3145728k total, 1731168k used, 1414560k free, 67360k buffers
Swap: 1959920k total, 0k used, 1959920k free, 976748k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
23640 userac 25 0 33348 5920 1176 S 102.0 0.2 23:02.91 perl
23891 userac 25 0 33344 5920 1180 R 100.0 0.2 19:21.40 perl
1 root 15 0 10352 752 628 S 0.0 0.0 0:01.87 init
2 root RT -5 0 0 0 S 0.0 0.0 0:00.21 migration/0
3 root 34 19 0 0 0 S 0.0 0.0 0:00.04 ksoftirqd/0
[cropped to allow posting]
# ls -l /proc/23640
total 0
dr-xr-xr-x 2 userac psacln 0 Jan 5 14:29 attr
-r-------- 1 userac psacln 0 Jan 5 14:29 auxv
-r--r--r-- 1 userac psacln 0 Jan 5 13:31 cmdline
-rw-r--r-- 1 userac psacln 0 Jan 5 14:29 coredump_filter
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 cpuset
lrwxrwxrwx 1 userac psacln 0 Jan 5 14:29 cwd -> /var/tmp
-r-------- 1 userac psacln 0 Jan 5 14:29 environ
lrwxrwxrwx 1 userac psacln 0 Jan 5 13:59 exe -> /usr/bin/perl
dr-x------ 2 userac psacln 0 Jan 5 14:29 fd
-r--r--r-- 1 userac psacln 0 Jan 5 13:31 io
-r-------- 1 userac psacln 0 Jan 5 14:29 limits
-rw-r--r-- 1 userac psacln 0 Jan 5 14:29 loginuid
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 maps
-rw------- 1 userac psacln 0 Jan 5 14:29 mem
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 mounts
-r-------- 1 userac psacln 0 Jan 5 14:29 mountstats
-rw-r--r-- 1 userac psacln 0 Jan 5 14:29 oom_adj
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 oom_score
lrwxrwxrwx 1 userac psacln 0 Jan 5 14:29 root -> /
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 schedstat
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 smaps
-r--r--r-- 1 userac psacln 0 Jan 5 13:31 stat
-r--r--r-- 1 userac psacln 0 Jan 5 13:55 statm
-r--r--r-- 1 userac psacln 0 Jan 5 13:32 status
dr-xr-xr-x 3 userac psacln 0 Jan 5 13:31 task
-r--r--r-- 1 userac psacln 0 Jan 5 14:29 wchan
# ls -l /proc/23891
total 0
dr-xr-xr-x 2 userac psacln 0 Jan 5 14:30 attr
-r-------- 1 userac psacln 0 Jan 5 14:30 auxv
-r--r--r-- 1 userac psacln 0 Jan 5 13:35 cmdline
-rw-r--r-- 1 userac psacln 0 Jan 5 14:30 coredump_filter
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 cpuset
lrwxrwxrwx 1 userac psacln 0 Jan 5 14:30 cwd -> /var/tmp
-r-------- 1 userac psacln 0 Jan 5 14:30 environ
lrwxrwxrwx 1 userac psacln 0 Jan 5 13:59 exe -> /usr/bin/perl
dr-x------ 2 userac psacln 0 Jan 5 14:30 fd
-r--r--r-- 1 userac psacln 0 Jan 5 13:35 io
-r-------- 1 userac psacln 0 Jan 5 14:30 limits
-rw-r--r-- 1 userac psacln 0 Jan 5 14:30 loginuid
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 maps
-rw------- 1 userac psacln 0 Jan 5 14:30 mem
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 mounts
-r-------- 1 userac psacln 0 Jan 5 14:30 mountstats
-rw-r--r-- 1 userac psacln 0 Jan 5 14:30 oom_adj
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 oom_score
lrwxrwxrwx 1 userac psacln 0 Jan 5 14:30 root -> /
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 schedstat
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 smaps
-r--r--r-- 1 userac psacln 0 Jan 5 13:35 stat
-r--r--r-- 1 userac psacln 0 Jan 5 13:55 statm
-r--r--r-- 1 userac psacln 0 Jan 5 13:37 status
dr-xr-xr-x 3 userac psacln 0 Jan 5 13:35 task
-r--r--r-- 1 userac psacln 0 Jan 5 14:30 wchan
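For reference, the cmdline entry in the listings above holds the process's argument list, with the arguments separated by NUL bytes. This was not run as part of the output in this post; it is only a minimal sketch of how that file could be read, and show_cmdline is a hypothetical helper name, not an existing command:

```shell
# Hypothetical helper: print a NUL-separated cmdline file as one readable line.
# $1 is the path to the file, e.g. /proc/23640/cmdline
show_cmdline() {
    # Replace each NUL separator with a space, then end the line.
    tr '\0' ' ' < "$1"
    echo
}

# Example: show the command line of the current shell, if /proc is available.
if [ -r "/proc/$$/cmdline" ]; then
    show_cmdline "/proc/$$/cmdline"
fi
```

Run as root (or as the process owner), show_cmdline /proc/23640/cmdline would print whatever arguments perl was started with, assuming the process is still running.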
# ps auxw | grep perl
root 29737 0.0 0.0 61148 768 pts/0 R+ 14:30 0:00 grep perl
# lsof -p 23640 |more
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
perl 23640 userac cwd DIR 253,1 58 172 /var/tmp
perl 23640 userac rtd DIR 202,1 4096 2 /
perl 23640 userac txt REG 253,0 13696 7411 /usr/bin/perl
perl 23640 userac mem REG 202,1 137256 310334 /lib64/ld-2.5.so
perl 23640 userac mem REG 253,0 1259888 12586129 /usr/lib64/perl5/5.8.8/x86_64-linux-thread-multi/CORE/libperl.so
perl 23640 userac mem REG 202,1 89800 310316 /lib64/libresolv-2.5.so
perl 23640 userac mem REG 202,1 111480 310331 /lib64/libnsl-2.5.so
perl 23640 userac mem REG 202,1 20424 310309 /lib64/libdl-2.5.so
perl 23640 userac mem REG 202,1 611880 310292 /lib64/libm-2.5.so
perl 23640 userac mem REG 202,1 45728 310317 /lib64/libcrypt-2.5.so
perl 23640 userac mem REG 202,1 15280 310350 /lib64/libutil-2.5.so
perl 23640 userac mem REG 202,1 142696 310291 /lib64/libpthread-2.5.so
perl 23640 userac mem REG 202,1 1712536 310298 /lib64/libc-2.5.so
perl 23640 userac mem REG 253,0 18080 4198489 /usr/lib64/perl5/5.8.8/x86_64-linux-thread-multi/auto/IO/IO.so
perl 23640 userac mem REG 253,0 21424 12586731 /usr/lib64/perl5/5.8.8/x86_64-linux-thread-multi/auto/Socket/Socket.so
perl 23640 userac mem REG 202,1 53880 310340 /lib64/libnss_files-2.5.so
perl 23640 userac 0u unix 0xffff8800a679ef00 15659277 /var/run/mod_fcgid/sock/2180.975
perl 23640 userac 1w FIFO 0,6 15659594 pipe
perl 23640 userac 2w FIFO 0,6 15659594 pipe
perl 23640 userac 3u unix 0xffff88000004e600 15659407 /var/run/mod_fcgid/sock/2180.975
perl 23640 userac 4u IPv4 15659654 TCP www.serveraddress.com:40659->chi4.vm.bitvps.com:irdmi (ESTABLISHED)
perl 23640 userac 45r FIFO 0,6 5180 pipe
perl 23640 userac 48w FIFO 0,6 5181 pipe
perl 23640 userac 49w FIFO 0,6 15659188 pipe
perl 23640 userac 50w FIFO 0,6 15659242 pipe
perl 23640 userac 51w FIFO 0,6 15659189 pipe
perl 23640 userac 53w FIFO 0,6 15659190 pipe
perl 23640 userac 54w FIFO 0,6 15659243 pipe
perl 23640 userac 56w FIFO 0,6 15659244 pipe
# lsof -p 23891 |more
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
perl 23891 userac cwd DIR 253,1 58 172 /var/tmp
perl 23891 userac rtd DIR 202,1 4096 2 /
perl 23891 userac txt REG 253,0 13696 7411 /usr/bin/perl
perl 23891 userac mem REG 202,1 137256 310334 /lib64/ld-2.5.so
perl 23891 userac mem REG 253,0 1259888 12586129 /usr/lib64/perl5/5.8.8/x86_64-linux-thread-multi/CORE/libperl.so
perl 23891 userac mem REG 202,1 89800 310316 /lib64/libresolv-2.5.so
perl 23891 userac mem REG 202,1 111480 310331 /lib64/libnsl-2.5.so
perl 23891 userac mem REG 202,1 20424 310309 /lib64/libdl-2.5.so
perl 23891 userac mem REG 202,1 611880 310292 /lib64/libm-2.5.so
perl 23891 userac mem REG 202,1 45728 310317 /lib64/libcrypt-2.5.so
perl 23891 userac mem REG 202,1 15280 310350 /lib64/libutil-2.5.so
perl 23891 userac mem REG 202,1 142696 310291 /lib64/libpthread-2.5.so
perl 23891 userac mem REG 202,1 1712536 310298 /lib64/libc-2.5.so
perl 23891 userac mem REG 253,0 18080 4198489 /usr/lib64/perl5/5.8.8/x86_64-linux-thread-multi/auto/IO/IO.so
perl 23891 userac mem REG 253,0 21424 12586731 /usr/lib64/perl5/5.8.8/x86_64-linux-thread-multi/auto/Socket/Socket.so
perl 23891 userac mem REG 202,1 53880 310340 /lib64/libnss_files-2.5.so
perl 23891 userac 0u unix 0xffff8800a679ef00 15659277 /var/run/mod_fcgid/sock/2180.975
perl 23891 userac 1w FIFO 0,6 15660479 pipe
perl 23891 userac 2w FIFO 0,6 15660479 pipe
perl 23891 userac 3w FIFO 0,6 15660479 pipe
perl 23891 userac 4u IPv4 15660485 TCP www.serveraddress.com:40673->chi4.vm.bitvps.com:irdmi (ESTABLISHED)
perl 23891 userac 45r FIFO 0,6 5180 pipe
perl 23891 userac 48w FIFO 0,6 5181 pipe
perl 23891 userac 49w FIFO 0,6 15659188 pipe
perl 23891 userac 50w FIFO 0,6 15659242 pipe
perl 23891 userac 51w FIFO 0,6 15659189 pipe
perl 23891 userac 53w FIFO 0,6 15659190 pipe
perl 23891 userac 54w FIFO 0,6 15659243 pipe
perl 23891 userac 56w FIFO 0,6 15659244 pipe