Web stops responding and times out

All installation and configuration problems and questions

Moderators: gerski, enjay, williamconley, Op3r, Staydog, gardo, mflorell, MJCoate, mcargile, Kumba, Michael_N

Web stops responding and times out

Postby Iz3k34l » Mon Jun 02, 2014 4:25 pm

We are having times where the Web stops responding and times out. I have increase the Maxclients setting per the vici-server-tuning.conf suggestion;
StartServers 450
MinSpareServers 250
MaxSpareServers 500
ServerLimit 768
MaxClients 768
MaxRequestsPerChild 1000
KeepAliveTimeout 15

We have about 70 agents and the dialers are making about 220+ call at a time. I have recently changed our dialers to also provide web Our top output is as follows;


Web/Mysql
top - 16:01:46 up 16:25, 2 users, load average: 0.33, 0.35, 0.33
Tasks: 846 total, 1 running, 845 sleeping, 0 stopped, 0 zombie
Cpu(s): 2.7%us, 0.4%sy, 0.0%ni, 96.8%id, 0.0%wa, 0.0%hi, 0.1%si, 0.0%st
Mem: 64543M total, 13343M used, 51200M free, 114M buffers
Swap: 4063M total, 0M used, 4063M free, 10224M cached
---------------------------------------------------------------------------------------
load average: 0.28, 0.34, 0.32 mysqld 5.2.13-MariaDB-log up 0 day(s), 17:24 hrs
40 threads: 4 running, 44 cached. Queries/slow: 1.6K/0 Cache Hit: 99.94%
Opened tables: 0 RRN: 401 TLW: 3.2M SFJ: 0 SMP: 0 QPS: 0

ID USER HOST DB TIME COMMAND STATE INFO
1 slave 192.168.40.24:49 59065 Binlog Has sent all
65 cron 192.168.40.28:58 asterisk Query statistics SELECT ... FROM call_log where caller_code='V6021601500008630000' and server_ip='192.168.40.28' o
2405304 mysqltop localhost Query show full processlist
---

Dialer 1
top - 16:02:41 up 3 days, 5:31, 4 users, load average: 0.20, 0.20, 0.29
Tasks: 623 total, 3 running, 609 sleeping, 0 stopped, 11 zombie
Cpu(s): 2.2%us, 1.0%sy, 0.0%ni, 96.6%id, 0.1%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 8190120k total, 7870052k used, 320068k free, 234268k buffers
Swap: 4176892k total, 0k used, 4176892k free, 6702716k cached

Dialer 2
top - 16:03:00 up 3 days, 5:32, 3 users, load average: 0.16, 0.10, 0.15
Tasks: 597 total, 1 running, 593 sleeping, 0 stopped, 3 zombie
Cpu(s): 1.7%us, 1.1%sy, 0.0%ni, 97.1%id, 0.1%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 8190116k total, 7952540k used, 237576k free, 245972k buffers
Swap: 4176892k total, 200k used, 4176692k free, 6832976k cached

Dialer 3/Web
top - 16:03:19 up 4 days, 6:57, 2 users, load average: 1.51, 0.77, 0.72
Tasks: 602 total, 1 running, 591 sleeping, 0 stopped, 10 zombie
Cpu(s): 4.3%us, 1.4%sy, 0.0%ni, 93.6%id, 0.1%wa, 0.0%hi, 0.6%si, 0.0%st
Mem: 8190116k total, 7971888k used, 218228k free, 144504k buffers
Swap: 4176892k total, 9360k used, 4167532k free, 5038260k cached

Dialer4/Web
top - 16:03:46 up 4 days, 6:58, 2 users, load average: 0.67, 0.95, 1.04
Tasks: 620 total, 2 running, 618 sleeping, 0 stopped, 0 zombie
Cpu(s): 9.0%us, 3.9%sy, 0.0%ni, 86.5%id, 0.1%wa, 0.0%hi, 0.5%si, 0.0%st
Mem: 8190116k total, 8010164k used, 179952k free, 126304k buffers
Swap: 4176892k total, 14304k used, 4162588k free, 5054024k cached

Dialer 5/Web
top - 16:04:23 up 4 days, 6:58, 2 users, load average: 1.19, 1.54, 1.44
Tasks: 649 total, 3 running, 623 sleeping, 0 stopped, 23 zombie
Cpu(s): 15.6%us, 3.1%sy, 0.0%ni, 80.5%id, 0.5%wa, 0.0%hi, 0.3%si, 0.0%st
Mem: 8190116k total, 7952352k used, 237764k free, 125892k buffers
Swap: 4176892k total, 21576k used, 4155316k free, 5011864k cached

Dialer 6/Web
top - 16:05:11 up 4 days, 6:58, 3 users, load average: 0.03, 0.05, 0.05
Tasks: 577 total, 2 running, 575 sleeping, 0 stopped, 0 zombie
Cpu(s): 1.2%us, 1.2%sy, 0.0%ni, 97.5%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
Mem: 16078M total, 3639M used, 12439M free, 120M buffers
Swap: 4078M total, 0M used, 4078M free, 1047M cached

The 6th dialer is obviously not being used much but Dialer 3 & 5 are getting beaten. The dialers continually seem to have zombie processes which is always the VD_amd.agi. I don't have dialer 2 & 3 in the web proxy cluster because of some issue with a custom script connecting to the DB.

I am planning on adding more ram to the 8gb dialers to try and help with swap usage.
I also have the log table archive script running at the default 2 months but they are still about 12-18 million records.

Any suggestions to combat this problem is greatly appreciated.
Vicidial Ver.2.6-399a | ViciBox.x86_64-4.0.3.preload.iso
BUILD: 130425-0700
Web/MySQL-4x Xeon X7550 4 Core 2.0GHz. 64GB ram
1x Dialer- Dual Xeon Quad Core 2.66GHz 16GB ram
5x Dialer- Dual Xeon Quad Core 2.66GHz 8GB ram
amfeltec PCI Timers
Iz3k34l
 
Posts: 76
Joined: Fri Feb 08, 2013 4:55 pm

Re: Web stops responding and times out

Postby mflorell » Tue Jun 03, 2014 5:23 am

I would recommend using a dedicated webserver and getting the agents off of the web on your dialers.

Have you checked your slow query log for long running queries?

Could you tell us more about your DB hardware and configuration?
mflorell
Site Admin
 
Posts: 18387
Joined: Wed Jun 07, 2006 2:45 pm
Location: Florida

Re: Web stops responding and times out

Postby Iz3k34l » Tue Jun 03, 2014 7:40 am

Yes i have checked the slow query logs, which there seems to not be any.
I will move them back to the single web server.
Our DB is on an HP Proliant DL580 G7 with 4 Xeon X7550 2 GHz w/64 gb ram and the DB is a Maria DBMS. It is serving web and DB.

I was thinking about setting up a VM webserver.

Thanks again
Vicidial Ver.2.6-399a | ViciBox.x86_64-4.0.3.preload.iso
BUILD: 130425-0700
Web/MySQL-4x Xeon X7550 4 Core 2.0GHz. 64GB ram
1x Dialer- Dual Xeon Quad Core 2.66GHz 16GB ram
5x Dialer- Dual Xeon Quad Core 2.66GHz 8GB ram
amfeltec PCI Timers
Iz3k34l
 
Posts: 76
Joined: Fri Feb 08, 2013 4:55 pm

Re: Web stops responding and times out

Postby mflorell » Tue Jun 03, 2014 8:52 am

What kind of drives and RAID card/RAID level is on your DB server?

Also, what Linux distro and filesystem on your DB?
mflorell
Site Admin
 
Posts: 18387
Joined: Wed Jun 07, 2006 2:45 pm
Location: Florida

Re: Web stops responding and times out

Postby Iz3k34l » Tue Jun 03, 2014 10:43 am

we are using a hardware raid 6 using 300gb SAS 10k drives and the O/S

Linux VCDdial1 3.1.10-1.16-default #1 SMP Wed Jun 27 05:21:40 UTC 2012 (d016078) x86_64 x86_64 x86_64 GNU/Linux
openSUSE 12.1 (x86_64)
VERSION = 12.1
CODENAME = Asparagus
NAME=openSUSE
VERSION = 12.1 (Asparagus)
VERSION_ID="12.1"
PRETTY_NAME="openSUSE 12.1 (Asparagus) (x86_64)"
ID=opensuse
Vicidial Ver.2.6-399a | ViciBox.x86_64-4.0.3.preload.iso
BUILD: 130425-0700
Web/MySQL-4x Xeon X7550 4 Core 2.0GHz. 64GB ram
1x Dialer- Dual Xeon Quad Core 2.66GHz 16GB ram
5x Dialer- Dual Xeon Quad Core 2.66GHz 8GB ram
amfeltec PCI Timers
Iz3k34l
 
Posts: 76
Joined: Fri Feb 08, 2013 4:55 pm

Re: Web stops responding and times out

Postby mflorell » Wed Jun 04, 2014 6:06 am

RAID6 = BAD FOR DATABASES! You will run into throughput issues eventually as that raid type is not designed for quick reads and writes.

And what kind of hardware RAID are you using?

I would suggest moving to an LSILogic MegaRAID Caching Raid Controller with a RAID 10. I would also recommend moving to 15k drives if possible.
mflorell
Site Admin
 
Posts: 18387
Joined: Wed Jun 07, 2006 2:45 pm
Location: Florida

Re: Web stops responding and times out

Postby Iz3k34l » Wed Jun 04, 2014 8:38 am

the system is using an onbord HP raid controller, i will look into the 15k drives and new controller. Also would it be possible to use VM for web only?
Vicidial Ver.2.6-399a | ViciBox.x86_64-4.0.3.preload.iso
BUILD: 130425-0700
Web/MySQL-4x Xeon X7550 4 Core 2.0GHz. 64GB ram
1x Dialer- Dual Xeon Quad Core 2.66GHz 16GB ram
5x Dialer- Dual Xeon Quad Core 2.66GHz 8GB ram
amfeltec PCI Timers
Iz3k34l
 
Posts: 76
Joined: Fri Feb 08, 2013 4:55 pm

Re: Web stops responding and times out

Postby mflorell » Thu Jun 05, 2014 5:23 am

You could probably use a VM for the web portion only, but we usually don't recommend using them for anything in Vicidial.
mflorell
Site Admin
 
Posts: 18387
Joined: Wed Jun 07, 2006 2:45 pm
Location: Florida

Re: Web stops responding and times out

Postby Iz3k34l » Fri Jun 06, 2014 8:38 am

Copy that... can vicidial be installed as a stand-alone IE: without the operating system. I was thinking about setting up a Slackware box and installing lighthttpd for web and a Slackware with MariaDB. since i will have to reinstall the O/S when swapping the raid on the DB server
Vicidial Ver.2.6-399a | ViciBox.x86_64-4.0.3.preload.iso
BUILD: 130425-0700
Web/MySQL-4x Xeon X7550 4 Core 2.0GHz. 64GB ram
1x Dialer- Dual Xeon Quad Core 2.66GHz 16GB ram
5x Dialer- Dual Xeon Quad Core 2.66GHz 8GB ram
amfeltec PCI Timers
Iz3k34l
 
Posts: 76
Joined: Fri Feb 08, 2013 4:55 pm

Re: Web stops responding and times out

Postby mflorell » Fri Jun 06, 2014 9:19 am

We recommend using ViciBox, which is our ISO installer based upon OpenSuSE.

I would recommend staying with Apache for the web server, since that is what we do our testing on. Also, if you are installing from scratch, don't forget eaccellerator or another PHP caching engine, it triples your web server capacity in Vicidial.
mflorell
Site Admin
 
Posts: 18387
Joined: Wed Jun 07, 2006 2:45 pm
Location: Florida

Re: Web stops responding and times out

Postby Iz3k34l » Fri Jun 06, 2014 12:22 pm

Is the back script included (ADMIN_backup_something or other) in Vicidial sufficient to restore from on a fresh install?
Vicidial Ver.2.6-399a | ViciBox.x86_64-4.0.3.preload.iso
BUILD: 130425-0700
Web/MySQL-4x Xeon X7550 4 Core 2.0GHz. 64GB ram
1x Dialer- Dual Xeon Quad Core 2.66GHz 16GB ram
5x Dialer- Dual Xeon Quad Core 2.66GHz 8GB ram
amfeltec PCI Timers
Iz3k34l
 
Posts: 76
Joined: Fri Feb 08, 2013 4:55 pm

Re: Web stops responding and times out

Postby mflorell » Fri Jun 06, 2014 7:58 pm

It can restore the Vicidial settings, but not any Linux settings you have.
mflorell
Site Admin
 
Posts: 18387
Joined: Wed Jun 07, 2006 2:45 pm
Location: Florida


Return to Support

Who is online

Users browsing this forum: No registered users and 110 guests

cron