System Configuration:
Multi-Server System, 1 DB/Web Server, 7 dialing servers
ViciBox v.7.0.4-170113
VERSION: 2.14-667a
BUILD: 180331-1715
Asterisk 11.25.1-Vici on all servers
Dual Quad core Xeon in each server Minimum (Slightly varying speeds). DB Server with 128GB of RAM, each dialing server has 16GB RAM. SSD drives in all servers
This particular system has 50 agents spread across the servers, dialing at approximately 7 to 1.
The problem is essentially that occasionally, at various dialing levels, a server will go "Red" in the summary screen. When that happens, all the agents with their phones on that server will get the dreaded Timesync Error. and will be unable to log back in. The asterisk command line shows hundreds of " channel.c:1310 __ast_queue_frame: Exceptionally long voice queue length queuing" warnings.
After a reboot, the "Red" is cleared and phones can re-register, agents can log in, and things will continue as normal.
Here is what I have noticed that does/does not affect the problem and what I have done to date
- Seems to happen most often when a remote client has a particularly bad Internet connection, but not always - Connection can be great and it will still happen
- A servers are in time synchronization with each other
- All servers have remote client IP's http, sip, and rtp white listed only
- System uses SIP trunks and have their IP's white listed.
- All other access to the system on all ports is closed
- Trunks are balanced across all dialing servers
Yes I have read the manual and searched the forums, and while I see others mentioning this issue, no real solution was presented (apart from an NTPdate sync in the crontab - which doesn't work) and that was 2 years ago.
For my clients that I recommend Vicidial to as a dialing server, this is pretty much the ONLY issue they ever run into, but it is quickly becoming the issue that causes them to leave Vicidial.
I would really appreciate ANY help or direction.
Thank you.