by ronator » Mon Sep 20, 2010 11:42 am
Dear William,
yes, I also prefer htop but on vici 1.3 there ain't no apt-get and yum seems to not have the corresponding repositories; that's why I use top (and if u press "1", you also see all CPUs, but no "GUI-style" ;-) .
The error messages occurs on the CLI right in the second, when the agent hears something like a beep or at least a sound that makes it impossible to understand the caller for at least one second. I checked that with agents knocking on the table when it occured while I was watching the CLI and I could clearly connect the error message with the short sound problem. I checked the load in the webinterface, trying to interpret the HELP; as far as I understood the max possible (but not desired) number is [number of cores x 100] ?!?
System Load: 268 - 21%
Live Channels: 132
When writing this post, the load level started with 96 and got up to 268. The system (all in one, asterisk, http, mysql) is an Dual-Quad core with 2,6GHz (8 cores), so I think adding more CPU is not really needed. But what do you think ? Are these numbers to high ? And what the heck is this procedure "ast_waitfor_nandfds" doing ? I mean, what does "nandfds" stands for ?
What you said about bandwith shouldn't be a problem: all calls are routed from a computing centre and the link has a 1-Gbit bandwidth with QoS. All "ordinary" internet access is routed another way on a different gateway, so all internet connections cannot interfer with the VoIP-packets ... No network-hardware was changed, internally. I guess, the only thing that remains is adding a timing source. Although I dutyfully read the (full) vicidial manuals, I have no idea how I can do this. And actually, I don't know what a transcoder is or how to combine both ideas ... We only use IAX and SIP. Furthermore, I was quite happy with /dev/zap/pseudo and that it did what it was designed for although I didn't really need to understand completely how and whyit does what it does.
The point with the carrier is also a good hint, because from time to time it seems our provider is least-cost-routing our calls, what produced quality problems in the past. I will try to run a test with another carrier, but I have to check the logs to see when this error ocurred for the first time.
So, you'd say it should not be a configuration problem (except the point of the timing source) ?
Thank you [for naming those different possibilities] and all the best wishes,
Ron Salvatore
P.S.: I re-checked level load. some agents logged off and now I have
System Load: 115 - 11%
Live Channels: 95
but the message still occurs (but not that often)