- Code: Select all
would it be safe to say maybe 200 agents?
Um. ... No.
htop and uptime show "1 5 10 minute average server load". This is a very useful tool. It also has a bar chart showing instant usage on each processor.
If you are anywhere near 50% cpu usage (ie: with 8 processors, that would be 4.0) at 1,5, or 10 minute load averages, you are AT capacity. If you EVER go over 50% ... you need to be looking to add a 2nd server and making a cluster. IF, however, you are nowhere near 50% usage, that doesn't mean you won't be there soon. These systems do not load "linear", but "Exponential". You will find that once you breech 50%, you will hit 75% pretty quick and the jump to 100% is really fast and then you'll hit 500% in a heartbeat and 2400% my not be shown because the processor is too hammered to discuss it with the htop application.
Or to put it more succinctly, just because the server isn't sweating yet, does not mean it will still hold up when it does. There may also be some limitations in the number of simultaneous Conferences. You should also watch for network backlog in the near future.
A couple points: You listed "2 x Intel(R) Xeon(R) CPU E5640 @ 2.67GHz" but did not mention how many cores total. While this information is certainly available on the ark, it would be simpler to post it as a "Total 8 cores" addendum.
If you have hyperthreading on, consider turning it off or at least paying attention to high load days and consider turning it off Then and seeing if there is a difference (other than a false number of CPUs in htop, LOL). We have had slightly better experience with it off than on, but would certainly be interested in hearing other responses.
Note that when you breach 125 users (approximate!!!) you will enter a new realm of "why did THAT break?". Be sure you have a duplicate system and extra networking architecture to be able to break off into two servers at half capacity (or at least move 25 agents into it ... to reduce load on the primary system) in an emergency. Networking will become extremely important, anything imperfect will kill your dialer. Gigabit switches and routers everywhere! No bottlenecks.
Consider moving your database to a replication server and running reports there. Same with all recordings (an archive server with all recordings pushed via FTP).
Happy Hunting!