[rtg] Results with RTG
Harry Marcson
harrymarcson at gmail.com
Sun Mar 8 21:58:52 EDT 2009
We are looking to integrate RTG into our system, because it seems to be the
only option for our setup. We are looking to poll about 100 switches,
connecting about 2000 servers.
Our current RTG testing results for 1 switch polling showed us that RTG:
1- Seems to have memory leaks, not new news after reading through the
mailing list, but was wondering how people are surviving with this,
especially the ones with the bigger setups. did you apply any specific patch
such as yahoo rtg (yrtg) or own fixes?
2- Has really boring and bad-looking graphs. Good graphs would be the ones
that are from Cacti for example.. If anyone has a better rtgplot.cgi or
improved graphing code, please do share it!
3- Bugs in the graphs. A simple example is a server that got a 800Mbps DDOS
attack and got nullrouted.. RTG did not move the line near the 0Mbps, when
it crashed. The end result is, once the server was restored a few hours
later, the graph line went down to the actual usage of about 10Mbps, but
showed that during the nullroute a 800Mbps usage.
4- Makes matching the actual switch port with the RTG device id a bit of a
hassle. Has anyone come up with a solution for that?
5- Lacks automation when it comes to discovering new devices. Adding switch
ips is an easy one, but how to run rtg targetmaker.pl every time a new
server is added to a switch, aswell as restarting the RTG process is a more
difficult one.
I am also very interested in knowing wether any big companies are utilizing
RTG for their bandwidth monitoring or billing. I saw in the mailinglist that
Layeredtech, Gnax, Savvis are some examples. Any others that would like to
convince us that RTG is worth it in the long run?
Harry
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.grdata.com/pipermail/rtg/attachments/20090309/614183e6/attachment-0001.htm>
More information about the RTG
mailing list