Skip to content

Ganglia Documents

ihasson edited this page Jul 28, 2014 · 2 revisions

Ganglia Monitoring For Sysadmins

Name Description
gmond The ganglia monitoring daemon (gmond) is a lightweight service that is installed on every machine you'd like to monitor. This daemon uses a simple listen/announce protocol via XDR to collect monitoring state and then shares this information via XML over TCP. Gmond is portable and collects dozens of system metrics: CPU, memory, disk, network and process data.
gmetad The ganglia meta daemon (gmetad) is a service that collects data from other gmetad and gmond sources and stores their state to disk in indexed round-robin databases. Gmetad provides a simple query mechanism for collecting specific information about groups of machines. Gmetad supports hierarchical delegation for creating manageable monitoring domains.
gmetric The ganglia metric tool is a commandline application that you can use to inject custom made metrics about hosts that are being monitored by ganglia. It has the ability to spoof messages as coming from a different host in case you want to capture and report metrics from a device where you don't have gmond running (like a network or other embedded device).
gstat The ganglia stat tool is a commandline application that you can use to query a gmond for metric information directly.
web The ganglia web frontend expresses the data stored by gmetad in a graphical web interface using PHP.

Ganglia Execution Environment

Ganglia can do more than just monitoring too! GEXEC is a scalable cluster remote execution system which provides fast, RSA authenticated remote execution of parallel and distributed jobs. It provides transparent forwarding of stdin, stdout, stderr, and signals to and from remote processes, provides local environment propagation, and is designed to be robust and to scale to systems over thousands of nodes.

In addition, with authd authentication from RocketCalc, allows SSH to sit on top of GEXEC allowing better performance and easier maintenance than SSH alone.

Embedded metrics

You can also embed ganglia into your favorite application using embedded gmetric. This project adds a simple embedded version of gmetric that sends ganglia compatible packets. Right now the C/C++ version only supports unicast. The pure-python version supports unicast and multicast. An alternative solution is to use a scripted interface to libganglia and therefore generate metrics from your own scripted application as suggested by Ganglia::Gmetric for perl applications or to use libganglia directly from your application.

If you can't integrate your application with ganglia to generate metrics and your only option is to parse its logs, you might be interested on ganglia logtailer.

The following applications had already an integration hook for ganglia :

Name Link
hadoop http://wiki.apache.org/hadoop/GangliaMetrics

Wikis, Books, Magazines

There is a great ganglia Wiki on the IBM website. This wiki has great information about running ganglia in IBM platforms (including AIX) and a very graphic description of how the ganglia architecture works.

The book The Linux Enterprise Cluster: Build a Highly Available Cluster with Commodity Hardware and Free Software by Karl Kopper from No Starch Press has a nice chapter on installing and configuring ganglia.

The Grid2 by Ian Foster and Carl Kesselman has basic information about ganglia including screenshots and explanation of how it related to the overall Grid ecosystem.

The August 2003 edition of Linux Magazine published an article on ganglia in their Extreme Linux section.

Jesse Becker wrote an article for Linux Magazine titled Marching Penguins: Monitoring Your HPC Cluster which was published on September 3rd, 2008.

Academic Papers on Ganglia

The Ganglia Distributed Monitoring System: Design, Implementation, and Experience.
Matthew L. Massie, Brent N. Chun, and David E. Culler.
Parallel Computing, Vol. 30, Issue 7, July 2004.

Parallel Computing This original ganglia paper has been cited 266 times according to Google Scholar.

Wide Area Cluster Monitoring with Ganglia
Federico Sacerdoti, Mason Katz, Matt Massie, David E. Culler.
IEEE Cluster 2003 Conference, Hong Kong. December 2003.

IEEE Clusters 2003 This followup ganglia paper has been cited 65 times according to Google Scholar.

Slideshows, Talks and Presentations on Ganglia

Using the CIM statistical model to monitor your data
Presentation given at Management Developers Conference 2007 by Brad Nicholes

Monitoring Your Data Center using Ganglia
Presentation given at ApacheCon 2007 by Brad Nicholes

Brooks Davis (a ganglia developer) was invited to speak on TechTV about FreeBSD clusters. During the 6 minute segment, he was kind enough to plug ganglia and demonstrate the web interface.

Plugin Repositories

We are currently hosting three plugin repositories at GitHub:

The deprecated repository of user-contributed gmetric scripts can still be found at http://ganglia.sourceforge.net/gmetric/ (most of the scripts have already been imported to the Gmetric repository mentioned above).

Report and Modular Graph Documentation

Documentation on using the new (as of 3.1.x) modular graph code.

Other Complimentary Applications

Most other monitoring solutions with the following probably worth a mention as they are usually found alongside ganglia deployments : \

Name Link
nagios http://www.nagios.org/
cacti http://www.cacti.net/

Other Similar Applications

Similar in some ways to ganglia and worth some consideration if for whatever reason ganglia doesn't fit the bill for you : \

Name Link
munin http://munin.projects.linpro.no/
collectd http://collectd.org/

Google Summer of Code

Deploying Ganglia

Attachments