Zabbix is now the default monitoring tool for the department’s servers. Users using compute servers can request access to monitor machines resources via Zabbix. Access is recommended for all users of the research machines with large jobs.

The Ganglia and Nagios tools are being phased out now. The License server monitoring tool phase out will occur over the summer.

Yesterday the student sysadmin complained they were getting randomly kicked out of a server. Investigation lead to discovery of an older VM started on another server with the same MAC address. Two machines running with the same IP address was the root of our disconnect issues. Implementing a process where archived VMs have thier MAC addresses changed or removed.