MXes out of sync with Ringmaster

13022007

A colleague of mine has been called in to look at a problem for a customer where Ringmaster appears to lose part of the configuration and gets out of sync with the some of the MXes on the network. But when you try to add the missing config back in, it complains that it is already there…

When looking in the local user database, no MAC-auth users are to be found, but when you telnet to the MX concerned, the MAC users exist. In the local and network changes columns (before you do a deploy) it reports that there are no changes either in Ringmaster or on the MX itself. And yet the configs appear to be out of sync. It is also not possible to cut and paste the MAC users from a good MX to the problem one – if you try to do this it tells you that the config already exists!

The work-around is to delete the MXes from the plan, and to re-upload them into Ringmaster. But who knows – maybe it could happen again. I was there on Friday and no such condition existed as far as I could see.

The MXes are all running MSS 4.2.X. Not sure of the exact version number. We’re wanting to go to version 5 but there was reportedly some kind of memory leak discovered in that version.