Usually when I take large chunks of time away from the lab the servers get sad and Jeff has to deal with some extra sysadmin chaos beyond the usual grind we both deal with day to day. This time they kindly waited for me to get back.

The BOINC web/alpha project server went kaput on Monday, the day I returned. This wasn't the worst tragedy, as all I had to do was reinstall the OS. However during that process one of the non-OS filesystems in the machine got screwed up. Classic linux software RAID behavior: failing for no apparent reason, and instead of recovering gracefully like it should it gets into a funny state that makes recovery impossible. As I griped about in the past: linux software RAID is really only good for organized, faster storage, with the side benefit of maybe, on very rare occasions, actually protecting the data. The cons for linux software RAID is that it will eventually go bonkers and ruin your day.

Fine. I rebuilt the RAID, and we have daily backups of the filesystem in question (which happens to hold the entire BOINC web site). Well, turns out the lab-wide backup system was also having problems. Talk about bad timing. Long story short we had to wait 48 hours for the backup system to get back on line, and only just now am I recovering the web site. It should be back later this afternoon in some form or another (I hope).

- Matt-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Do have a time scale for the next scheduler rebuild? Until a properly fixed scheduler is deployed Anonymous platform hosts without flops values get the same <rsc_fpops_est> values for CPU and GPU apps, and so DCF goes all over the place,
(Richard Haselgrove asked on Boinc_Dev: 'Not being a skilled code reader, may I ask if changeset 24385's "improve the accuracy of FLOPS estimation for GPU apps" applies to anonymous platform?', But he never got an answer from DA.)

I believe there are still gremlins running amok in the servers. I tried to log onto both BOINC and SETI and had to revert to using the authenticator string for SETI. BOINC is not recognizing either my regular login info (No account with email address xxxxx@xxxxx.net exists. Please go back and try again.) nor the authenticator string. On the BOINC download page (http://boinc.berkeley.edu/download.php) the following is displayed - Download BOINC, 6.12.34 for Windows 64-bit (0.00 MB).

Thank you for your response, and yes, you are correct about BOINC. I am in the process of (bouncing off the walls) trying to setup our first home network, and while trying to download the current BOINC software mistakenly tried to log in to the BOINC message boards after the download page displayed the link size as 0.00 MB. Again, danke!

Hi, sorry to add to your woes, but uploads are very slow, and the Scarecrow graphs have flatlined since 2300 UTC yesterday (4 Nov, 2011)

Scarecrow's graphs have flatlined because the souce data (the XML version of the server status page) hasn't updated since 4 Nov 2011 | 21:20:07 UTC - the staff are aware of this, but it's the weekend and still very early morning in Berkeley.

Hi, sorry to add to your woes, but uploads are very slow, and the Scarecrow graphs have flatlined since 2300 UTC yesterday (4 Nov, 2011)

Scarecrow's graphs have flatlined because the source data (the XML version of the server status page) hasn't updated since 4 Nov 2011 | 21:20:07 UTC - the staff are aware of this, but it's the weekend and still very early morning in Berkeley.

Refer to the discussion threads in Number Crunching.

I know the time in Berkeley - I live 15 minutes by car north of the SSL - I was posting that so that someone could look into it when they arrived.