CELS Virtual Helpdesk

CELS Shared Services Systems Group


Critical NFS Bug: Potential Data Loss

March 29, 2022 by Stacey, Craig


There is a critical bug on our AIX NFS servers (gorilla, kong and magilla) serving MCS home directories and MCS project data. This bug is encountered only on our newest Linux desktops, a list of which appears at the bottom of this note.

Specifically, the bug is encountered when group-writable files are accessed on one of these machines by someone other than the owner. When this occurs, the user will get an error, but more importantly the file will be zeroed out (i.e. replaced with an empty file).

If you have used one of the machines listed, you will want to double check your files to ensure you haven’t accidentally lost data due to this bug. If you have, let us know so we can restore as soon as possible. In the meantime, make sure any network file writes are done from a known safe machine, such as terra, shakey, harley, triumph, elephant, crunch, smash or schwinn. If your desktop machine is not listed below, it is also safe. You can also type “whatami” from a Linux terminal: if the output is linux-debian_3.1-ia32, that machine is safe.

IBM has issued a fix for this bug. However, to apply it to our servers we would need to upgrade the full operating system on them, and the amount of downtime associated with this would be unacceptable. Instead, we will be taking a fast track to get one of the new Solaris file servers online and migrate all NFS shares over to that. We have confirmed this bug is not present in Solaris.

This is our top priority task. I can’t give an exact time frame at this point, but I will promise a status update tomorrow (6/27). This is a fast-track emergency solution; we’ll deploy a more elegant solution once this fire’s out.

My sincere apologies for this. If you’ve lost data because of this, we’ll do all we can to get it back.

The list of affected machines:

  • bucco.mcs.anl.gov
  • cifaretto.mcs.anl.gov
  • contra.mcs.anl.gov
  • csi334378.mcs.anl.gov
  • darth.mcs.anl.gov
  • effable.mcs.anl.gov
  • garth.mcs.anl.gov
  • gnep.mcs.anl.gov
  • haines-desktop.mcs.anl.gov
  • hookshot.mcs.anl.gov
  • horikawa-cph.mcs.anl.gov
  • joy-3-06p4-cph.mcs.anl.gov
  • kant.mcs.anl.gov
  • kirby.mcs.anl.gov
  • kschoche-desktop.mcs.anl.gov
  • likli-desktop.mcs.anl.gov
  • lucky[0-6].mcs.anl.gov
  • luigi.mcs.anl.gov
  • lust-cph.mcs.anl.gov
  • msulliva-desktop.mcs.anl.gov
  • nehebkau.mcs.anl.gov
  • noah.mcs.anl.gov
  • octagon.mcs.anl.gov
  • octopus.mcs.anl.gov
  • opteron-ibm.mcs.anl.gov
  • paulie.mcs.anl.gov
  • piano.mcs.anl.gov
  • redline.mcs.anl.gov
  • roberts-desktop.mcs.anl.gov
  • rouxamd64.mcs.anl.gov
  • seed-linux-1.mcs.anl.gov
  • smithy.mcs.anl.gov
  • sson-desktop.mcs.anl.gov
  • strat.mcs.anl.gov
  • strength.mcs.anl.gov
  • wayne.mcs.anl.gov
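
If you want a starting point for double checking your files, one option is to scan a directory tree for regular files that are both empty and group-writable, since those match the symptom described above. The following is only a minimal Python sketch, not an official tool: the function name `find_suspect_files` is made up for this illustration, and an empty file is not proof of data loss, because some files are legitimately empty. Treat the output as a list of candidates to review by hand.

```python
import os
import stat
from pathlib import Path


def find_suspect_files(root):
    """Return empty, group-writable regular files under root, sorted by path.

    Hypothetical helper for illustration only. It flags candidates that
    match the symptom in this note (a group-writable file replaced with an
    empty file); it cannot tell whether a file was empty to begin with.
    """
    suspects = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            try:
                # lstat so that symlinks are inspected themselves, not followed
                info = path.lstat()
            except OSError:
                # File vanished or is unreadable; skip it
                continue
            is_regular = stat.S_ISREG(info.st_mode)
            is_empty = info.st_size == 0
            is_group_writable = bool(info.st_mode & stat.S_IWGRP)
            if is_regular and is_empty and is_group_writable:
                suspects.append(path)
    return sorted(suspects)
```

For example, calling `find_suspect_files` on your home or project directory from one of the known safe machines and printing each returned path gives a list to compare against what you expect to be there. Only reading metadata is involved, so the scan itself does not write to any file.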
