News
Recent News
Date & Time | Details |
09/22/2020 15:41 EDT | BGFS Maintenance Window Extension: https://cwa.rc.usf.edu/news/405
Hello, Due to issues encountered during the planned maintenance on BGFS, the maintenance window has been extended for an additional 24 hours. Service is expected to be restored by 10 AM EDT on September 24, 2020, and RC administrators will provide updates as available. |
09/18/2020 10:28 EDT | This serves as the final reminder of our previously posted BGFS news: https://cwa.rc.usf.edu/news/400
|
09/16/2020 9:06 EDT | This serves as a reminder of our previously posted BGFS news: https://cwa.rc.usf.edu/news/400
|
09/10/2020 9:34 EDT | This serves as a reminder of our previously posted BGFS news: https://cwa.rc.usf.edu/news/400
|
09/04/2020 15:37 EDT | On Wednesday, September 3, 2020 at approximately 08:41 EDT Research Computing administrators noticed a discrepancy in the file system and took administrative action to correct the issue.
However, Research Computing monitoring software and file system logs indicate that several more metadata consistency issues were observed, and 3 resyncs were automatically attempted by the file system management software - which failed. Research Computing administrators intervened and did not observe any errors logged within the system. To ensure that the file system was in a clean state, a manual resync was initiated at approximately 15:28 EDT and which completed at 16:26 EDT with errors logged. At this point, action was taken across the cluster to ensure that no intensive I/O would be present on the file system. A decision was made to terminal all jobs and temporarily disable Samba/CIFS (Windows/Mac networked drives via \\cifs-pgs.rc.usf.edu) on some shares. Unfortunately, another metadata processing issue was reported via system logs, resulting in an automatic resync process being initiated sometime around 17:16 EDT. The standard start messages weren't present in the logs, which is concerning itself. Due to this instability, Research Computing administrators contacted the vendor for assistance. Research Computing was then instructed to disable certain features of the file system causing the issue, in an effort to restore connectivity and access to user data. Disabling these features ensured that further issues wouldn't occur again. Per the vendor's recommendation, Research Computing administrators restored access to Samba/CIFS (Windows/Mac networked drives via \\cifs-pgs.rc.usf.edu) at 17:27 EDT and access to /work_bgfs and /shares_bgfs at 17:56 EDT the via the computational cluster. The vendor has advised Research Computing that we must apply a patch to restore complete functionality, and it will require a reformat of all disks associated with the system. Research Computing will perform this work beginning September 21, 2020 at 10:00 EDT. This work will only affect the /work_bgfs scratch space. As you know, this space is not backed-up and is considered volatile storage [0]. This work does not affect /shares_bgfs, but during the reformat the data will be inaccessible. Any questions and/or comments can be sent to rc-help@usf.edu. [0] https://wiki.rc.usf.edu/index.php/CIRCE_Data_Archiving#Work_Directory_.28.2Fwork_or_.2Fwork_bgfs.29 |