Technical Staff Blog

Category archives: Service Issues

Tstaff announcements related to general IT service issues

RSS feed of Service Issues

Last update on .

Due to campus-wide service outages, the main sections of the CS Dept website, including the home page, have been inaccessible for an extended period of time today, and remain down.  We are monitoring CIS' progress in addressing their issues, and we will ensure that the CS website returns to service when they are resolved.

Last update on .

Common SSH Issues/Fixes

ssh.png

On August 24th, the ssh.cs.brown.edu gateway servers were upgraded to stretch and bound to the CIS OIM domain, ad.brown.edu.  The sunlab and mlab ssh client systems were also upgrade on that day and bound to ad.brown.edu.  As of version 7.0, the openssh server has disabled support for ...

Last update on .

network-2400px.png CIS replaced the 5th floor UPS in the switch closet this morning. Unfortunately, two of the switches failed to boot up again. CIS is working on replacing the failed switches, we will update you with an ETA once CIS provides us an update.

UPDATES:

08:02: CIS estimates it will be another hour before the ...

Last update on .

ibm-gpfs.jpg We are currently experiencing some issues with our GPFS file system, which are causing logins to the departmental systems to hang. Updates to this blog post will be added as we debug the issue.

08:34 -  A file system fsck process appears to be hung. A support call to IBM has been initiated as we ...

Last update on .

whoa-1444580.jpg At about 1am this morning, disk hardware providing a backing store for our VMWare machines went offline. This caused nearly every one of our production servers to go offline as well as all our hosting class machines provisioned for users and research groups. CIS is investigating the issue and we will post updates as we ...

Last update on .

power-outage.jpg We experienced some sort of power blip on Saturday, likely the result of the storms that rolled through. This took out a number of grid machines. The majority of the machines are back and operational again, but obviously any jobs running on the machines have been killed. There are still about two dozen machines we ...

Last update on .

network-2400px.png This was supposed to be a trouble free morning... just power up the switch we moved downstairs yesterday and add our redundancy back in. It seems to be a trend with this project, things not going quite as we planned...

07:14: The second distribution switch was powered on at 7am. Unfortuantely, something happened that ...