Longjobs -- Service Status Archive |
|
This page summarizes previous Longjobs events and outages.
Tue Nov 8 2005
: Planned maintenance on Wed. Nov. 9, 5pm
-
The longjobs master will be brought down at 5pm Wednesday, Nov. 9, for
a software upgrade. The outage should be brief, but, during that time,
new jobs may not be added to the queue. Jobs already running on an
execution machine will NOT be interrupted. No jobs will be deleted
from the queue.
Tue Sep 20 2005
:
Planned upgrades, beginning Wed. Sep. 21
-
Beginning Wednesday, September 21, execution machines will be taken
offline for software updates. Jobs already running on an execution
machine will NOT be interrupted, and no jobs will be deleted from the
queues, but some of the machines serving the queues will become
unavailable for some periods.
Tue Aug 10 2004
: Planned maintenance on Tue. Aug. 17, 1-5pm
-
The longjobs master will be brought down at 1pm Tuesday, Aug. 17, for
hardware and software upgrades. The work is expected to take up to
4 hours. Jobs may not be added to the queue for the duration of the outage.
Jobs already running on an execution machine will NOT be
interrupted. No jobs will be deleted from the queue.
Tue Aug 03 2004
:
Planned upgrades, beginning Tue. Aug. 10
-
Beginning Tuesday, August 10, execution machines will be taken offline
for hardware and software updates. Jobs already running on an execution
machine will NOT be interrupted, and no jobs will be deleted from the queues,
but some of the machines serving the queues will become unavailable for some
periods.
Sun Aug 03 2003
:
Planned upgrades, beginning Wed. Aug. 6
-
Beginning Wednesday, August 6, execution machines will be taken offline
for software updates. Jobs already running on an execution machine
will NOT be interrupted, and no jobs will be deleted from the queues,
but machines serving the queues will become unavailable for some periods.
Mon Jun 30 2003
:
SGI slaves decommissioned as of July 1 2003
-
As of July 1 2003, the SGI Athena platform is no longer supported.
Accordingly, the SGI machines are being removed from the longjobs
slave pool.
Mon Jan 13 2003, 1:18PM: planned maintenance
- There will be an outage on the longjobs master on Tuesday, Jan 14 at around noon, in order to perform maintenance and install a software update. The work is expected to take about an hour. During that time, users will be unable to submit new jobs, or obtain status on existing jobs.
Tue Aug 27 2002, noon: planned maintenance
Wed Aug 21 2002, 10:00 pm: planned update
Fri Feb 8 2002, 3:00-5:00 pm: planned upgrades
Mon Oct 22 2001, 2:00-2:20 am: planned network outage
-
Beginning at 2:00am, there will be a network outage for campus backbone
work which may affect the longjobs service. The work is expected to be
completed within 20 minutes; during that time, you will not be able to
interact with the service. Jobs which have already started should
continue to run, but they may encounter problems accessing network file
servers during that time. (The outage may also affect the main web
servers, in which case this site would not be available; any updated
information on the outage should be available from the IS Services
Status Page at http://nic.mit.edu/3down.)
Sun Oct 13 2001, 6-9am: planned machine room outage
August 27-31, 2001: planned upgrades, completed Aug 29
- Machines will be taken offline for updates over the course of the week;
running jobs should not be interrupted, but machines serving the queues
will become unavailable for some periods.
Sun Apr 22 2001: planned network outages
- [posted 7pm Thu Apr 19 2001]
From 10:00-10:30am, there will be short network outages during
infrastructure upgrades in w92 which may affect the longjobs service.
The network is expected to lose connectivity for a minute or two, once
or twice within that period; during those times, you will not be able to
interact with the service. Jobs which have already started should
continue to run, but they may encounter problems accessing network file
servers during that time. (The outage will also affect the main web
servers, so this site will not be available; any updated information on
the outage should be available from the IS Services Status Page at http://nic.mit.edu/3down.)
Sun Apr 1 2001: planned network outage
- [posted 6pm Fri Mar 30 2001]
Beginning at noon, there will be an upgrade to network equipment in w92
which may affect the longjobs service. The work is expected to be
completed within 15 minutes; during that time, you will not be able to
interact with the service. Jobs which have already started should
continue to run, but they may encounter problems accessing network file
servers during that time. (The outage will also affect the main web
servers, so this site will not be available; any updated information on
the outage should be available from the IS Services Status Page at http://nic.mit.edu/3down.)
Sun Mar 18 2001: planned network outage occurred
12:00-12:18 pm
- [posted 3pm Sat Mar 17 2001]
Beginning at noon, there will be an upgrade to network equipment which
may affect the longjobs service. The work is expected to be completed
within 20 minutes; during that time, you will not be able to interact
with the service. Jobs which have already started should continue to
run, but they may encounter problems accessing network file servers
during that time. (The outage will also affect the main web servers, so
this site will not be available; any updated information on the outage
should be available from the IS Services Status Page at http://nic.mit.edu/3down.)
- [posted 1pm Sun Mar 18 2001]
The previously announced
network upgrade work has been completed; it lasted from noon until
approximately 12:18pm. Jobs which had already started continued to run,
but output should be checked for missing or inconsistent data.
(If your
job tried to access network file servers during the outage, the
stderror
file may contain messages such as "Connection timed out".)
Mar 1 2001: test began
Last modified: Thu, Jan 2 2003