The HECToR Service is now closed and has been superseded by ARCHER.

HECToR Monthly Report, Phase 2B system, July 2010

Information on the utilisation, disk allocations, slowdowns and helpdesk statistics can be found in the associated SAFE monthly report.

Dates covered: 08:00 1 July 2010 to 08:00 1 August 2010
Number of hours: 744
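
The hour count follows directly from the reporting window. A minimal sketch, assuming the two timestamps are in the same local time with no clock change in between (the variable names are illustrative only):

```python
from datetime import datetime

# Reporting window taken from the "Dates covered" line.
start = datetime(2010, 7, 1, 8, 0)
end = datetime(2010, 8, 1, 8, 0)

# Elapsed wall clock hours between the two timestamps.
hours_covered = (end - start).total_seconds() / 3600
print(hours_covered)  # 744.0
```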

1: Availability

Scheduled down time: 11 hours 24 minutes.

Incidents

The following incidents were recorded:

Severity Number
1 6
2 0
3 19
4 0

Details of severity level 1 incidents

ID Date Description Length Attribution
Incident-4021 07/07/2010 Maintenance session over-run 09:04 Cray
Incident-4031 08/07/2010 Shutdown due to Lustre problem 06:12 Cray
Incident-4066 17/07/2010 High Speed Network failure 02:09 Cray
Incident-4111 20/07/2010 Filesystem mount error 01:15 Cray
Incident-4176 28/07/2010 OSS failure 01:38 Cray
Incident-4191 30/07/2010 Link Inactive 01:12 Cray
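
The individual incident lengths can be summed to cross-check the total unscheduled down time (UDT) reported in the MTBF and Serviceability table. A minimal sketch, assuming the Length column is in hours:minutes; the dictionary and helper names are illustrative and not part of the report:

```python
# Incident lengths copied from the severity level 1 table,
# assumed to be in hours:minutes.
incident_lengths = {
    "Incident-4021": "09:04",
    "Incident-4031": "06:12",
    "Incident-4066": "02:09",
    "Incident-4111": "01:15",
    "Incident-4176": "01:38",
    "Incident-4191": "01:12",
}


def to_minutes(length: str) -> int:
    """Convert an 'HH:MM' string to a whole number of minutes."""
    hours, minutes = length.split(":")
    return int(hours) * 60 + int(minutes)


total_minutes = sum(to_minutes(v) for v in incident_lengths.values())
total_hhmm = f"{total_minutes // 60:02d}:{total_minutes % 60:02d}"
print(total_hhmm)  # 21:30
```

All six incidents are attributed to Cray, so the same total applies to both the Cray and the Overall rows.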

MTBF and Serviceability

Attribution Failures MTBF UDT Serviceability
Cray 6 122 21:30:00 97.1%
Site 0 - 00:00:00 100%
External 0 - 00:00:00 100%
Other 0 - 00:00:00 100%
Overall 6 122 21:30:00 97.1%
  • Note 1: Serviceability% = 100*(WCT-SDT-UDT)/(WCT-SDT), where WCT is wall clock time, SDT is scheduled down time and UDT is unscheduled down time.
  • Note 2: MTBF (Mean Time Between Failures) is defined as 732/Number of failures.
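
The two notes can be checked against this month's figures. A minimal sketch, reading WCT as wall clock time (744 hours), SDT as scheduled down time (11 hours 24 minutes) and UDT as unscheduled down time (21 hours 30 minutes); the function names are illustrative only:

```python
WCT = 744.0            # wall clock hours in the reporting period
SDT = 11 + 24 / 60     # scheduled down time, in hours
UDT = 21 + 30 / 60     # unscheduled down time, in hours
FAILURES = 6           # severity level 1 incidents
HOURS_PER_MONTH = 732  # constant used in Note 2


def serviceability(wct: float, sdt: float, udt: float) -> float:
    """Note 1: percentage of non-scheduled-down time the service was up."""
    return 100 * (wct - sdt - udt) / (wct - sdt)


def mtbf(failures: int, hours: float = HOURS_PER_MONTH) -> float:
    """Note 2: mean time between failures, in hours."""
    return hours / failures


print(round(serviceability(WCT, SDT, UDT), 1))  # 97.1
print(mtbf(FAILURES))                           # 122.0
```

Note that `mtbf` is undefined for zero failures, which is the case for the Site, External and Other rows.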

2: Performance Statistics

Technology Provision

Description Value
Technology reliability 97.1%
Technology throughput 8647 hours
Capability job completion rate 100%
Technology MTBF 122 hours

Note: Technology throughput is calculated as 12*(732-UDT-SDT), where 732 is the annual average number of hours in a month.

Note: MTBF is calculated as 732/number of failures