
Storage Area Network Quarterly Report - Q1 2009

Report Period: January 2009 - March 2009

General Service Availability

There were no observed or reported breaks in service availability to host servers during this period. See "Other information" at the end of this page.

SAN Storage details

Summary

 

                                TOTAL      Array1    Array2    Array3    Array4
Raw capacity total              173.54TB   61.02TB   38.76TB   23.42TB   50.35TB
Raw capacity allocated          148.57TB   55.3TB    30.88TB   23.42TB   38.98TB
Raw capacity unused             24.97TB    5.73TB    7.88TB    0TB       11.37TB

Volume groups allocated         109.79TB   45.26TB   23.43TB   17.17TB   30.02TB
Parity/Hot Spare allocated      32.71TB    10.04TB   7.45TB    6.26TB    8.97TB

Disks used                      526        176       101       114       135
Disks spare                     73         14        23        0         36

Storage Partitions licensed                16        64        16        16
Storage Partitions used                    14        9         5         12
Storage Partitions available               2         55        11        4

Tray spaces                     40         0         0         19        21

 

 

Departmental / group usage summary


SAN controller drive loop utilisation

                      Drive channels                     Spaces left
                      A1/A2   A3/A4   B1/B2   B3/B4      Loop 1   Loop 2
Array 1   CSM trays   2       3       3       2          0        0
          Old trays   5       4       4       5
Array 2   CSM trays   2       3       3       2          3        0
          Old trays   2       4       4       2
Array 3   CSM trays   3       0       0       3          4        0
          Old trays   0       8       8       0
Array 4   CSM trays   3       0       0       3          4        0
          Old trays   0       8       8       0

 

Storage Network (fabric) details

 

Fibre Channel port usage

Site                  Fabric A               Fabric B
                      In use   Available     In use   Available
UH Machine Room       23       24            22       26
CSC Machine Room      6        18            0        24
Bio Sciences          0        14            0        0
Maths (Backup)        0        14            0        0
WBS Machine Room      8        16            0        24
ITS Machine Room      8        14            8        16
Westwood Test Lab     -        -             -        -
Total                 45       100           30       90
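The per-fabric totals and an overall utilisation figure can be recomputed from the per-site rows, as in the minimal sketch below. It assumes that "Available" counts free ports in addition to those in use, so that a fabric's port total is in use plus available; the report does not state this explicitly. The names `FABRIC_A`, `FABRIC_B`, `fabric_totals` and `utilisation_percent` are hypothetical.

```python
# Port counts copied from the Fibre Channel port usage table: (in use, available).
# Westwood Test Lab is omitted because the table gives no figures for it.
FABRIC_A = {
    "UH Machine Room": (23, 24),
    "CSC Machine Room": (6, 18),
    "Bio Sciences": (0, 14),
    "Maths (Backup)": (0, 14),
    "WBS Machine Room": (8, 16),
    "ITS Machine Room": (8, 14),
}
FABRIC_B = {
    "UH Machine Room": (22, 26),
    "CSC Machine Room": (0, 24),
    "Bio Sciences": (0, 0),
    "Maths (Backup)": (0, 0),
    "WBS Machine Room": (0, 24),
    "ITS Machine Room": (8, 16),
}


def fabric_totals(ports):
    """Sum the in-use and available port counts across all sites."""
    in_use = sum(used for used, _ in ports.values())
    available = sum(free for _, free in ports.values())
    return in_use, available


def utilisation_percent(in_use, available):
    """Percentage of ports in use.

    ASSUMPTION: "available" ports are in addition to those in use,
    so the total port count is in_use + available.
    """
    total = in_use + available
    return 100.0 * in_use / total if total else 0.0


for name, ports in (("Fabric A", FABRIC_A), ("Fabric B", FABRIC_B)):
    in_use, available = fabric_totals(ports)
    print(f"{name}: {in_use} in use, {available} available, "
          f"{utilisation_percent(in_use, available):.1f}% utilised")
```

The recomputed sums match the Total row of the table (45 and 100 for Fabric A, 30 and 90 for Fabric B).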

 

Inter-switch link (ISL) utilisation

Statistics not yet available

 

General SAN storage service

Errors / faults / Incidents

Fault                               Fix       Impact    Callout   Count   Fix time
Exchange mirror desynchronisation   Manual    None      No        4       Variable
Read error                          Auto      None      No        5       n/a
Volume communication failure        Auto      None      No        2       ~120 sec
Drive failure                       Replace   None      Yes       1       3 hours
Controller reset                    Auto      Unknown   No        1       103 sec
Cache battery lifetime reached      Replace   None      Yes       1       26 days

 

Service requests

Request                                    Count
New volumes from existing storage          11
Volumes decommissioned                     2
Volumes extended                           1
Volumes reconfigured
Quotation for new storage (In progress)    3 [1]

Response times have not been recorded to date.

All requests must be logged via HEAT and also noted in the storage service logs.

[1] The storage has been received and is scheduled for installation in the week commencing 20.04.2009.

 

Significant changes this quarter

Four new fabric switches were added:

  • Two 32-port Brocade 5100 switches (production fabric) at University House
  • Two 24-port Brocade 300 switches (test / pre-production) at the Westwood test lab
  • The switches are awaiting cross-campus fibre runs.
  • A Sun 5320 storage gateway has been installed and is undergoing initial testing, specifically iSCSI tests.

 

Significant changes planned for the future

Move of UOW03 (Exchange) storage node to new University House switches

  • not essential, but preferred for future expansion
  • may not require service downtime
  • will require brief pause in data mirror replication
  • requires consultation with and assistance from Sun
  • expected date 2009Q2

 

Move of WBS storage node to CSC Machine Room

  • requires approximately one day downtime for some systems (mainly CSC)
  • requires consultation with and assistance from Sun
  • expected date 2009Q2

 

Upgrade of storage controllers to latest firmware

  • requires adequate backup of all data
  • involves significant downtime for all attached systems
  • approximately 2 hours per array
  • likely split over two days
  • expected date Summer 2009

 

Other information

Sunday 29.03.2009, 02:30

Storage array UOWARRAY01 restarted. This process took 103 seconds. There were no reports of service unavailability, and the storage controllers are designed to run each other's workload during this process. This incident is therefore not believed to have caused a loss of availability to any host server.

103 seconds in 90 days (7,776,000 seconds) constitutes approximately 0.0013% of total service time. Arrays 2, 3 and 4 had no known breaks in service during this quarter.
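The restart time as a share of the quarter can be worked out directly. The sketch below is illustrative only: it assumes the 90-day period used in the text, and the function name `downtime_percent` is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def downtime_percent(outage_seconds: float, period_days: float) -> float:
    """Return an outage as a percentage of the whole reporting period."""
    return 100.0 * outage_seconds / (period_days * SECONDS_PER_DAY)


# 103-second restart of UOWARRAY01 measured against a 90-day quarter.
restart_pct = downtime_percent(103, 90)
print(f"Restart time:   {restart_pct:.4f}% of the period")        # 0.0013%
print(f"Remaining time: {100.0 - restart_pct:.4f}% of the period")  # 99.9987%
```

Even if the restart were counted as a full loss of service, the array would still have been available for more than 99.99% of the quarter.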