Viewing Peripheral Device Controller Status

To view the status of each controller, from the Main Menu, choose "view and edit Peripheral devices View Peripheral Device Status."

A table displays the status of the available peripheral devices.

FIGURE 12-1 View Peripheral Device Status

Viewing SES Status (FC and SATA Only)

A Fibre Channel array's SCSI Enclosure Services (SES) processor is located on the I/O module. The SES processor monitors chassis-based environmental conditions such as temperature sensor readings, cooling fans status, the beeper speaker condition, power supplies, and slot status. The SES processor is supported by Sun StorEdge Configuration Service and the Sun StorEdge CLI. These chassis sensors are separate from the controller sensors described in Viewing Controller Voltage and Temperature Status.

For Sun StorEdge 3510 FC JBOD arrays and Sun StorEdge 3511 SATA JBOD arrays, both Sun StorEdge Configuration Service and the Sun StorEdge CLI access the SES processor using device files in /dev/es, such as /dev/es/ses0, as shown in the following example.

# sccli

Available devices:

1. /dev/rdsk/c4t0d0s2 [SUN StorEdge 3310 SN#000280] (Primary)

2. /dev/es/ses0 [SUN StorEdge 3510F D SN#00227B] (Enclosure)

To Check the Status of SES Components (FC and SATA Only)

1. From the Main Menu, choose "view and edit Peripheral devices View Peripheral Device Status SES Device" to display a list of environmental sensors and other hardware components of that SES device.

2. Select an item from the list and press Return to display information about it or see a list of its component attributes.

Choosing Overall Status, as in the illustration above, displays the status of the SES device and its operating temperature.

Overall status of an SES device is reported independently from the status of the individual components of that device. An SES device showing an overall status in the menu has its own sensors that report its overall status and its overall temperature.

3. Select other attributes in which you are interested and press Return to learn more about the SES device.

Selecting the Element Descriptor in the following illustration displays the descriptive name of the element.

In this case the descriptor is Disk Drives.

Identifying Fans (FC and SATA Only)

You can view the status of SES components, including the pair of fans located in each power supply module. A fan is identified in the SES Device menus as a cooling element.

To View the Status of Each Fan

In some cases you have to "drill down" to display information about components, as as shown in the following illustrations. The following series of screens provide the fan (cooling element) status for each fan.

2. Choose one of the elements (element 0, 1, 2, or 3).

Normal fan speeds are indicated by numbers 1 through 7, indicating speeds in the normal range of 4000 to 6000 RPM. The number 0 indicates that the fan has stopped.

TABLE 12-1 Fan Status and Fan Speeds

Fan Status

Fan RPM

0 Fan stopped

0 - 3999

1 Fan at lowest speed

4000 - 4285

2 Fan at second lowest speed

4286 - 4570

3 Fan at speed 3

4571 - 4856

4 Fan at speed 4

4857 - 5142

5 Fan at speed 5

5143 - 5428

6 Fan at speed at intermediate speed

5429 - 5713

7 Fan at highest speed

5714 to maximum

If a fan fails and the Status field does not display the OK value, you must replace the power supply module and fan.

Cooling elements in the status table can be identified for replacement as shown in TABLE 12-2. Cooling fan locations are identified in FIGURE 12-2.

SES Temperature Sensor Locations (FC and SATA Only)

Monitoring temperature at different points within the array is one of the most important SES functions. High temperatures can cause significant damage if they go unnoticed. There are a number of different sensors at key points in the enclosure. The following table shows the location of each of those sensors. The element ID corresponds to the identifier shown when you choose "view and edit Peripheral devices View Peripheral Device Status SES Device Temperature Sensors."

Note - Press the down arrow to access an element ID that is not currently displayed in the displayed list of sensors.

TABLE 12-3 Temperature Sensor Locations (FC and SATA)

Element ID

Description

0

Drive Midplane Left Temperature Sensor #0

1

Drive Midplane Left Temperature Sensor #1

2

Drive Midplane Center Temperature Sensor #2

3

Drive Midplane Center Temperature Sensor #3

4

Drive Midplane Right Temperature Sensor #4

5

Drive Midplane Right Temperature Sensor #5

6

Upper I/O Module (IOM) Left Temperature Sensor #6

7

Upper I/O Module (IOM) Left Temperature Sensor #7

8

Lower I/O Module (IOM) Temperature Sensor #8

9

Lower I/O Module (IOM) Temperature Sensor #9

10

Left Power Supply Temperature Sensor #10

11

Right Power Supply Temperature Sensor #11

SES Voltage Sensors (FC and SATA Only)

Voltage sensors make sure that the array's voltages are within normal ranges. The voltage components differ for the Sun StorEdge 3510 FC array and the Sun StorEdge 3511 SATA array.

SES Power Supply Sensors (FC and SATA Only)

Each Sun StorEdge 3510 FC array and Sun StorEdge 3511 SATA array has two fully redundant power supplies, with load-sharing capabilities. The sensors monitor the voltage, temperature and fan units in each power supply.

TABLE 12-6 Power Supply Sensors (FC and SATA)

Element ID

Description

Location

Alarm Condition

0

Left Power Supply 0

Left viewed from the rear

Voltage, temperature, or fan fault

1

Right Power Supply 1

Right viewed from the rear

Voltage, temperature, or fan fault

Viewing Peripheral Device SAF-TE Status (SCSI Only)

A SCSI array's SAF-TE processor is located on the SCSI I/O module. It controls environmental monitoring of SAF-TE devices contained in the chassis such as temperature sensors, cooling fans, the beeper speaker, power supplies, and slot status. These chassis sensors are separate from the controller sensors described in Viewing Controller Voltage and Temperature Status.

To Check the Status of SAF-TE Components (SCSI Only)

The temperature sensor displays the current temperature of each sensor in degrees Fahrenheit.

When a drive slot is filled, the drive slot row displays a SCSI ID number.

In a single-bus configuration, ID numbers 0 through 5 and 8 through 13 are shown if all 12 drives are filled (SCSI IDs 6 and 7 are reserved for host communication). Wherever a slot is empty, the message "No Device Inserted" is displayed. See FIGURE 12-3.

FIGURE 12-3 Example of the SAF-TE Device Status Window for a Single-Bus Configuration

The SAF-TE protocol does not support a split-bus configuration and recognizes only one bus (half the drives) if you have a split-bus configuration. As a result, in a 12-drive split-bus configuration you see the message "Unknown" for six drives on one channel, but you see the ID numbers for the six drives on the other channel, as shown in FIGURE 12-4.

Identifying Fans (SCSI Only)

You can view the status of SAF-TE components, including the pair of fans located in each power supply module. A pair of fans is identified in the SAF-TE Device Status window as Cooling Fan 0 or Cooling Fan 1.

If a fan fails and the Status field does not display the Operational value, you must replace the power supply module and fan.

Cooling elements in the status table can be identified for replacement as shown in TABLE 12-2. Cooling fan locations are identified in FIGURE 12-5.

TABLE 12-7 Location of Cooling Fans

Cooling Element #

Fan # and Power Supply Module #

Cooling Fan 0

FANS 0 AND 1, PS 0

Cooling Fan 1

FAN 2 AND FAN3, PS 1

FIGURE 12-5 Cooling Fan Locations

SAF-TE Temperature Sensor Locations (SCSI Only)

Monitoring temperature at different points within the array is one of the most important SAF-TE functions. High temperatures can cause significant damage if they go unnoticed. There are a number of different sensors at key points in the enclosure. The following table shows the location of each of those sensors. The Element ID corresponds to the identifier shown when you choose "view and edit Peripheral devices View Peripheral Device Status SAF-TE Device."

TABLE 12-8 Temperature Sensor Locations (SCSI)

Temp Sensor ID

Description

0

Port A Drive Midplane Temperature #1

1

Port A Drive Midplane Temperature #2

2

Port A Power Supply Temperature #1 (PS 0)

3

Port B EMU Temperature #1 (left module as seen from back)

4

Port B EMU Temperature #2 (right module as seen from back)

5

Port B Drive Midplane Temperature #3

6

Port B Power Supply Temperature #2 (PS 1)

CPU Temperature

CPU on Controller

Board1 Temperature

Controller

Board2 Temperature

Controller

SAF-TE Power Supply Sensors (SCSI Only)

Each Sun StorEdge 3310 SCSI array and Sun StorEdge 3320 SCSI array has two fully redundant power supplies, with load sharing capabilities. The sensors monitor the voltage, temperature and fan units in each power supply.

1. From the Main Menu, choose "view and edit Peripheral devices Set Peripheral Device Entry Redundant Controller - Primary" to display the following message.

Deassert Reset on Failed Controller ?

2. Choose Yes to restore the controller that you previously force-failed.

3. Allow several minutes for the failed controller to come back online.

The following message notifies you when the controller is back online:

Controller Default Write Policy Restored

Event Trigger Operations

Event trigger operations configure an array so that it dynamically switches from write-back-enabled to write-back-disabled (write-through) if a specified failure occurs or threshold is exceeded. Once the problem is corrected, the original write policy is restored.

This change affects the write policy of all logical drives except those whose individual policy has been changed to override the global default write policy for the array.

Except for the "Temperature exceeds threshold -" menu option, these trigger operations toggle between being enabled and being disabled each time you change the setting.

Configuring the Controller Failure Event Trigger

If the array has been configured with the write-back cache mode enabled, enable this menu option if you want the array to automatically revert to write-through cache mode (write-back disabled) if one controller in a dual controller array fails.

If the array has been configured with the write-back cache mode enabled, enable this menu option if you want the array to automatically revert to write-through cache mode (write-back disabled) if an array's battery backup fails or falls below its lower threshold.

To Enable or Disable the BBU Low Event or BBU Failed Event Trigger

From the Main Menu, choose "view and edit Peripheral devices Set Peripheral Device Entry Event Trigger Operations BBU Low or Failed," and choose Yes to confirm the change.

Configuring the Power Supply Failed Event Trigger

If the array has been configured with the write-back cache mode enabled, enable this menu option if you want the array to automatically revert to write-through cache mode (write-back disabled) if one of the array's power supplies fails.

To Enable or Disable the Power Supply Failed Event Trigger

From the Main Menu, choose "view and edit Peripheral devices Set Peripheral Device Entry Event Trigger Operations Power Supply Failed," and choose Yes to confirm the change.

Configuring the Fan Failure Event Trigger

If the array has been configured with the write-back cache mode enabled, enable this menu option if you want the array to automatically revert to write-through cache mode (write-back disabled) if one of the array's cooling fans fails.

To Enable or Disable the Fan Failure Event Trigger

From the Main Menu, choose "view and edit Peripheral devices Set Peripheral Device Entry Event Trigger Operations Fan Failure," and choose Yes to confirm the change.

Configuring the Temperature Exceeds Threshold Event Trigger

The "Temperature exceeds threshold -" menu option differs from other event triggers. It forces a controller shutdown---rather than merely a change in cache policy--if a temperature is detected that exceeds system threshold limits. You can adjust this setting to shut down the controller as soon as the temperature limit is exceeded, or after a delay ranging from two minutes to an hour, or disable the controller shutdown entirely. Choose Enable for an immediate shutdown after the upper threshold limit is exceeded, or choose Disable if you want no trigger for this event. Otherwise, select the time intervals you want to elapse after the threshold is exceeded before the controller shutdown takes place.

2. Select the option or interval you want, and then choose Yes to confirm your choice.

Operating in a NEBS Environment

Sun StorEdge 3000 family products are NEBS Class 3-certified. When this equipment is installed and operated in a NEBS-III or other environment that potentially requires the equipment to be operated outside of the normal temperature range, the Over-Temp Controller Shutdown function must be disabled (this includes testing for such operation).

The sensors responsible for shutdown (if enabled) are: Board #1 (85C), Board #2 (85C - and the most likely to cause the shutdown, especially when on the IOM in the upper slot), and CPU (95C).

Note - Operating this equipment outside of the normal temperature range can adversely affect the operational lifetime of the equipment. The severity of the effect depends on the severity of the overtemp condition and length of time it persists.

In addition, there is a thermal switch in each power supply that shuts down the 5VDC & 12VDC current when the switch reaches 95C. This thermal switch cannot be directly monitored, bypassed, or defeated.

Ensure that an air gap exists between RAID controllers and any other equipment in the rack. Otherwise, if two devices make physical contact with each other, thermal conduction between the units can result in higher than expected operating temperatures.

Caution - The controller sensor settings for your array have been optimized for safe and reliable operations. You might see some menu options that include the word "Default," which refers only to default firmware settings for a variety of hardware products that use this firmware. However, these menu options are not necessarily the default settings for your array. Do not change any voltage or temperature threshold parameters unless specifically advised to do so by service personnel.

To Display Controller Voltage and Temperature Status

The components checked for voltage and temperature are displayed and defined as normal or out of order.

To View or Configure Thresholds

Caution - Equipment damage can result from running equipment outside of normal operating conditions. Do not change any voltage or temperature threshold parameters unless specifically advised to do so by service personnel.