In this section:
Hardware alarms are generated on the AMC348 I/O card to indicate the status of local components on the card.
The alarms are accessed from the alarm dashboard in the Web UI. The alarm dashboard provides an indication of system status by showing the number of critical, major, and minor alarms that are generated by system events. Individual alarms that appear in the dashboard are accessed to determine the origin of the alarm. To access the alarms, see section Alarm Dashboard Web User Interface in the Alarms Guide. To view details of individual alarms, see section View Alarms.
The hardware alarms are generated by hardware sensors on the AMC348. There are two types of hardware sensors on the DSC 8000:
Threshold sensors and the corresponding SNMP threshold alarms are described in the following sections:
Discrete sensors and the corresponding SNMP alarms are described in the following sections:
Alarm details provided in the Web UI include the hardware sensor's Portal ID and IPMI number, which are dynamically assigned. The ID assignments change when a card is removed or when a new card is inserted in the DSC 8000 chassis; therefore, are not used to identify a hardware alarm.
All threshold sensors available on the AMC348 I/O card generate alarms. Threshold sensors are monitored by the MMC (sensor IDs are local to the MMC) and trigger an SNMP alarm when a threshold sensor event occurs. Threshold sensor events consist of voltage or temperature values crossing a pre-defined threshold level.
The following table describes the hardware sensors on the AMC348 I/O cards.
IPMI Sensor Name | Alias in HWMON | Description | Units | SNMP Alarms | Alarm Event |
---|---|---|---|---|---|
12V | 12V | 12V voltage sensor on the AMC348 card. | Volts | 6346 - 6351 | Alarms generated on critical, major, and minor lower threshold crossings. |
MGMT 3.3V | MGMT 3.3V | 3.3V Management (IPMI) Power sensor on the AMC348 card. | Volts | 6346 - 6351 | Alarms generated on critical, major, and minor lower threshold crossings. |
Temp | Intake Temp | A temperature sensor on the AMC348 that monitors the airflow temperature at the point of coolest air intake on the card. | C Degrees | 6348 - 6351 | Alarms on generated on critical and major lower threshold crossings (minor crossings excluded in 15.0) |
Mid Board Temp | A temperature sensor on the AMC348 that monitors the airflow temperature at the mid-point on the card. |
Threshold sensor event severity levels are defined as follows:
A threshold sensor triggers an SNMP alarm when a pre-defined voltage threshold level is crossed by the monitored voltage.
The following table shows voltage threshold levels for voltage sensors on AMC348 cards.
Sensor Name | Lower Non-Recoverable Threshold (Alarm 6350) | Lower Critical Threshold (Alarm 6348) | Lower Non-Critical Threshold (Alarm 6346) | Upper Non-Critical Threshold (Alarm 6344) | Upper Critical Threshold (Alarm 6342) | Upper Non-Recoverable Threshold (Alarm 6340) |
---|---|---|---|---|---|---|
MGMT 3.3V | NA | 3.0V | 3.068V | 3.533V | 3.6V | 3.8V |
12V | 9.0V | 10.0V | 10.8V | 13.2V | 14.0V | 15.0V |
A threshold sensor triggers an SNMP alarm when a monitored temperature value crosses a pre-defined temperature threshold level.
The following table shows temperature threshold levels for the temperature sensors on AMC348 I/O cards.
Sensor Name | Units | Lower Non-Recoverable Threshold (Alarm 6350) | Lower Critical Threshold (Alarm 6348) | Lower Non-Critical Threshold (Alarm 6346) | Upper Non-Critical Threshold (Alarm 6344) | Upper Critical Threshold (Alarm 6342) | Upper Non-Recoverable Threshold (Alarm 6340) |
---|---|---|---|---|---|---|---|
Intake Temp | C Degrees | NA | NA | NA | 60C | 80C | NA |
Mid Board | C Degrees | NA | NA | NA | 60C | 80C | NA |
Some versions of DSC software generate minor temperature alarms when the monitored temperature value crosses the Upper Non-Critical (UNC) temperature threshold. Minor temperature threshold crossing events are required for stable operation of the cooling sub-system and the resulting alarms are ignored.
To reduce the occurrence of minor alarms, a modified temperature monitoring function is introduced in DSC software Release 15.0. Temperature alarms are only generated when the monitored temperature value crosses the Upper-Critical (UC) and Upper-Non-Recoverable (UNR) temperature thresholds generating major and critical alarms, respectively.
The following table lists SNMP alarms that are registered when a threshold sensor event occurs on the AMC348 I/O card.
SNMP Alarm Number | Alarm Name | Clearing Alarm |
---|---|---|
6340 | 6341 | |
6341 | N/A | |
6342 | 6343 | |
6343 | N/A | |
6344 | 6345 | |
6345 | N/A | |
6346 | 6347 | |
6347 | N/A | |
6348 | 6349 | |
6349 | N/A | |
6350 | 6351 | |
6351 | N/A |
Discrete sensors return values of 'on' and 'off' or 'true' and 'false'. Each entity in the system has a 'Version Change' sensor that reports the entity's FRU state. These states are described in Intelligent Platform Management Interface Specification Second Generation (v2.0) specification.
The following table describes the discrete hardware sensors on the AMC348 I/O card.
IPMI Sensor Name | Alias in HWMON | Description | SNMP Alarms | Alarm Event |
---|---|---|---|---|
Watchdog | BMC Watchdog | Internal watchdog timer fired. | N/A | No alarm is generated on this event. |
Hot-swap | Hot Swap_AMC3481 | The hot-swap sensor for the AMC348 I/O card. | On card extraction, an alarm is raised on detection of state transitions:
On card insertion, an alarm is raised (clearing) on detection of state transition:
| |
Power Good | POWER GOOD | A PICMG boolean sensor (false = 1, true = 2) that indicates whether or not the power subsystem on the AMC348 I/O card is healthy. | A transition to 'false' (Power Not Good) is detected. A transition to 'true' (Power Good) is detected. | |
Version | Version Change2 | This sensor reports the FRU state on the AMC348 I/O card (MMC/IPMC firmware is changed). The sensor returns a one bit value assigned to eight possible FRU conditions. For example, if bit 0 is set, the condition defined by the value of 00h is present. | N/A | Informational sensor only. No alarms are generated. |
1 Each entity in the system has a hot-swap sensor that reports the entity's FRU state. These states are described in PICMG® 3.0 AdvancedTCA® Base Specification. The sensor returns a one bit value for each of the eight states, M0 - M7, as defined in the specification. For example, if bit 0 is set, the FRU is in state M0. Similarly, if bit 4 is set, the sensor returns a value of 16 (0001000b), which is the Normal (Active) state, M4.
The state values include:
[7] – 1b: FRU Operational State M7 = Communication Lost
[6] – 1b: FRU Operational State M6 = FRU Deactivation In Progress
[5] – 1b: FRU Operational State M5 = FRU Deactivation Request
[4] – 1b: FRU Operational State M4 = FRU Active
[3] – 1b: FRU Operational State M3 = FRU Activation in Progress
[2] – 1b: FRU Operational State M2 = FRU Activation Request
[1] – 1b: FRU Operational State M1 = FRU Inactive
[0] – 1b: FRU Operational State M0 = FRU Not Installed
2 Each entity in the system has a 'Version Change' sensor that reports the entity's FRU state.These states are described in Intelligent Platform Management Interface Specification Second Generation (v2.0) specification. The sensor returns a one bit value assigned to eight possible FRU conditions. For example, if bit 0 is set, then the condition defined by the value of 00h is present. The eight conditions include the following:
00h: hardware change detected (informational). This offset does not indicate whether the hardware change was successful or not, only that a change occurred.
01h: firmware or software change detected (informational).
02h: hardware incompatibility detected
03h: firmware or software incompatibility detected
04h: entity has an invalid or unsupported hardware version
05h: entity contains an invalid or unsupported firmware or software version
06h: hardware change detected on entity was successful (de-assertion event = unsuccessful)
07h: software or firmware change detected on entity was successful (de-assertion event = unsuccessful)
The following table lists SNMP alarms that are registered when a discrete sensor event occurs on the AMC348 I/O card.