System health monitoring
System health monitoring detects errors in the monitored components, so that you can take corrective action to avoid a failure. The following components are monitored on a NetScaler SDX appliance:
- Hardware and software resources
- Physical and virtual disks
- <Hardware sensors, such as fan, temperature, voltage, and power supply sensors
- Interfaces
In the Monitoring tab, click System Health. A summary of all the components is displayed. To view details of the monitored components, expand System Health, and then click the component that you want to monitor.
-
Monitoring the Resources on the SDX Appliance
You can monitor the hardware and software components on the SDX appliance and take corrective action if necessary. To view the components monitored, in the Monitoring tab, expand System Health, and then click Resources. Details are displayed for hardware and software resources. For all hardware components, current and expected values are displayed. For software components, except the BMC firmware version, current and expected values are displayed as not applicable (NA).
- Name: Name of the component, such as CPU, memory, or BMC firmware version.
- Status: State (condition) of the component. For Hardware and for BMC Firmware Version, ERROR indicates a deviation from the expected value. For calls to Citrix Hypervisor, ERROR indicates that the Management Service is unable to communicate with Citrix Hypervisor by using an API, HTTP, PING, or SSH call. For the Health Monitor plug-in, ERROR indicates that the plug-in is not installed on the Citrix Hypervisor.
- Current Value: Current value of the component. In normal conditions, the current value is the same as the expected value.
- Expected Value: Expected value for the component. Does not apply to software calls to Citrix Hypervisor.
Monitor the storage resources on the SDX appliance
You can monitor the disks on the SDX appliance and take corrective action if necessary. To view the components monitored, in the Monitoring tab, expand System Health, and then click Storage. Details are displayed for physical disks and for virtual disks or partitions created from physical disks.
For disks (Disk), the following details are displayed:
- Name The name of the physical disk.
- Size: Size of the disk, in GB.
- Utilized: Amount of data on the disk, in GB.
-
Transactions/s: Number of blocks being read or written per second. This number is read from the
iostat
output. - Blocks Read/s: Number of blocks being read per second. You can use this value to measure the rate of output from the disk.
- Blocks Written/s: Number of blocks being written per second. You can use this value to measure the rate of input to the disk.
- Total Blocks Read: Number of blocks read since the appliance was last started.
- Total Blocks Written: Number of blocks written since the appliance was last started.
For virtual disks or partitions (Storage Repository), the following details are displayed:
- Drive Bay: Number of the drive in the drive bay. You can sort the data on this parameter.
-
Status: State (condition) of the drive in the drive bay. Possible values:
- GOOD: The drive is in a good state and is ready for use.
- FAIL: The drive has failed and must be replaced.
- MISSING: A drive is not detected in the drive bay.
- UNKNOWN: A new unformatted drive exists in the drive bay.
- Name: System defined name of the storage depository.
- Size: Size of the storage repository, in GB.
- Utilized: Amount of data in the storage repository, in GB.
Monitor the hardware Sensors on the SDX appliance
You can monitor the hardware components on the SDX appliance and take corrective action if necessary. In the Monitoring tab, expand System Health, and then click Hardware Sensors. The monitoring function displays details about the speed of different fans, the temperature and voltage of different components, and the status of the power supply.
For fan speed, the following details are displayed:
- Name: Name of the fan.
- Status: State (condition) of the fan. ERROR indicates a deviation from the expected value. NA indicates that the fan is not present.
- Current Value (RPM): Current rotations per minute.
Temperature information includes the following details:
- Name: Name of the component, such as CPU or memory module (for example, P1-DIMM1A.)
- Status: State (condition) of the component. ERROR indicates that the current value is out of range.
- Current Value (Degree C): Current temperature, in degrees, of the component.
Voltage information includes the following details:
- Name: Name of the component, such as CPU core.
- Status: State (condition) of the component. ERROR indicates that the current value is out of range.
- Current Value (Volts): Current voltages present on the component.
Information about the power supply includes the following details:
- Name: Name of the component.
-
Status: State (condition) of the component. Possible values:
- Error: Only one power supply is connected or working.
- OK: Both the power supplies are connected and working as expected.
Monitor the interfaces on the SDX appliance
You can monitor the interfaces on the SDX appliance and take corrective action if necessary. In the Monitoring tab, expand System Health, and then click Interfaces. The monitoring function details the following information about each interface:
- Interface: Interface number on the SDX appliance.
- Status: State of the interface. Possible values: UP, DOWN.
- VFs Assigned/Total: Number of virtual functions assigned to the interface, and the number of virtual functions available on that interface. Different platforms support a different number of VFs.
- Tx Packets: Number of packets transmitted since the appliance was last started.
- Rx Packet: Number of packets received since the appliance was last started.
- Tx Bytes: Number of bytes transmitted since the appliance was last started.
- Rx Bytes: Number of bytes received since the appliance was last started.
- Tx Errors:Number of errors in transmitting data since the appliance was last started.
- Rx Errors: Number of errors in receiving data since the appliance was last started.