Monitoring VMware ESXi Hardware with Nagios or Icinga

From Thomas-Krenn-Wiki
Note: This article or category either refers to older software/hardware components or is no longer maintained for other reasons.
This page is no longer updated and remains available in the archive for reference purposes only.
Figure: System status in the vSphere Client. Information about the LSI RAID controller is supplied by the CIM provider. Server sensors such as RAM temperature, fan speeds, or power supply status are queried by VMware via IPMI.
Figure: Icinga warning after the failure of a hard disk in a server with an X9SCM-F mainboard.

VMware vSphere 6.7, 6.5, 6.0, 5.5, 5.1, and 5.0 provide integrated monitoring of a server's hardware components. VMware checks the status of these components using built-in checks (e.g. for IPMI sensors) and the corresponding CIM providers, for example for hardware RAID controllers.

The check_esxi_hardware.py plugin makes it easy to monitor the hardware system status with Nagios or Icinga.

CIM provider requirements

The CIM provider must pass the hardware status information on to ESXi. This is the case, for example, with the CIM provider for MegaRAID controllers.

Note: The CIM provider for Adaptec RAID controllers is not suitable for this (see "Adaptec RAID Controller in VMware überwachen - Installation CIM Provider und aacraid Treiber").


Plugin

The plugin is available for download from the following website:

Information on exchange.nagios.org:

Using the plugin

We first tested the plugin on a Thomas-Krenn server with an X8DT3 mainboard. ESXi 5.1 with an integrated LSI CIM provider was installed on this server (the image is provided by Thomas Krenn in the download area). This also allows the status of the LSI 9260-4i RAID controller to be monitored. The most recent tests were carried out with a Supermicro X10 mainboard and a MegaRAID 9341-4i under ESXi 6.5.

Using the plugin requires Python and the pywbem library. On Debian/Ubuntu, pywbem can be installed with:

apt-get install python-pywbem
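
Before running the plugin it can help to confirm that the library is actually importable. The following is a minimal sketch, not part of the plugin: the helper name module_version is hypothetical, and it assumes that the installed pywbem exposes a __version__ attribute (the verbose output further down suggests the plugin reads a pywbem version as well).

```python
import importlib


def module_version(name):
    """Return the module's __version__, "unknown" if it has none,
    or None if the module cannot be imported at all."""
    try:
        module = importlib.import_module(name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


if __name__ == "__main__":
    version = module_version("pywbem")
    if version is None:
        print("pywbem is not installed")
    else:
        print("pywbem version:", version)
```

If the script reports that pywbem is not installed, the package has to be installed for the same Python interpreter that later runs the plugin.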

The plugin can then be tested on the command line.

The most important parameters of the plugin are:

  • -H ... IP address of the VMware ESXi server
  • -U ... username, or path to the username/password file (file:/path/to/.file)
  • -P ... password, or path to the password file (file:/path/to/.file)
  • -v ... verbose; shows all sensors that are queried
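
When the check is started from a script, the parameters listed above can be assembled into an argument list. The sketch below is only an illustration: the function name build_check_command and the default interpreter and plugin path are assumptions, and only the four options described above are used.

```python
def build_check_command(host, user, password, verbose=False,
                        interpreter="python",
                        plugin="check_esxi_hardware.py"):
    """Assemble the plugin call as an argument list.

    user and password may also be "file:/path/to/.file" references,
    as described in the parameter list.
    """
    argv = [interpreter, plugin, "-H", host, "-U", user, "-P", password]
    if verbose:
        argv.append("-v")  # list every queried sensor
    return argv


if __name__ == "__main__":
    print(build_check_command("10.X.X.X", "root", "file:/path/to/.file"))
```

Building a list instead of a single command string means the arguments can be handed to a process launcher without shell quoting, which matters when a password contains spaces or special characters.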

For testing we use the root user of the ESXi server. In a production environment, a dedicated user that is only permitted to read the sensors should be created on the vCenter Server.

The plugin can be invoked as follows:

python check_esxi_hardware.py -H 10.X.X.X -U root -P password
WARNING : Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)  WARNING : Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i) - \
Server: Supermicro X8DT3 s/n: 1234567890 System BIOS: 2.0a 2010-09-14
echo $?
1

In this case a warning is reported because the RAID controller battery (BBU) is not yet fully charged.
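
The exit status 1 in the example follows the usual Nagios plugin convention (0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN). The following sketch shows how a wrapper script could run a check and translate its exit status. The names nagios_state and run_check are hypothetical, and Python 3.7 or newer is assumed for subprocess.run with capture_output.

```python
import subprocess

# Standard Nagios plugin return codes.
NAGIOS_STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def nagios_state(returncode):
    """Map a plugin exit status to its Nagios state name.

    Any status outside 0..3 is reported as UNKNOWN.
    """
    return NAGIOS_STATES.get(returncode, "UNKNOWN")


def run_check(argv, timeout=60):
    """Run a check command given as an argument list and return
    (state name, first part of the plugin output)."""
    result = subprocess.run(argv, capture_output=True, text=True,
                            timeout=timeout)
    return nagios_state(result.returncode), result.stdout.strip()


if __name__ == "__main__":
    # Placeholder call; replace host and credentials with real values.
    state, output = run_check(["python", "check_esxi_hardware.py",
                               "-H", "10.X.X.X", "-U", "root",
                               "-P", "file:/path/to/.file"])
    print(state, output)
```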

The plugin's website recommends supplying the password in a file. That way the password does not show up in the process list while the check is running. There are two variants.

  • store only the password in the file
    • python check_esxi_hardware.py -H 10.X.X.X -U root -P file:/path/to/.file
  • store the username and password in the file, separated by a space
    • python check_esxi_hardware.py -H 10.X.X.X -U file:/path/to/.file -P file:/path/to/.file
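
A credentials file only helps if other local users cannot read it. The sketch below creates such a file with mode 0600 on a POSIX system. The helper name write_secret_file is hypothetical, and writing the content without a trailing newline is an assumption about what the plugin accepts, so check it against the plugin's documentation.

```python
import os


def write_secret_file(path, content):
    """Write content to path, readable and writable by the owner only,
    and return the matching "file:" reference for -U / -P."""
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
    with os.fdopen(fd, "w") as handle:
        # Enforce the mode even if the file already existed
        # with looser permissions.
        os.fchmod(handle.fileno(), 0o600)
        handle.write(content)
    return "file:" + path


if __name__ == "__main__":
    # Variant 1: password only. Variant 2 would be "username password".
    print(write_secret_file(os.path.expanduser("~/.esxi_pass"), "password"))
```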

The "-v" option is also useful. It displays all queried sensors together with their status codes. Example with ESXi 5.1:

python check_esxi_hardware.py -H 10.1.102.143 -U tkmon -P relation -v
20130430 09:29:33 Connection to https://10.1.102.143
20130430 09:29:33 Check classe OMC_SMASHFirmwareIdentity
20130430 09:29:33   Element Name = System BIOS
20130430 09:29:33     VersionString = 2.0a
20130430 09:29:33 Check classe CIM_Chassis
20130430 09:29:33   Element Name = Chassis
20130430 09:29:33     Manufacturer = Supermicro
20130430 09:29:33     SerialNumber = 1234567890
20130430 09:29:33     Model = X8DT3
20130430 09:29:33     Element Op Status = 0
20130430 09:29:33 Check classe CIM_Card
20130430 09:29:34   Element Name = Motherboard
20130430 09:29:34     Element Op Status = 0
20130430 09:29:34 Check classe CIM_ComputerSystem
20130430 09:29:34   Element Name = System Board 7:1
20130430 09:29:34     Element Op Status = 0
20130430 09:29:34   Element Name = localhost
20130430 09:29:34   Element Name = Hardware Management Controller (Node 0)
20130430 09:29:34     Element Op Status = 0
20130430 09:29:34   Element Name = Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)
20130430 09:29:34     Element Op Status = 3
20130430 09:29:34 GLobal exit set to WARNING
20130430 09:29:34 Check classe CIM_NumericSensor
20130430 09:29:35   Element Name = Memory Device 12 P2-DIMM3B Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 42.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 80.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Memory Device 11 P2-DIMM3A Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 45.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 80.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Memory Device 10 P2-DIMM2B Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 39.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 80.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Memory Device 9 P2-DIMM2A Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 41.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 80.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Memory Device 8 P2-DIMM1B Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 39.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 80.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Memory Device 7 P2-DIMM1A Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 39.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 80.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Fan Device 8 Fan8
20130430 09:29:35     sensorType = 5 - Tachometer
20130430 09:29:35     BaseUnits = 19
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 1890.000000
20130430 09:29:35     Lower Threshold Non Critical = 675.000000
20130430 09:29:35     Upper Threshold Non Critical = 34155.000000
20130430 09:29:35     Lower Threshold Critical = 540.000000
20130430 09:29:35     Upper Threshold Critical = 34290.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Fan Device 7 Fan7
20130430 09:29:35     sensorType = 5 - Tachometer
20130430 09:29:35     BaseUnits = 19
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 1890.000000
20130430 09:29:35     Lower Threshold Non Critical = 675.000000
20130430 09:29:35     Upper Threshold Non Critical = 34155.000000
20130430 09:29:35     Lower Threshold Critical = 540.000000
20130430 09:29:35     Upper Threshold Critical = 34290.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Fan Device 5 Fan5
20130430 09:29:35     sensorType = 5 - Tachometer
20130430 09:29:35     BaseUnits = 19
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 945.000000
20130430 09:29:35     Lower Threshold Non Critical = 675.000000
20130430 09:29:35     Upper Threshold Non Critical = 34155.000000
20130430 09:29:35     Lower Threshold Critical = 540.000000
20130430 09:29:35     Upper Threshold Critical = 34290.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Fan Device 2 Fan2
20130430 09:29:35     sensorType = 5 - Tachometer
20130430 09:29:35     BaseUnits = 19
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 1080.000000
20130430 09:29:35     Lower Threshold Non Critical = 675.000000
20130430 09:29:35     Upper Threshold Non Critical = 34155.000000
20130430 09:29:35     Lower Threshold Critical = 540.000000
20130430 09:29:35     Upper Threshold Critical = 34290.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = Fan Device 1 Fan1
20130430 09:29:35     sensorType = 5 - Tachometer
20130430 09:29:35     BaseUnits = 19
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 945.000000
20130430 09:29:35     Lower Threshold Non Critical = 675.000000
20130430 09:29:35     Upper Threshold Non Critical = 34155.000000
20130430 09:29:35     Lower Threshold Critical = 540.000000
20130430 09:29:35     Upper Threshold Critical = 34290.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 VBAT
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 3.240000
20130430 09:29:35     Lower Threshold Non Critical = 2.920000
20130430 09:29:35     Upper Threshold Non Critical = 3.640000
20130430 09:29:35     Lower Threshold Critical = 2.900000
20130430 09:29:35     Upper Threshold Critical = 3.670000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 +12V
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 12.080000
20130430 09:29:35     Lower Threshold Non Critical = 10.700000
20130430 09:29:35     Upper Threshold Non Critical = 13.250000
20130430 09:29:35     Lower Threshold Critical = 10.650000
20130430 09:29:35     Upper Threshold Critical = 13.300000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 +5V
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 5.020000
20130430 09:29:35     Lower Threshold Non Critical = 4.480000
20130430 09:29:35     Upper Threshold Non Critical = 5.530000
20130430 09:29:35     Lower Threshold Critical = 4.440000
20130430 09:29:35     Upper Threshold Critical = 5.560000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 +3.3VSB
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 3.240000
20130430 09:29:35     Lower Threshold Non Critical = 2.920000
20130430 09:29:35     Upper Threshold Non Critical = 3.640000
20130430 09:29:35     Lower Threshold Critical = 2.900000
20130430 09:29:35     Upper Threshold Critical = 3.670000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 +3.3V
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 3.280000
20130430 09:29:35     Lower Threshold Non Critical = 2.920000
20130430 09:29:35     Upper Threshold Non Critical = 3.640000
20130430 09:29:35     Lower Threshold Critical = 2.900000
20130430 09:29:35     Upper Threshold Critical = 3.670000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 +1.5V
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 1.520000
20130430 09:29:35     Lower Threshold Non Critical = 1.330000
20130430 09:29:35     Upper Threshold Non Critical = 1.650000
20130430 09:29:35     Lower Threshold Critical = 1.320000
20130430 09:29:35     Upper Threshold Critical = 1.660000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 CPU2 DIMM
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 1.580000
20130430 09:29:35     Lower Threshold Non Critical = 1.190000
20130430 09:29:35     Upper Threshold Non Critical = 1.640000
20130430 09:29:35     Lower Threshold Critical = 1.190000
20130430 09:29:35     Upper Threshold Critical = 1.650000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 CPU2 Vcore
20130430 09:29:35     sensorType = 3 - Voltage
20130430 09:29:35     BaseUnits = 5
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 1.040000
20130430 09:29:35     Lower Threshold Non Critical = 0.820000
20130430 09:29:35     Upper Threshold Non Critical = 1.350000
20130430 09:29:35     Lower Threshold Critical = 0.810000
20130430 09:29:35     Upper Threshold Critical = 1.360000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35   Element Name = System Board 1 System Temp
20130430 09:29:35     sensorType = 2 - Temperature
20130430 09:29:35     BaseUnits = 2
20130430 09:29:35     Scaled by = 0.010000 
20130430 09:29:35     Current Reading = 36.000000
20130430 09:29:35     Lower Threshold Non Critical = -5.000000
20130430 09:29:35     Upper Threshold Non Critical = 75.000000
20130430 09:29:35     Lower Threshold Critical = -7.000000
20130430 09:29:35     Upper Threshold Critical = 77.000000
20130430 09:29:35     Element Op Status = 2
20130430 09:29:35 Check classe CIM_Memory
20130430 09:29:35   Element Name = CPU 2 Level-1 Cache
20130430 09:29:35     Element Op Status = 0
20130430 09:29:35   Element Name = CPU 2 Level-2 Cache
20130430 09:29:35     Element Op Status = 0
20130430 09:29:35   Element Name = CPU 2 Level-3 Cache
20130430 09:29:35     Element Op Status = 0
20130430 09:29:35   Element Name = Memory
20130430 09:29:35 Check classe CIM_Processor
20130430 09:29:36   Element Name = CPU 2
20130430 09:29:36     Family = 179
20130430 09:29:36     CurrentClockSpeed = 1866MHz
20130430 09:29:36     Element Op Status = 2
20130430 09:29:36 Check classe CIM_RecordLog
20130430 09:29:36 Check classe OMC_DiscreteSensor
20130430 09:29:36   Element Name = Power Supply 1 PS Status: Failure status
20130430 09:29:36     Element Op Status = 2
20130430 09:29:36   Element Name = System Chassis 1 Intrusion: General Chassis intrusion
20130430 09:29:36     Element Op Status = 2
20130430 09:29:36   Element Name = Processor 2 CPU2 Temp
20130430 09:29:36 Check classe OMC_Fan
20130430 09:29:37   Element Name = Fan8
20130430 09:29:37     Element Op Status = 2
20130430 09:29:37   Element Name = Fan7
20130430 09:29:37     Element Op Status = 2
20130430 09:29:37   Element Name = Fan5
20130430 09:29:37     Element Op Status = 2
20130430 09:29:37   Element Name = Fan2
20130430 09:29:37     Element Op Status = 2
20130430 09:29:37   Element Name = Fan1
20130430 09:29:37     Element Op Status = 2
20130430 09:29:37 Check classe OMC_PowerSupply
20130430 09:29:37   Element Name = Power Supply 1
20130430 09:29:37     Element Op Status = 2
20130430 09:29:37 Check classe VMware_StorageExtent
20130430 09:29:38   Element Name = Drive 252_5 on controller 500605B00418BB20 Fw: n/a - UNCONFIGURED GOOD
20130430 09:29:38     Element Op Status = 2
20130430 09:29:38   Element Name = Drive 252_4 on controller 500605B00418BB20 Fw: n/a - UNCONFIGURED GOOD
20130430 09:29:38     Element Op Status = 2
20130430 09:29:38 Check classe VMware_Controller
20130430 09:29:38   Element Name = Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)
20130430 09:29:38     Element Op Status = 3
20130430 09:29:38 GLobal exit set to WARNING
20130430 09:29:38 Check classe VMware_StorageVolume
20130430 09:29:39   Element Name = RAID 1 StorageVolume Logical Volume 500605B00418BB20_0 on controller 500605B00418BB20, Drives( - OPTIMAL
20130430 09:29:39     Element Op Status = 2
20130430 09:29:39 Check classe VMware_Battery
20130430 09:29:39   Element Name = Battery 934 on Controller 500605B00418BB20
20130430 09:29:39     Element Op Status = 11
20130430 09:29:39 Check classe VMware_SASSATAPort
20130430 09:29:39   Element Name = Port 0 on Controller 500605B00418BB20
20130430 09:29:39     Element Op Status = 2
20130430 09:29:39   Element Name = Port 1 on Controller 500605B00418BB20
20130430 09:29:39     Element Op Status = 2
 WARNING : Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)  WARNING : Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i) -\
 Server: Supermicro X8DT3 s/n: 1234567890 System BIOS: 2.0a 2010-09-14
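
The verbose listing is long, so it can be useful to filter it down to the elements whose status is not OK. The sketch below parses the "Element Name" and "Element Op Status" lines shown above. The status names are a subset of the DMTF CIM OperationalStatus values. Treating only 0 (Unknown) and 2 (OK) as unremarkable is an assumption of this sketch and not necessarily the rule the plugin applies: in the example above, the battery with status 11 does not raise the plugin's global exit status. The function name abnormal_elements is hypothetical.

```python
import re

ELEMENT_RE = re.compile(r"Element Name = (.*)$")
STATUS_RE = re.compile(r"Element Op Status = (\d+)\s*$")

# Subset of the DMTF CIM OperationalStatus value map.
OP_STATUS = {0: "Unknown", 2: "OK", 3: "Degraded", 5: "Predictive Failure",
             6: "Error", 11: "In Service"}


def abnormal_elements(verbose_output, ignore=(0, 2)):
    """Return (element name, status code, status name) for every element
    whose operational status is not listed in ignore."""
    findings = []
    current = None
    for line in verbose_output.splitlines():
        match = ELEMENT_RE.search(line)
        if match:
            # A new element starts; an element without a status line
            # is simply replaced by the next one.
            current = match.group(1).strip()
            continue
        match = STATUS_RE.search(line)
        if match and current is not None:
            code = int(match.group(1))
            if code not in ignore:
                findings.append((current, code, OP_STATUS.get(code, "other")))
            current = None
    return findings
```

Applied to the ESXi 5.1 output above, this sketch would report the controller (status 3, Degraded) in both classes where it appears, as well as the battery (status 11, In Service).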

And an example with ESXi 6.5:

tkmon@tkmon:~$ /usr/lib/nagios/plugins/check_esxi_hardware.py --host=10.2.1.169 --user=root --pass=********** --verbose
20170823 22:13:15 Connection to https://10.2.1.169
20170823 22:13:15 Found pywbem version 0.8.0-dev
20170823 22:13:15 Check classe OMC_SMASHFirmwareIdentity
20170823 22:13:15   Element Name = System BIOS
20170823 22:13:15     VersionString = 2.0a
20170823 22:13:15 Check classe CIM_Chassis
20170823 22:13:15   Element Name = Chassis
20170823 22:13:15     Manufacturer = Supermicro
20170823 22:13:15     SerialNumber = 0123456789
20170823 22:13:15     Model = Super Server
20170823 22:13:15     Element Op Status = 0
20170823 22:13:15 Check classe CIM_Card
20170823 22:13:16   Element Name = Motherboard
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16 Check classe CIM_ComputerSystem
20170823 22:13:16   Element Name = System Board 7:1
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:2
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:3
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:12
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:15
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:17
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:18
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:19
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:20
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:21
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:32
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = System Board 7:33
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = localhost.intern.thomas-krenn.com
20170823 22:13:16   Element Name = Hardware Management Controller (Node 0)
20170823 22:13:16     Element Op Status = 0
20170823 22:13:16   Element Name = Controller 500605B00CDBC930 (LSI MegaRAID SAS 9341-4i)
20170823 22:13:16     Element Op Status = 2
20170823 22:13:16 Check classe CIM_NumericSensor
20170823 22:13:18   Element Name = System Board 12 1.05V PCH
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.050000
20170823 22:13:18     Lower Threshold Non Critical = 0.940000
20170823 22:13:18     Upper Threshold Non Critical = 1.190000
20170823 22:13:18     Lower Threshold Critical = 0.890000
20170823 22:13:18     Upper Threshold Critical = 1.220000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 21 1.2V BMC
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.210000
20170823 22:13:18     Lower Threshold Non Critical = 1.090000
20170823 22:13:18     Upper Threshold Non Critical = 1.340000
20170823 22:13:18     Lower Threshold Critical = 1.040000
20170823 22:13:18     Upper Threshold Critical = 1.370000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 20 1.5V PCH
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.500000
20170823 22:13:18     Lower Threshold Non Critical = 1.400000
20170823 22:13:18     Upper Threshold Non Critical = 1.640000
20170823 22:13:18     Lower Threshold Critical = 1.340000
20170823 22:13:18     Upper Threshold Critical = 1.670000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 19 3.3VSB
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 3.240000
20170823 22:13:18     Lower Threshold Non Critical = 2.950000
20170823 22:13:18     Upper Threshold Non Critical = 3.550000
20170823 22:13:18     Lower Threshold Critical = 2.820000
20170823 22:13:18     Upper Threshold Critical = 3.650000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 15 5VSB
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 4.920000
20170823 22:13:18     Lower Threshold Non Critical = 4.480000
20170823 22:13:18     Upper Threshold Non Critical = 5.390000
20170823 22:13:18     Lower Threshold Critical = 4.290000
20170823 22:13:18     Upper Threshold Critical = 5.540000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 4 VDIMMGH
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.200000
20170823 22:13:18     Lower Threshold Non Critical = 1.040000
20170823 22:13:18     Upper Threshold Non Critical = 1.340000
20170823 22:13:18     Lower Threshold Critical = 0.970000
20170823 22:13:18     Upper Threshold Critical = 1.420000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 3 VDIMMEF
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.200000
20170823 22:13:18     Lower Threshold Non Critical = 1.040000
20170823 22:13:18     Upper Threshold Non Critical = 1.340000
20170823 22:13:18     Lower Threshold Critical = 0.970000
20170823 22:13:18     Upper Threshold Critical = 1.420000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 2 VDIMMCD
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.200000
20170823 22:13:18     Lower Threshold Non Critical = 1.040000
20170823 22:13:18     Upper Threshold Non Critical = 1.340000
20170823 22:13:18     Lower Threshold Critical = 0.970000
20170823 22:13:18     Upper Threshold Critical = 1.420000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 1 VDIMMAB
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.200000
20170823 22:13:18     Lower Threshold Non Critical = 1.040000
20170823 22:13:18     Upper Threshold Non Critical = 1.340000
20170823 22:13:18     Lower Threshold Critical = 0.970000
20170823 22:13:18     Upper Threshold Critical = 1.420000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Processor 4 Vcpu2
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.800000
20170823 22:13:18     Lower Threshold Non Critical = 1.390000
20170823 22:13:18     Upper Threshold Non Critical = 1.890000
20170823 22:13:18     Lower Threshold Critical = 1.260000
20170823 22:13:18     Upper Threshold Critical = 2.080000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Processor 3 Vcpu1
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 1.800000
20170823 22:13:18     Lower Threshold Non Critical = 1.390000
20170823 22:13:18     Upper Threshold Non Critical = 1.890000
20170823 22:13:18     Lower Threshold Critical = 1.260000
20170823 22:13:18     Upper Threshold Critical = 2.080000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 18 VBAT
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 2.790000
20170823 22:13:18     Lower Threshold Non Critical = 2.500000
20170823 22:13:18     Upper Threshold Non Critical = 3.670000
20170823 22:13:18     Lower Threshold Critical = 2.430000
20170823 22:13:18     Upper Threshold Critical = 3.780000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 32 3.3VCC
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 3.350000
20170823 22:13:18     Lower Threshold Non Critical = 2.950000
20170823 22:13:18     Upper Threshold Non Critical = 3.550000
20170823 22:13:18     Lower Threshold Critical = 2.820000
20170823 22:13:18     Upper Threshold Critical = 3.650000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 33 5VCC
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 5.000000
20170823 22:13:18     Lower Threshold Non Critical = 4.480000
20170823 22:13:18     Upper Threshold Non Critical = 5.390000
20170823 22:13:18     Lower Threshold Critical = 4.290000
20170823 22:13:18     Upper Threshold Critical = 5.540000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 17 12V
20170823 22:13:18     sensorType = 3 - Voltage
20170823 22:13:18     BaseUnits = 5
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 12.120000
20170823 22:13:18     Lower Threshold Non Critical = 10.740000
20170823 22:13:18     Upper Threshold Non Critical = 12.940000
20170823 22:13:18     Lower Threshold Critical = 10.290000
20170823 22:13:18     Upper Threshold Critical = 13.260000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Fan Device 5 FAN5
20170823 22:13:18     sensorType = 5 - Tachometer
20170823 22:13:18     BaseUnits = 19
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 3000.000000
20170823 22:13:18     Lower Threshold Non Critical = 700.000000
20170823 22:13:18     Upper Threshold Non Critical = 25300.000000
20170823 22:13:18     Lower Threshold Critical = 500.000000
20170823 22:13:18     Upper Threshold Critical = 25400.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Fan Device 2 FAN2
20170823 22:13:18     sensorType = 5 - Tachometer
20170823 22:13:18     BaseUnits = 19
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 2300.000000
20170823 22:13:18     Lower Threshold Non Critical = 700.000000
20170823 22:13:18     Upper Threshold Non Critical = 25300.000000
20170823 22:13:18     Lower Threshold Critical = 500.000000
20170823 22:13:18     Upper Threshold Critical = 25400.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 84 P2-DIMMF1 Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 34.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 80.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 85.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 80 P2-DIMME1 Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 36.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 80.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 85.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 68 P1-DIMMB1 Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 37.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 80.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 85.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Device 64 P1-DIMMA1 Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 39.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 80.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 85.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Module 6 VmemGHVRM Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 36.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 95.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 100.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Module 5 VmemEFVRM Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 44.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 95.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 100.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Module 4 VmemCDVRM Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 38.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 95.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 100.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Module 3 VmemABVRM Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 39.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 95.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 100.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Module 2 Vcpu2VRM Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 44.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 95.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 100.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Memory Module 1 Vcpu1VRM Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 49.000000
20170823 22:13:18     Lower Threshold Non Critical = 5.000000
20170823 22:13:18     Upper Threshold Non Critical = 95.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 100.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 2 Peripheral Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 48.000000
20170823 22:13:18     Lower Threshold Non Critical = 0.000000
20170823 22:13:18     Upper Threshold Non Critical = 80.000000
20170823 22:13:18     Lower Threshold Critical = -5.000000
20170823 22:13:18     Upper Threshold Critical = 85.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 1 System Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 35.000000
20170823 22:13:18     Lower Threshold Non Critical = 0.000000
20170823 22:13:18     Upper Threshold Non Critical = 80.000000
20170823 22:13:18     Lower Threshold Critical = -5.000000
20170823 22:13:18     Upper Threshold Critical = 85.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = System Board 3 PCH Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 50.000000
20170823 22:13:18     Lower Threshold Non Critical = 16.000000
20170823 22:13:18     Upper Threshold Non Critical = 90.000000
20170823 22:13:18     Lower Threshold Critical = 5.000000
20170823 22:13:18     Upper Threshold Critical = 95.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Processor 2 CPU2 Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 39.000000
20170823 22:13:18     Lower Threshold Non Critical = 0.000000
20170823 22:13:18     Upper Threshold Non Critical = 85.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 90.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = Processor 1 CPU1 Temp
20170823 22:13:18     sensorType = 2 - Temperature
20170823 22:13:18     BaseUnits = 2
20170823 22:13:18     Scaled by = 0.010000
20170823 22:13:18     Current Reading = 37.000000
20170823 22:13:18     Lower Threshold Non Critical = 0.000000
20170823 22:13:18     Upper Threshold Non Critical = 85.000000
20170823 22:13:18     Lower Threshold Critical = 0.000000
20170823 22:13:18     Upper Threshold Critical = 90.000000
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18 Check classe CIM_Memory
20170823 22:13:18   Element Name = CPU1 Level-1 Cache
20170823 22:13:18     Element Op Status = 0
20170823 22:13:18   Element Name = CPU1 Level-2 Cache
20170823 22:13:18     Element Op Status = 0
20170823 22:13:18   Element Name = CPU1 Level-3 Cache
20170823 22:13:18     Element Op Status = 0
20170823 22:13:18   Element Name = CPU2 Level-1 Cache
20170823 22:13:18     Element Op Status = 0
20170823 22:13:18   Element Name = CPU2 Level-2 Cache
20170823 22:13:18     Element Op Status = 0
20170823 22:13:18   Element Name = CPU2 Level-3 Cache
20170823 22:13:18     Element Op Status = 0
20170823 22:13:18   Element Name = Memory
20170823 22:13:18 Check classe CIM_Processor
20170823 22:13:18   Element Name = CPU1
20170823 22:13:18     Family = 179
20170823 22:13:18     CurrentClockSpeed = 1700MHz
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18   Element Name = CPU2
20170823 22:13:18     Family = 179
20170823 22:13:18     CurrentClockSpeed = 1700MHz
20170823 22:13:18     Element Op Status = 2
20170823 22:13:18 Check classe CIM_RecordLog
20170823 22:13:19   Element Name = IPMI SEL
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19 Check classe OMC_DiscreteSensor
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: General Chassis intrusion
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Drive Bay intrusion
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: I/O Card area intrusion
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Processor area intrusion
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: System unplugged from LAN
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unauthorized dock
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: FAN area intrusion
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = System Chassis 1 Chassis Intru: Unknown
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19 Check classe OMC_Fan
20170823 22:13:19   Element Name = FAN5
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19   Element Name = FAN2
20170823 22:13:19     Element Op Status = 2
20170823 22:13:19 Check classe OMC_PowerSupply
20170823 22:13:20 Check classe VMware_StorageExtent
20170823 22:13:20   Element Name = Drive 62_4 on controller 500605B00CDBC930 Fw: 1V02 - ONLINE
20170823 22:13:20     Element Op Status = 2
20170823 22:13:20   Element Name = Drive 62_5 on controller 500605B00CDBC930 Fw: 1V02 - ONLINE
20170823 22:13:20     Element Op Status = 2
20170823 22:13:20 Check classe VMware_Controller
20170823 22:13:20   Element Name = Controller 500605B00CDBC930 (LSI MegaRAID SAS 9341-4i)
20170823 22:13:20     Element Op Status = 2
20170823 22:13:20 Check classe VMware_StorageVolume
20170823 22:13:21   Element Name = RAID 1 Data Logical Volume 500605B00CDBC930_0 on controller 500605B00CDBC930, Drives( - OPTIMAL
20170823 22:13:21     Element Op Status = 2
20170823 22:13:21 Check classe VMware_Battery
20170823 22:13:21 Check classe VMware_SASSATAPort
20170823 22:13:21   Element Name = Port 0 on Controller 500605B00CDBC930
20170823 22:13:21     Element Op Status = 2
20170823 22:13:21   Element Name = Port 1 on Controller 500605B00CDBC930
20170823 22:13:21     Element Op Status = 2
OK - Server: Supermicro Super Server s/n: 0123456789 System BIOS: 2.0a 2016-08-25
tkmon@tkmon:~$

Example output under ESXi 6.7. No MegaRAID controller data is currently reported here. The available SMIS provider does not yet work correctly under ESXi 6.7:

tkmon@tkmon:~$ /usr/lib/nagios/plugins/check_esxi_hardware.py --host=10.1.102.52 --user=root --pass=Relation1234$ --verbose
20180731 14:37:40 Connection to https://10.1.102.52
20180731 14:37:40 Found pywbem version 0.8.0-dev
20180731 14:37:40 Check classe OMC_SMASHFirmwareIdentity
20180731 14:37:40   Element Name = System BIOS
20180731 14:37:40     VersionString = 2.0a
20180731 14:37:40 Check classe CIM_Chassis
20180731 14:37:41   Element Name = Chassis
20180731 14:37:41     Manufacturer = Supermicro
20180731 14:37:41     SerialNumber = 0123456789
20180731 14:37:41     Model = X9SCL-II/X9SCM-II
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41 Check classe CIM_Card
20180731 14:37:41   Element Name = Motherboard
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41 Check classe CIM_ComputerSystem
20180731 14:37:41   Element Name = System Board 7:1
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:2
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:17
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:18
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:32
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:33
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:34
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:35
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = System Board 7:36
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41   Element Name = localhost.tdz.thomas-krenn.com
20180731 14:37:41   Element Name = Hardware Management Controller (Node 0)
20180731 14:37:41     Element Op Status = 0
20180731 14:37:41 Check classe CIM_NumericSensor
20180731 14:37:42   Element Name = System Board 36 AVCC
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 3.370000
20180731 14:37:42     Lower Threshold Non Critical = 2.940000
20180731 14:37:42     Upper Threshold Non Critical = 3.580000
20180731 14:37:42     Lower Threshold Critical = 2.880000
20180731 14:37:42     Upper Threshold Critical = 3.640000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 35 VSB
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 3.310000
20180731 14:37:42     Lower Threshold Non Critical = 2.940000
20180731 14:37:42     Upper Threshold Non Critical = 3.580000
20180731 14:37:42     Lower Threshold Critical = 2.880000
20180731 14:37:42     Upper Threshold Critical = 3.640000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 18 VBAT
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 3.050000
20180731 14:37:42     Lower Threshold Non Critical = 2.940000
20180731 14:37:42     Upper Threshold Non Critical = 3.580000
20180731 14:37:42     Lower Threshold Critical = 2.880000
20180731 14:37:42     Upper Threshold Critical = 3.640000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 34 -12V
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = -12.090000
20180731 14:37:42     Lower Threshold Non Critical = -13.450000
20180731 14:37:42     Upper Threshold Non Critical = -10.930000
20180731 14:37:42     Lower Threshold Critical = -13.650000
20180731 14:37:42     Upper Threshold Critical = -10.740000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 33 5VCC
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 5.020000
20180731 14:37:42     Lower Threshold Non Critical = 4.570000
20180731 14:37:42     Upper Threshold Non Critical = 5.340000
20180731 14:37:42     Lower Threshold Critical = 4.320000
20180731 14:37:42     Upper Threshold Critical = 5.600000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = Memory Device 1 VDIMM
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 1.500000
20180731 14:37:42     Lower Threshold Non Critical = 1.280000
20180731 14:37:42     Upper Threshold Non Critical = 1.760000
20180731 14:37:42     Lower Threshold Critical = 1.210000
20180731 14:37:42     Upper Threshold Critical = 1.770000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 17 12V
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 12.080000
20180731 14:37:42     Lower Threshold Non Critical = 10.700000
20180731 14:37:42     Upper Threshold Non Critical = 13.090000
20180731 14:37:42     Lower Threshold Critical = 10.600000
20180731 14:37:42     Upper Threshold Critical = 13.190000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 32 3.3VCC
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 3.370000
20180731 14:37:42     Lower Threshold Non Critical = 2.940000
20180731 14:37:42     Upper Threshold Non Critical = 3.580000
20180731 14:37:42     Lower Threshold Critical = 2.880000
20180731 14:37:42     Upper Threshold Critical = 3.640000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = Processor 2 Vcore
20180731 14:37:42     sensorType = 3 - Voltage
20180731 14:37:42     BaseUnits = 5
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 0.800000
20180731 14:37:42     Lower Threshold Non Critical = 0.540000
20180731 14:37:42     Upper Threshold Non Critical = 1.480000
20180731 14:37:42     Lower Threshold Critical = 0.510000
20180731 14:37:42     Upper Threshold Critical = 1.520000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = Fan Device 5 FAN A
20180731 14:37:42     sensorType = 5 - Tachometer
20180731 14:37:42     BaseUnits = 19
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 2700.000000
20180731 14:37:42     Lower Threshold Non Critical = 600.000000
20180731 14:37:42     Upper Threshold Non Critical = 18975.000000
20180731 14:37:42     Lower Threshold Critical = 450.000000
20180731 14:37:42     Upper Threshold Critical = 19050.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = Fan Device 4 FAN 4
20180731 14:37:42     sensorType = 5 - Tachometer
20180731 14:37:42     BaseUnits = 19
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 1050.000000
20180731 14:37:42     Lower Threshold Non Critical = 600.000000
20180731 14:37:42     Upper Threshold Non Critical = 18975.000000
20180731 14:37:42     Lower Threshold Critical = 450.000000
20180731 14:37:42     Upper Threshold Critical = 19050.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = Fan Device 3 FAN 3
20180731 14:37:42     sensorType = 5 - Tachometer
20180731 14:37:42     BaseUnits = 19
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 2775.000000
20180731 14:37:42     Lower Threshold Non Critical = 600.000000
20180731 14:37:42     Upper Threshold Non Critical = 18975.000000
20180731 14:37:42     Lower Threshold Critical = 450.000000
20180731 14:37:42     Upper Threshold Critical = 19050.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = Fan Device 1 FAN 1
20180731 14:37:42     sensorType = 5 - Tachometer
20180731 14:37:42     BaseUnits = 19
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 2700.000000
20180731 14:37:42     Lower Threshold Non Critical = 600.000000
20180731 14:37:42     Upper Threshold Non Critical = 18975.000000
20180731 14:37:42     Lower Threshold Critical = 450.000000
20180731 14:37:42     Upper Threshold Critical = 19050.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 2 Peripheral Temp
20180731 14:37:42     sensorType = 2 - Temperature
20180731 14:37:42     BaseUnits = 2
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 37.000000
20180731 14:37:42     Lower Threshold Non Critical = -5.000000
20180731 14:37:42     Upper Threshold Non Critical = 80.000000
20180731 14:37:42     Lower Threshold Critical = -7.000000
20180731 14:37:42     Upper Threshold Critical = 85.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42   Element Name = System Board 1 System Temp
20180731 14:37:42     sensorType = 2 - Temperature
20180731 14:37:42     BaseUnits = 2
20180731 14:37:42     Scaled by = 0.010000 
20180731 14:37:42     Current Reading = 31.000000
20180731 14:37:42     Lower Threshold Non Critical = -5.000000
20180731 14:37:42     Upper Threshold Non Critical = 80.000000
20180731 14:37:42     Lower Threshold Critical = -7.000000
20180731 14:37:42     Upper Threshold Critical = 85.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:42 Check classe CIM_Memory
20180731 14:37:43   Element Name = CPU Level-1 Cache
20180731 14:37:43     Element Op Status = 0
20180731 14:37:43   Element Name = CPU Level-2 Cache
20180731 14:37:43     Element Op Status = 0
20180731 14:37:43   Element Name = CPU Level-3 Cache
20180731 14:37:43     Element Op Status = 0
20180731 14:37:43   Element Name = Memory
20180731 14:37:43 Check classe CIM_Processor
20180731 14:37:43   Element Name = CPU
20180731 14:37:43     Family = 179
20180731 14:37:43     CurrentClockSpeed = 3200MHz
20180731 14:37:43     Element Op Status = 2
20180731 14:37:43 Check classe CIM_RecordLog
20180731 14:37:44   Element Name = IPMI SEL
20180731 14:37:44     Element Op Status = 2
20180731 14:37:44 Check classe OMC_DiscreteSensor
20180731 14:37:44   Element Name = System Chassis 1 Chassis Intru: Unknown
20180731 14:37:44     Element Op Status = 2
20180731 14:37:44   Element Name = Processor 1 CPU Temp
20180731 14:37:44   Element Name = Power Supply 2 PS2 Status: Presence detected
20180731 14:37:44 Check classe OMC_Fan
20180731 14:37:45   Element Name = FAN A
20180731 14:37:45     Element Op Status = 2
20180731 14:37:45   Element Name = FAN 4
20180731 14:37:45     Element Op Status = 2
20180731 14:37:45   Element Name = FAN 3
20180731 14:37:45     Element Op Status = 2
20180731 14:37:45   Element Name = FAN 1
20180731 14:37:45     Element Op Status = 2
20180731 14:37:45 Check classe OMC_PowerSupply
20180731 14:37:45   Element Name = Power Supply 2
20180731 14:37:45     Element Op Status = 2
20180731 14:37:45 Check classe VMware_StorageExtent
20180731 14:37:45 Check classe VMware_Controller
20180731 14:37:46 Check classe VMware_StorageVolume
20180731 14:37:46 Check classe VMware_Battery
20180731 14:37:46 Check classe VMware_SASSATAPort
OK - Server: Supermicro X9SCL-II/X9SCM-II s/n: 0123456789 System BIOS: 2.0a 2012-09-17
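
The verbose output above pairs each "Element Name" with an "Element Op Status" value. As a rough sketch of how such output could be summarized, the following Python snippet groups element names by status. The helper `summarize_verbose_output` is hypothetical and not part of check_esxi_hardware.py. It assumes the CIM OperationalStatus meanings 0 = Unknown and 2 = OK, and it relies on the observation that elements with status 0 did not prevent an overall OK result in the sample output above.

```python
import re

# CIM OperationalStatus values seen in the output above: 0 = Unknown, 2 = OK.
OK_STATUS = 2
UNKNOWN_STATUS = 0


# Hypothetical helper for illustration; not part of check_esxi_hardware.py.
def summarize_verbose_output(text):
    """Group element names from --verbose output by their 'Element Op Status'."""
    name_re = re.compile(r"Element Name = (.*)$")
    status_re = re.compile(r"Element Op Status = (\d+)\s*$")
    summary = {}
    current = None
    for line in text.splitlines():
        name_match = name_re.search(line)
        if name_match:
            # Remember the element until its status line (if any) follows.
            current = name_match.group(1).strip()
            continue
        status_match = status_re.search(line)
        if status_match and current is not None:
            summary.setdefault(int(status_match.group(1)), []).append(current)
            current = None
    return summary


# Shortened excerpt of the ESXi 6.7 output shown above.
sample = """\
20180731 14:37:42   Element Name = System Board 1 System Temp
20180731 14:37:42     Current Reading = 31.000000
20180731 14:37:42     Element Op Status = 2
20180731 14:37:43   Element Name = CPU Level-1 Cache
20180731 14:37:43     Element Op Status = 0
20180731 14:37:43   Element Name = Memory
"""

summary = summarize_verbose_output(sample)
# Anything that is neither OK nor Unknown deserves a closer look.
not_ok = {s: names for s, names in summary.items()
          if s not in (OK_STATUS, UNKNOWN_STATUS)}
print(summary)
print(not_ok)
```

Elements without a status line, such as "Memory" in the excerpt, are simply skipped in this sketch.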


Workaround for restricted user privileges

Create a new user with the user name monitoring in the vSphere Client.
The user appears in the user list.

CIM queries on ESXi require root privileges.[1]

To nevertheless use a dedicated user with SSH access and vCenter access disabled, perform the following steps:

  1. Create a new user (e.g. named monitoring) in the vSphere Client.
  2. Connect to the ESXi system via SSH as the root user.
  3. Add the new user to the root group. To do so, edit the file /etc/group (using vi):
    root:x:0:root,monitoring
  4. Set /sbin/nologin as the login shell for the new user. This blocks SSH access for this user. Edit the file /etc/passwd and set the entry as follows:
    monitoring:x:1000:1000:ESXi User:/:/sbin/nologin

In our tests, this procedure resulted in the sensors being readable, while SSH login and access via the vSphere Client were not possible (tested with ESXi 6.0).

Please note, however, that this procedure is not officially supported by VMware.
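
The two file edits from steps 3 and 4 can be illustrated with a small Python sketch. The helpers `add_user_to_group` and `set_login_shell` are hypothetical and only transform text; they do not touch any file on the ESXi host. The sample lines for `daemon` and the initial shell `/bin/sh` are assumptions made purely for illustration.

```python
# Hypothetical helpers illustrating steps 3 and 4; they only transform text.

def add_user_to_group(group_file_text, group, user):
    """Return /etc/group content with `user` added to the members of `group`."""
    result = []
    for line in group_file_text.splitlines():
        fields = line.split(":")
        # /etc/group format: name:password:gid:comma-separated members
        if len(fields) == 4 and fields[0] == group:
            members = [m for m in fields[3].split(",") if m]
            if user not in members:
                members.append(user)
            fields[3] = ",".join(members)
            line = ":".join(fields)
        result.append(line)
    return "\n".join(result) + "\n"


def set_login_shell(passwd_file_text, user, shell="/sbin/nologin"):
    """Return /etc/passwd content with the login shell of `user` replaced."""
    result = []
    for line in passwd_file_text.splitlines():
        fields = line.split(":")
        # /etc/passwd format: name:password:uid:gid:comment:home:shell
        if len(fields) == 7 and fields[0] == user:
            fields[6] = shell
            line = ":".join(fields)
        result.append(line)
    return "\n".join(result) + "\n"


# Illustrative input; the daemon line and /bin/sh are assumptions.
group_before = "root:x:0:root\ndaemon:x:2:\n"
passwd_before = "monitoring:x:1000:1000:ESXi User:/:/bin/sh\n"

group_after = add_user_to_group(group_before, "root", "monitoring")
passwd_after = set_login_shell(passwd_before, "monitoring")
print(group_after)
print(passwd_after)
```

Both helpers are idempotent, so applying them a second time leaves the text unchanged.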

Integration into Icinga

There are several ways to define an Icinga command (commands.cfg). The simplest form under Debian/Ubuntu is:

# 'check_esxi_hardware' command definition
define command{
    command_name check_esxi_hardware
    command_line /usr/lib/nagios/plugins/check_esxi_hardware.py -H $HOSTADDRESS$ -U $ARG1$ -P $ARG2$
}

Further variants can be found here.
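
To make the role of the macros in the command definition clearer, the following Python sketch models how `$HOSTADDRESS$`, `$ARG1$` and `$ARG2$` are filled from a `check_command` of the form `name!arg1!arg2`. This is a simplified, hypothetical model, not the actual Nagios or Icinga implementation, which handles many more macros and escaping rules. The user name monitoring and the password `secret` are placeholders.

```python
# Simplified, hypothetical model of macro expansion; the real Nagios/Icinga
# logic handles many more macros and escaping rules.
COMMAND_LINE = ("/usr/lib/nagios/plugins/check_esxi_hardware.py "
                "-H $HOSTADDRESS$ -U $ARG1$ -P $ARG2$")


def expand_command(command_line, host_address, check_command):
    """Expand $HOSTADDRESS$ and $ARGn$ for a check_command 'name!arg1!arg2'."""
    # The first element is the command name; the rest are $ARG1$, $ARG2$, ...
    _name, *args = check_command.split("!")
    expanded = command_line.replace("$HOSTADDRESS$", host_address)
    for index, value in enumerate(args, start=1):
        expanded = expanded.replace("$ARG%d$" % index, value)
    return expanded


# Placeholder credentials; replace with the values of your monitoring user.
result = expand_command(COMMAND_LINE, "10.1.102.52",
                        "check_esxi_hardware!monitoring!secret")
print(result)
```

In a service definition, the user name and password would accordingly be passed as the first and second argument after the command name.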

References

Credit

Many thanks to Sascha Peters for this valuable tip!



Author: Christoph Mitasch

Christoph Mitasch works in the Web Operations & Knowledge Transfer department at Thomas-Krenn. He is responsible for maintaining and further developing the webshop infrastructure. Since a student project on high availability and data replication under Linux, he has been working intensively on this subject area. After an internship at IBM Linz, he completed his diploma studies in "Computer and Media Security" at FH Hagenberg. He lives near Linz and, outside of work, is an enthusiastic marathon runner and juggler who holds several world records in team juggling.

