Monitoring VMware ESXi Hardware with Nagios or Icinga – Thomas-Krenn-Wiki (2023)

Note: This article / category either refers to older software or hardware components or is no longer maintained for other reasons.
This page is no longer updated and remains available in the archive for reference purposes only.

System status in the vSphere Client: information about the LSI RAID controller is provided by the CIM provider. Server sensors such as RAM temperature, fan speeds or power supply status are queried by VMware via IPMI.

Icinga warning after the failure of a hard disk in a server with an X9SCM-F mainboard.

VMware vSphere 6.7, 6.5, 6.0, 5.5, 5.1 and 5.0 offer integrated monitoring of a server's hardware components. VMware determines the status of these components through checks that are already built in (e.g. for IPMI sensors) and through the corresponding CIM providers, for example for hardware RAID controllers.

The check_esxi_hardware.py plugin makes it easy to monitor the hardware system status with Nagios or Icinga.

Contents

  • 1 CIM provider requirements
  • 2 Plugin
  • 3 Using the plugin
    • 3.1 Workaround for restricted user permissions
  • 4 Integration into Icinga
  • 5 References
  • 6 Credit

CIM provider requirements

The CIM provider must pass the hardware status information on to ESXi. This is the case, for example, with the CIM provider for MegaRAID controllers:

  • Monitoring MegaRAID controllers in VMware; known limitations (as of 11 June 2014):
    • VMware ESXi 5.1 and 5.5 show UNCONFIGURED GOOD for LSI RAID controllers even when the drives are configured
    • On systems with a larger number of hard disks, the information from the LSI RAID controller may stop being displayed after some time (the "Storage" subtree then also disappears in the vSphere Client). We have reported this problem to LSI, and a fix is being worked on. We will update this article as soon as new information is available.

Note: The CIM provider for Adaptec RAID controllers is not suitable for this purpose (see Monitoring Adaptec RAID controllers in VMware - installing the CIM provider and aacraid driver).

Plugin

The plugin is available for download from the following website:

Information on exchange.nagios.org:

Using the plugin

We first tested the plugin on a Thomas-Krenn server with an X8DT3 mainboard. ESXi 5.1 with the integrated LSI CIM provider was installed on this server (Thomas Krenn provides this image in its download area). This also allows the status of the LSI 9260-4i RAID controller to be monitored. The most recent tests were carried out with a Supermicro X10 mainboard and a MegaRAID 9341-4i under ESXi 6.5.

To use the plugin, Python and the pywbem library must be installed. On Debian/Ubuntu the library can be installed with:

apt-get install python-pywbem
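
Before running the plugin it can be useful to confirm that pywbem is actually importable by the Python interpreter in use. The following is a minimal sketch using only the standard library; the helper name module_available is hypothetical and not part of the plugin.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if the named top-level module can be found by this interpreter."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    if module_available("pywbem"):
        print("pywbem is installed")
    else:
        print("pywbem is missing; install it before running the plugin")
```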

The plugin can then be tested on the command line.

The most important plugin parameters are:

  • -H ... IP address of the VMware ESXi server
  • -U ... username, or path to the username/password file (file:/path/to/.file)
  • -P ... password, or path to the password file (file:/path/to/.file)
  • -v ... verbose; shows all sensors that are queried
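
When the check is started from a script, the parameters listed above can be assembled into an argument list. This is only a sketch: the function build_check_command and its defaults are hypothetical, and the interpreter name and plugin path are assumptions taken from the command lines shown in this article.

```python
import shlex
from typing import List


def build_check_command(host: str, user: str, password: str,
                        verbose: bool = False,
                        plugin: str = "check_esxi_hardware.py") -> List[str]:
    """Assemble the argument list for one plugin call (hypothetical helper)."""
    # -H, -U, -P and -v are the parameters described in the list above
    cmd = ["python", plugin, "-H", host, "-U", user, "-P", password]
    if verbose:
        cmd.append("-v")
    return cmd


if __name__ == "__main__":
    # The file: form keeps the password itself out of the argument list
    print(shlex.join(build_check_command("10.X.X.X", "root", "file:/path/to/.file")))
```

Passing the arguments as a list (rather than one shell string) avoids quoting problems if a value contains spaces.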

For testing we use the root user of the ESXi server. In a production environment, a dedicated user that is only permitted to read the sensors should be created on the vCenter Server.

The plugin can be called as follows:

python check_esxi_hardware.py -H 10.X.X.X -U root -P password
WARNING: Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i) WARNING: Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i) - \
Server: Supermicro X8DT3 s/n: 1234567890 System BIOS: 2.0a 2010-09-14
echo $?
1

In this case the plugin returns a warning (exit code 1) because the RAID controller battery (BBU) is not yet fully charged.
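
The exit code is what Nagios or Icinga evaluates: by the standard plugin convention, 0 means OK, 1 WARNING, 2 CRITICAL and 3 UNKNOWN. The sketch below shows one way a wrapper script could translate the code into a state name; state_name and run_check are hypothetical helpers, and the demonstration runs a stand-in command instead of the real plugin.

```python
import subprocess
import sys

# Standard Nagios/Icinga plugin return codes
NAGIOS_STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def state_name(exit_code: int) -> str:
    """Map a plugin exit code to its state name; unexpected codes count as UNKNOWN."""
    return NAGIOS_STATES.get(exit_code, "UNKNOWN")


def run_check(argv):
    """Run a check command and return (state name, first line of its output)."""
    result = subprocess.run(argv, capture_output=True, text=True)
    lines = result.stdout.splitlines()
    first_line = lines[0] if lines else ""
    return state_name(result.returncode), first_line


if __name__ == "__main__":
    # Stand-in for the plugin call: prints a message and exits with code 1
    demo = [sys.executable, "-c", "import sys; print('WARNING: demo'); sys.exit(1)"]
    print(run_check(demo))
```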

The plugin's website recommends supplying the password in a file. That way the password does not appear in the process list while the check is running. There are two variants:

  • store only the password in the file
    • python check_esxi_hardware.py -H 10.X.X.X -U root -P file:/path/to/.file
  • store username and password in the file, separated by a space
    • python check_esxi_hardware.py -H 10.X.X.X -U file:/path/to/.file -P file:/path/to/.file
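
A credentials file only helps if other local users cannot read it. The following sketch creates a file for the second variant (username and password separated by a space) with owner-only permissions. The function name and file name are hypothetical, and writing the file without a trailing newline is an assumption, since the article does not state how the plugin treats one.

```python
import os
import tempfile


def write_credentials_file(path: str, username: str, password: str) -> None:
    """Write 'username password' to a new file that only the owner may read or write."""
    # O_EXCL makes the call fail instead of overwriting an existing file;
    # mode 0o600 grants read/write to the owner only (POSIX systems)
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_EXCL, 0o600)
    with os.fdopen(fd, "w") as handle:
        # Assumption: no trailing newline
        handle.write(f"{username} {password}")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as directory:
        target = os.path.join(directory, ".esxi_credentials")  # hypothetical name
        write_credentials_file(target, "tkmon", "secret")
        print(oct(os.stat(target).st_mode & 0o777))
```

The resulting path would then be passed as file:/path/to/.file to both -U and -P, as in the second call above.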

The "-v" option is also useful: it lists every queried sensor together with its status code. Example with ESXi 5.1:

python check_esxi_hardware.py -H 10.1.102.143 -U tkmon -P relation -v
20130430 09:29:33 Connection to https://10.1.102.143
20130430 09:29:33 Check classe OMC_SMASHFirmwareIdentity
20130430 09:29:33 Element Name = System BIOS
20130430 09:29:33 VersionString = 2.0a
20130430 09:29:33 Check classe CIM_Chassis
20130430 09:29:33 Element Name = Chassis
20130430 09:29:33 Manufacturer = Supermicro
20130430 09:29:33 SerialNumber = 1234567890
20130430 09:29:33 Model = X8DT3
20130430 09:29:33 Element Op Status = 0
20130430 09:29:33 Check classe CIM_Card
20130430 09:29:34 Element Name = Motherboard
20130430 09:29:34 Element Op Status = 0
20130430 09:29:34 Check classe CIM_ComputerSystem
20130430 09:29:34 Element Name = System Board 7:1
20130430 09:29:34 Element Op Status = 0
20130430 09:29:34 Element Name = localhost
20130430 09:29:34 Element Name = Hardware Management Controller (Node 0)
20130430 09:29:34 Element Op Status = 0
20130430 09:29:34 Element Name = Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)
20130430 09:29:34 Element Op Status = 3
20130430 09:29:34 GLobal exit set to WARNING
20130430 09:29:34 Check classe CIM_NumericSensor
20130430 09:29:35 Element Name = Memory Device 12 P2-DIMM3B Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 42.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 80.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Memory Device 11 P2-DIMM3A Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 45.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 80.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Memory Device 10 P2-DIMM2B Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 39.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 80.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Memory Device 9 P2-DIMM2A Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 41.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 80.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Memory Device 8 P2-DIMM1B Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 39.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 80.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Memory Device 7 P2-DIMM1A Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 39.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 80.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Fan Device 8 Fan8
20130430 09:29:35 sensorType = 5 - Tachometer
20130430 09:29:35 BaseUnits = 19
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 1890.000000
20130430 09:29:35 Lower Threshold Non Critical = 675.000000
20130430 09:29:35 Upper Threshold Non Critical = 34155.000000
20130430 09:29:35 Lower Threshold Critical = 540.000000
20130430 09:29:35 Upper Threshold Critical = 34290.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Fan Device 7 Fan7
20130430 09:29:35 sensorType = 5 - Tachometer
20130430 09:29:35 BaseUnits = 19
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 1890.000000
20130430 09:29:35 Lower Threshold Non Critical = 675.000000
20130430 09:29:35 Upper Threshold Non Critical = 34155.000000
20130430 09:29:35 Lower Threshold Critical = 540.000000
20130430 09:29:35 Upper Threshold Critical = 34290.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Fan Device 5 Fan5
20130430 09:29:35 sensorType = 5 - Tachometer
20130430 09:29:35 BaseUnits = 19
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 945.000000
20130430 09:29:35 Lower Threshold Non Critical = 675.000000
20130430 09:29:35 Upper Threshold Non Critical = 34155.000000
20130430 09:29:35 Lower Threshold Critical = 540.000000
20130430 09:29:35 Upper Threshold Critical = 34290.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Fan Device 2 Fan2
20130430 09:29:35 sensorType = 5 - Tachometer
20130430 09:29:35 BaseUnits = 19
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 1080.000000
20130430 09:29:35 Lower Threshold Non Critical = 675.000000
20130430 09:29:35 Upper Threshold Non Critical = 34155.000000
20130430 09:29:35 Lower Threshold Critical = 540.000000
20130430 09:29:35 Upper Threshold Critical = 34290.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = Fan Device 1 Fan1
20130430 09:29:35 sensorType = 5 - Tachometer
20130430 09:29:35 BaseUnits = 19
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 945.000000
20130430 09:29:35 Lower Threshold Non Critical = 675.000000
20130430 09:29:35 Upper Threshold Non Critical = 34155.000000
20130430 09:29:35 Lower Threshold Critical = 540.000000
20130430 09:29:35 Upper Threshold Critical = 34290.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 VBAT
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 3.240000
20130430 09:29:35 Lower Threshold Non Critical = 2.920000
20130430 09:29:35 Upper Threshold Non Critical = 3.640000
20130430 09:29:35 Lower Threshold Critical = 2.900000
20130430 09:29:35 Upper Threshold Critical = 3.670000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 +12V
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 12.080000
20130430 09:29:35 Lower Threshold Non Critical = 10.700000
20130430 09:29:35 Upper Threshold Non Critical = 13.250000
20130430 09:29:35 Lower Threshold Critical = 10.650000
20130430 09:29:35 Upper Threshold Critical = 13.300000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 +5V
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 5.020000
20130430 09:29:35 Lower Threshold Non Critical = 4.480000
20130430 09:29:35 Upper Threshold Non Critical = 5.530000
20130430 09:29:35 Lower Threshold Critical = 4.440000
20130430 09:29:35 Upper Threshold Critical = 5.560000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 +3.3VSB
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 3.240000
20130430 09:29:35 Lower Threshold Non Critical = 2.920000
20130430 09:29:35 Upper Threshold Non Critical = 3.640000
20130430 09:29:35 Lower Threshold Critical = 2.900000
20130430 09:29:35 Upper Threshold Critical = 3.670000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 +3.3V
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 3.280000
20130430 09:29:35 Lower Threshold Non Critical = 2.920000
20130430 09:29:35 Upper Threshold Non Critical = 3.640000
20130430 09:29:35 Lower Threshold Critical = 2.900000
20130430 09:29:35 Upper Threshold Critical = 3.670000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 +1.5V
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 1.520000
20130430 09:29:35 Lower Threshold Non Critical = 1.330000
20130430 09:29:35 Upper Threshold Non Critical = 1.650000
20130430 09:29:35 Lower Threshold Critical = 1.320000
20130430 09:29:35 Upper Threshold Critical = 1.660000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 CPU2 DIMM
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 1.580000
20130430 09:29:35 Lower Threshold Non Critical = 1.190000
20130430 09:29:35 Upper Threshold Non Critical = 1.640000
20130430 09:29:35 Lower Threshold Critical = 1.190000
20130430 09:29:35 Upper Threshold Critical = 1.650000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 CPU2 Vcore
20130430 09:29:35 sensorType = 3 - Voltage
20130430 09:29:35 BaseUnits = 5
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 1.040000
20130430 09:29:35 Lower Threshold Non Critical = 0.820000
20130430 09:29:35 Upper Threshold Non Critical = 1.350000
20130430 09:29:35 Lower Threshold Critical = 0.810000
20130430 09:29:35 Upper Threshold Critical = 1.360000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Element Name = System Board 1 System Temp
20130430 09:29:35 sensorType = 2 - Temperature
20130430 09:29:35 BaseUnits = 2
20130430 09:29:35 Scaled by = 0.010000
20130430 09:29:35 Current Reading = 36.000000
20130430 09:29:35 Lower Threshold Non Critical = -5.000000
20130430 09:29:35 Upper Threshold Non Critical = 75.000000
20130430 09:29:35 Lower Threshold Critical = -7.000000
20130430 09:29:35 Upper Threshold Critical = 77.000000
20130430 09:29:35 Element Op Status = 2
20130430 09:29:35 Check classe CIM_Memory
20130430 09:29:35 Element Name = CPU 2 Level-1 Cache
20130430 09:29:35 Element Op Status = 0
20130430 09:29:35 Element Name = CPU 2 Level-2 Cache
20130430 09:29:35 Element Op Status = 0
20130430 09:29:35 Element Name = CPU 2 Level-3 Cache
20130430 09:29:35 Element Op Status = 0
20130430 09:29:35 Element Name = Memory
20130430 09:29:35 Check classe CIM_Processor
20130430 09:29:36 Element Name = CPU 2
20130430 09:29:36 Family = 179
20130430 09:29:36 CurrentClockSpeed = 1866MHz
20130430 09:29:36 Element Op Status = 2
20130430 09:29:36 Check classe CIM_RecordLog
20130430 09:29:36 Check classe OMC_DiscreteSensor
20130430 09:29:36 Element Name = Power Supply 1 PS Status: Failure status
20130430 09:29:36 Element Op Status = 2
20130430 09:29:36 Element Name = System Chassis 1 Intrusion: General Chassis intrusion
20130430 09:29:36 Element Op Status = 2
20130430 09:29:36 Element Name = Processor 2 CPU2 Temp
20130430 09:29:36 Check classe OMC_Fan
20130430 09:29:37 Element Name = Fan8
20130430 09:29:37 Element Op Status = 2
20130430 09:29:37 Element Name = Fan7
20130430 09:29:37 Element Op Status = 2
20130430 09:29:37 Element Name = Fan5
20130430 09:29:37 Element Op Status = 2
20130430 09:29:37 Element Name = Fan2
20130430 09:29:37 Element Op Status = 2
20130430 09:29:37 Element Name = Fan1
20130430 09:29:37 Element Op Status = 2
20130430 09:29:37 Check classe OMC_PowerSupply
20130430 09:29:37 Element Name = Power Supply 1
20130430 09:29:37 Element Op Status = 2
20130430 09:29:37 Check classe VMware_StorageExtent
20130430 09:29:38 Element Name = Drive 252_5 on controller 500605B00418BB20 Fw: n/a - UNCONFIGURED GOOD
20130430 09:29:38 Element Op Status = 2
20130430 09:29:38 Element Name = Drive 252_4 on controller 500605B00418BB20 Fw: n/a - UNCONFIGURED GOOD
20130430 09:29:38 Element Op Status = 2
20130430 09:29:38 Check classe VMware_Controller
20130430 09:29:38 Element Name = Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)
20130430 09:29:38 Element Op Status = 3
20130430 09:29:38 GLobal exit set to WARNING
20130430 09:29:38 Check classe VMware_StorageVolume
20130430 09:29:39 Element Name = RAID 1 StorageVolume Logical Volume 500605B00418BB20_0 on controller 500605B00418BB20, Drives( - OPTIMAL
20130430 09:29:39 Element Op Status = 2
20130430 09:29:39 Check classe VMware_Battery
20130430 09:29:39 Element Name = Battery 934 on Controller 500605B00418BB20
20130430 09:29:39 Element Op Status = 11
20130430 09:29:39 Check classe VMware_SASSATAPort
20130430 09:29:39 Element Name = Port 0 on Controller 500605B00418BB20
20130430 09:29:39 Element Op Status = 2
20130430 09:29:39 Element Name = Port 1 on Controller 500605B00418BB20
20130430 09:29:39 Element Op Status = 2
WARNING: Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i) WARNING: Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i) -\
Server: Supermicro X8DT3 s/n: 1234567890 System BIOS: 2.0a 2010-09-14
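
Verbose output like the ESXi 5.1 example is long, so it can help to extract just the element names and their status codes. The sketch below assumes that the plugin prints one timestamped entry per line and that "Element Op Status" carries the CIM OperationalStatus value (0 = Unknown, 2 = OK, 3 = Degraded, 11 = In Service); both points are assumptions, and parse_elements is a hypothetical helper, not part of the plugin.

```python
import re

# Subset of CIM OperationalStatus values (assumed meaning of "Element Op Status")
OPERATIONAL_STATUS = {0: "Unknown", 2: "OK", 3: "Degraded", 11: "In Service"}

# One entry per line: "YYYYMMDD HH:MM:SS <key> = <value>"
ENTRY_RE = re.compile(r"^\d{8} \d{2}:\d{2}:\d{2} (.+?) = (.*)$")


def parse_elements(verbose_output: str):
    """Return a list of (element name, status code or None) in output order."""
    elements = []
    for line in verbose_output.splitlines():
        match = ENTRY_RE.match(line.strip())
        if not match:
            continue  # e.g. "Check classe ..." lines carry no key/value pair
        key, value = match.group(1).strip(), match.group(2).strip()
        if key == "Element Name":
            elements.append([value, None])
        elif key == "Element Op Status" and elements and elements[-1][1] is None:
            elements[-1][1] = int(value)
    return [tuple(entry) for entry in elements]


SAMPLE = """\
20130430 09:29:34 Check classe CIM_ComputerSystem
20130430 09:29:34 Element Name = localhost
20130430 09:29:34 Element Name = Hardware Management Controller (Node 0)
20130430 09:29:34 Element Op Status = 0
20130430 09:29:34 Element Name = Controller 500605B00418BB20 (LSI MegaRAID SAS 9260-4i)
20130430 09:29:34 Element Op Status = 3
20130430 09:29:34 GLobal exit set to WARNING
"""

if __name__ == "__main__":
    for name, status in parse_elements(SAMPLE):
        label = OPERATIONAL_STATUS.get(status, "n/a")
        print(f"{name}: {status} ({label})")
```

In the sample, only the RAID controller reports status 3, which matches the point in the output where the global exit state is set to WARNING.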

And an example of the verbose output with ESXi 6.5:

tkmon@tkmon:~$ /usr/lib/nagios/plugins/check_esxi_hardware.py --host=10.2.1.169 --user=root --pass=********** --verbose
20170823 22:13:15 Connection to https://10.2.1.169
20170823 22:13:15 Found pywbem version 0.8.0-dev
20170823 22:13:15 Check classe OMC_SMASHFirmwareIdentity
20170823 22:13:15 Element Name = System BIOS
20170823 22:13:15 VersionString = 2.0a
20170823 22:13:15 Check classe CIM_Chassis
20170823 22:13:15 Element Name = Chassis
20170823 22:13:15 Manufacturer = Supermicro
20170823 22:13:15 SerialNumber = 0123456789
20170823 22:13:15 Model = Super Server
20170823 22:13:15 Element Op Status = 0
20170823 22:13:15 Check classe CIM_Card
20170823 22:13:16 Element Name = Motherboard
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Check classe CIM_ComputerSystem
20170823 22:13:16 Element Name = System Board 7:1
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:2
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:3
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:12
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:15
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:17
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:18
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:19
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:20
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:21
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:32
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = System Board 7:33
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = localhost.intern.thomas-krenn.com
20170823 22:13:16 Element Name = Hardware Management Controller (Node 0)
20170823 22:13:16 Element Op Status = 0
20170823 22:13:16 Element Name = Controller 500605B00CDBC930 (LSI MegaRAID SAS 9341-4i)
20170823 22:13:16 Element Op Status = 2
20170823 22:13:16 Check classe CIM_NumericSensor
20170823 22:13:18 Element Name = System Board 12 1.05V PCH
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.050000
20170823 22:13:18 Lower Threshold Non Critical = 0.940000
20170823 22:13:18 Upper Threshold Non Critical = 1.190000
20170823 22:13:18 Lower Threshold Critical = 0.890000
20170823 22:13:18 Upper Threshold Critical = 1.220000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 21 1.2V BMC
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.210000
20170823 22:13:18 Lower Threshold Non Critical = 1.090000
20170823 22:13:18 Upper Threshold Non Critical = 1.340000
20170823 22:13:18 Lower Threshold Critical = 1.040000
20170823 22:13:18 Upper Threshold Critical = 1.370000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 20 1.5V PCH
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.500000
20170823 22:13:18 Lower Threshold Non Critical = 1.400000
20170823 22:13:18 Upper Threshold Non Critical = 1.640000
20170823 22:13:18 Lower Threshold Critical = 1.340000
20170823 22:13:18 Upper Threshold Critical = 1.670000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 19 3.3VSB
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 3.240000
20170823 22:13:18 Lower Threshold Non Critical = 2.950000
20170823 22:13:18 Upper Threshold Non Critical = 3.550000
20170823 22:13:18 Lower Threshold Critical = 2.820000
20170823 22:13:18 Upper Threshold Critical = 3.650000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 15 5VSB
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 4.920000
20170823 22:13:18 Lower Threshold Non Critical = 4.480000
20170823 22:13:18 Upper Threshold Non Critical = 5.390000
20170823 22:13:18 Lower Threshold Critical = 4.290000
20170823 22:13:18 Upper Threshold Critical = 5.540000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 4 VDIMMGH
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.200000
20170823 22:13:18 Lower Threshold Non Critical = 1.040000
20170823 22:13:18 Upper Threshold Non Critical = 1.340000
20170823 22:13:18 Lower Threshold Critical = 0.970000
20170823 22:13:18 Upper Threshold Critical = 1.420000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 3 VDIMMEF
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.200000
20170823 22:13:18 Lower Threshold Non Critical = 1.040000
20170823 22:13:18 Upper Threshold Non Critical = 1.340000
20170823 22:13:18 Lower Threshold Critical = 0.970000
20170823 22:13:18 Upper Threshold Critical = 1.420000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 2 VDIMMCD
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.200000
20170823 22:13:18 Lower Threshold Non Critical = 1.040000
20170823 22:13:18 Upper Threshold Non Critical = 1.340000
20170823 22:13:18 Lower Threshold Critical = 0.970000
20170823 22:13:18 Upper Threshold Critical = 1.420000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 1 VDIMMAB
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.200000
20170823 22:13:18 Lower Threshold Non Critical = 1.040000
20170823 22:13:18 Upper Threshold Non Critical = 1.340000
20170823 22:13:18 Lower Threshold Critical = 0.970000
20170823 22:13:18 Upper Threshold Critical = 1.420000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Processor 4 Vcpu2
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.800000
20170823 22:13:18 Lower Threshold Non Critical = 1.390000
20170823 22:13:18 Upper Threshold Non Critical = 1.890000
20170823 22:13:18 Lower Threshold Critical = 1.260000
20170823 22:13:18 Upper Threshold Critical = 2.080000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Processor 3 Vcpu1
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 1.800000
20170823 22:13:18 Lower Threshold Non Critical = 1.390000
20170823 22:13:18 Upper Threshold Non Critical = 1.890000
20170823 22:13:18 Lower Threshold Critical = 1.260000
20170823 22:13:18 Upper Threshold Critical = 2.080000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 18 VBAT
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 2.790000
20170823 22:13:18 Lower Threshold Non Critical = 2.500000
20170823 22:13:18 Upper Threshold Non Critical = 3.670000
20170823 22:13:18 Lower Threshold Critical = 2.430000
20170823 22:13:18 Upper Threshold Critical = 3.780000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 32 3.3VCC
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 3.350000
20170823 22:13:18 Lower Threshold Non Critical = 2.950000
20170823 22:13:18 Upper Threshold Non Critical = 3.550000
20170823 22:13:18 Lower Threshold Critical = 2.820000
20170823 22:13:18 Upper Threshold Critical = 3.650000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 33 5VCC
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 5.000000
20170823 22:13:18 Lower Threshold Non Critical = 4.480000
20170823 22:13:18 Upper Threshold Non Critical = 5.390000
20170823 22:13:18 Lower Threshold Critical = 4.290000
20170823 22:13:18 Upper Threshold Critical = 5.540000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 17 12V
20170823 22:13:18 sensorType = 3 - Voltage
20170823 22:13:18 BaseUnits = 5
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 12.120000
20170823 22:13:18 Lower Threshold Non Critical = 10.740000
20170823 22:13:18 Upper Threshold Non Critical = 12.940000
20170823 22:13:18 Lower Threshold Critical = 10.290000
20170823 22:13:18 Upper Threshold Critical = 13.260000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Fan Device 5 FAN5
20170823 22:13:18 sensorType = 5 - Tachometer
20170823 22:13:18 BaseUnits = 19
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 3000.000000
20170823 22:13:18 Lower Threshold Non Critical = 700.000000
20170823 22:13:18 Upper Threshold Non Critical = 25300.000000
20170823 22:13:18 Lower Threshold Critical = 500.000000
20170823 22:13:18 Upper Threshold Critical = 25400.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Fan Device 2 FAN2
20170823 22:13:18 sensorType = 5 - Tachometer
20170823 22:13:18 BaseUnits = 19
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 2300.000000
20170823 22:13:18 Lower Threshold Non Critical = 700.000000
20170823 22:13:18 Upper Threshold Non Critical = 25300.000000
20170823 22:13:18 Lower Threshold Critical = 500.000000
20170823 22:13:18 Upper Threshold Critical = 25400.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 84 P2-DIMMF1 Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 34.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 80.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 85.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 80 P2-DIMME1 Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 36.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 80.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 85.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 68 P1-DIMMB1 Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 37.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 80.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 85.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Device 64 P1-DIMMA1 Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 39.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 80.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 85.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Module 6 VmemGHVRM Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 36.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 95.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 100.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Module 5 VmemEFVRM Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 44.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 95.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 100.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Module 4 VmemCDVRM Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 38.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 95.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 100.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Module 3 VmemABVRM Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 39.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 95.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 100.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Module 2 Vcpu2VRM Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 44.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 95.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 100.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Memory Module 1 Vcpu1VRM Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 49.000000
20170823 22:13:18 Lower Threshold Non Critical = 5.000000
20170823 22:13:18 Upper Threshold Non Critical = 95.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 100.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 2 Peripheral Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 48.000000
20170823 22:13:18 Lower Threshold Non Critical = 0.000000
20170823 22:13:18 Upper Threshold Non Critical = 80.000000
20170823 22:13:18 Lower Threshold Critical = -5.000000
20170823 22:13:18 Upper Threshold Critical = 85.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 1 System Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 35.000000
20170823 22:13:18 Lower Threshold Non Critical = 0.000000
20170823 22:13:18 Upper Threshold Non Critical = 80.000000
20170823 22:13:18 Lower Threshold Critical = -5.000000
20170823 22:13:18 Upper Threshold Critical = 85.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = System Board 3 PCH Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 50.000000
20170823 22:13:18 Lower Threshold Non Critical = 16.000000
20170823 22:13:18 Upper Threshold Non Critical = 90.000000
20170823 22:13:18 Lower Threshold Critical = 5.000000
20170823 22:13:18 Upper Threshold Critical = 95.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Processor 2 CPU2 Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 39.000000
20170823 22:13:18 Lower Threshold Non Critical = 0.000000
20170823 22:13:18 Upper Threshold Non Critical = 85.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 90.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Element Name = Processor 1 CPU1 Temp
20170823 22:13:18 sensorType = 2 - Temperature
20170823 22:13:18 BaseUnits = 2
20170823 22:13:18 Scaled by = 0.010000
20170823 22:13:18 Current Reading = 37.000000
20170823 22:13:18 Lower Threshold Non Critical = 0.000000
20170823 22:13:18 Upper Threshold Non Critical = 85.000000
20170823 22:13:18 Lower Threshold Critical = 0.000000
20170823 22:13:18 Upper Threshold Critical = 90.000000
20170823 22:13:18 Element Op Status = 2
20170823 22:13:18 Check classe CIM_Memory
20170823 22:13:18 Element Name = CPU1 Level-1 Cache
20170823 22:13:18 Element Op Status = 0
20170823 22:13:18 Element Name = CPU1 Level-2 Cache
20170823 22:13:18 Element Op Status = 0
20170823 22:13:18 Element Name = CPU1 Level-3 Cache
20170823 22:13:18 Element Op Status = 0
20170823 22:13:18 Element Name = CPU2 Level-1 Cache
20170823 22:13:18 Element Op Status = 0
20170823 22:13:18 Element Name = CPU2 Level-2 Cache
20170823 22:13:18 Element Op Status = 0
20170823 22:13:18 Element Name = CPU2 Level-3 Cache
20170823 
22:13:18 Element Op Status = 020170823 22:13:18 Element Name = Memory20170823 22:13:18 Check classe CIM_Processor20170823 22:13:18 Element Name = CPU120170823 22:13:18 Family = 17920170823 22:13:18 CurrentClockSpeed = 1700MHz20170823 22:13:18 Element Op Status = 220170823 22:13:18 Element Name = CPU220170823 22:13:18 Family = 17920170823 22:13:18 CurrentClockSpeed = 1700MHz20170823 22:13:18 Element Op Status = 220170823 22:13:18 Check classe CIM_RecordLog20170823 22:13:19 Element Name = IPMI SEL20170823 22:13:19 Element Op Status = 220170823 22:13:19 Check classe OMC_DiscreteSensor20170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: General Chassis intrusion20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Drive Bay intrusion20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: I/O Card area intrusion20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Processor area intrusion20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: System unplugged from LAN20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unauthorized dock20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: FAN area intrusion20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 
22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = System Chassis 1 Chassis Intru: Unknown20170823 22:13:19 Element Op Status = 220170823 22:13:19 Check classe OMC_Fan20170823 22:13:19 Element Name = FAN520170823 22:13:19 Element Op Status = 220170823 22:13:19 Element Name = FAN220170823 22:13:19 Element Op Status = 220170823 22:13:19 Check classe OMC_PowerSupply20170823 22:13:20 Check classe VMware_StorageExtent20170823 22:13:20 Element Name = Drive 62_4 on controller 500605B00CDBC930 Fw: 1V02 - ONLINE20170823 22:13:20 Element Op Status = 220170823 22:13:20 Element Name = Drive 62_5 on controller 500605B00CDBC930 Fw: 1V02 - ONLINE20170823 22:13:20 Element Op Status = 220170823 22:13:20 Check classe VMware_Controller20170823 22:13:20 Element Name = Controller 500605B00CDBC930 (LSI MegaRAID SAS 9341-4i)20170823 22:13:20 Element Op Status = 220170823 22:13:20 Check classe VMware_StorageVolume20170823 22:13:21 Element Name = RAID 1 Data Logical Volume 500605B00CDBC930_0 on controller 500605B00CDBC930, Drives( - OPTIMAL20170823 22:13:21 Element Op Status = 220170823 22:13:21 Check classe VMware_Battery20170823 22:13:21 Check classe VMware_SASSATAPort20170823 22:13:21 Element Name = Port 0 on Controller 500605B00CDBC93020170823 22:13:21 Element Op Status = 220170823 22:13:21 Element Name = Port 1 on Controller 500605B00CDBC93020170823 22:13:21 Element Op Status = 2OK - Server: Supermicro Super Server s/n: 0123456789 System BIOS: 2.0a 2016-08-25tkmon@tkmon:~$

Sample output under ESXi 6.7. No MegaRAID controller data is currently returned here, because the available SMIS provider does not yet work correctly under ESXi 6.7:

tkmon@tkmon:~$ /usr/lib/nagios/plugins/check_esxi_hardware.py --host=10.1.102.52 --user=root --pass=Relation1234$ --verbose
20180731 14:37:40 Connection to https://10.1.102.52
20180731 14:37:40 Found pywbem version 0.8.0-dev
20180731 14:37:40 Check classe OMC_SMASHFirmwareIdentity
20180731 14:37:40 Element Name = System BIOS
20180731 14:37:40 VersionString = 2.0a
20180731 14:37:40 Check classe CIM_Chassis
20180731 14:37:41 Element Name = Chassis
20180731 14:37:41 Manufacturer = Supermicro
20180731 14:37:41 SerialNumber = 0123456789
20180731 14:37:41 Model = X9SCL-II/X9SCM-II
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Check classe CIM_Card
20180731 14:37:41 Element Name = Motherboard
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Check classe CIM_ComputerSystem
20180731 14:37:41 Element Name = System Board 7:1
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:2
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:17
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:18
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:32
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:33
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:34
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:35
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = System Board 7:36
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Element Name = localhost.tdz.thomas-krenn.com
20180731 14:37:41 Element Name = Hardware Management Controller (Node 0)
20180731 14:37:41 Element Op Status = 0
20180731 14:37:41 Check classe CIM_NumericSensor
20180731 14:37:42 Element Name = System Board 36 AVCC
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 3.370000
20180731 14:37:42 Lower Threshold Non Critical = 2.940000
20180731 14:37:42 Upper Threshold Non Critical = 3.580000
20180731 14:37:42 Lower Threshold Critical = 2.880000
20180731 14:37:42 Upper Threshold Critical = 3.640000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 35 VSB
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 3.310000
20180731 14:37:42 Lower Threshold Non Critical = 2.940000
20180731 14:37:42 Upper Threshold Non Critical = 3.580000
20180731 14:37:42 Lower Threshold Critical = 2.880000
20180731 14:37:42 Upper Threshold Critical = 3.640000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 18 VBAT
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 3.050000
20180731 14:37:42 Lower Threshold Non Critical = 2.940000
20180731 14:37:42 Upper Threshold Non Critical = 3.580000
20180731 14:37:42 Lower Threshold Critical = 2.880000
20180731 14:37:42 Upper Threshold Critical = 3.640000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 34 -12V
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = -12.090000
20180731 14:37:42 Lower Threshold Non Critical = -13.450000
20180731 14:37:42 Upper Threshold Non Critical = -10.930000
20180731 14:37:42 Lower Threshold Critical = -13.650000
20180731 14:37:42 Upper Threshold Critical = -10.740000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 33 5VCC
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 5.020000
20180731 14:37:42 Lower Threshold Non Critical = 4.570000
20180731 14:37:42 Upper Threshold Non Critical = 5.340000
20180731 14:37:42 Lower Threshold Critical = 4.320000
20180731 14:37:42 Upper Threshold Critical = 5.600000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = Memory Device 1 VDIMM
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 1.500000
20180731 14:37:42 Lower Threshold Non Critical = 1.280000
20180731 14:37:42 Upper Threshold Non Critical = 1.760000
20180731 14:37:42 Lower Threshold Critical = 1.210000
20180731 14:37:42 Upper Threshold Critical = 1.770000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 17 12V
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 12.080000
20180731 14:37:42 Lower Threshold Non Critical = 10.700000
20180731 14:37:42 Upper Threshold Non Critical = 13.090000
20180731 14:37:42 Lower Threshold Critical = 10.600000
20180731 14:37:42 Upper Threshold Critical = 13.190000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 32 3.3VCC
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 3.370000
20180731 14:37:42 Lower Threshold Non Critical = 2.940000
20180731 14:37:42 Upper Threshold Non Critical = 3.580000
20180731 14:37:42 Lower Threshold Critical = 2.880000
20180731 14:37:42 Upper Threshold Critical = 3.640000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = Processor 2 Vcore
20180731 14:37:42 sensorType = 3 - Voltage
20180731 14:37:42 BaseUnits = 5
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 0.800000
20180731 14:37:42 Lower Threshold Non Critical = 0.540000
20180731 14:37:42 Upper Threshold Non Critical = 1.480000
20180731 14:37:42 Lower Threshold Critical = 0.510000
20180731 14:37:42 Upper Threshold Critical = 1.520000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = Fan Device 5 FAN A
20180731 14:37:42 sensorType = 5 - Tachometer
20180731 14:37:42 BaseUnits = 19
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 2700.000000
20180731 14:37:42 Lower Threshold Non Critical = 600.000000
20180731 14:37:42 Upper Threshold Non Critical = 18975.000000
20180731 14:37:42 Lower Threshold Critical = 450.000000
20180731 14:37:42 Upper Threshold Critical = 19050.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = Fan Device 4 FAN 4
20180731 14:37:42 sensorType = 5 - Tachometer
20180731 14:37:42 BaseUnits = 19
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 1050.000000
20180731 14:37:42 Lower Threshold Non Critical = 600.000000
20180731 14:37:42 Upper Threshold Non Critical = 18975.000000
20180731 14:37:42 Lower Threshold Critical = 450.000000
20180731 14:37:42 Upper Threshold Critical = 19050.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = Fan Device 3 FAN 3
20180731 14:37:42 sensorType = 5 - Tachometer
20180731 14:37:42 BaseUnits = 19
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 2775.000000
20180731 14:37:42 Lower Threshold Non Critical = 600.000000
20180731 14:37:42 Upper Threshold Non Critical = 18975.000000
20180731 14:37:42 Lower Threshold Critical = 450.000000
20180731 14:37:42 Upper Threshold Critical = 19050.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = Fan Device 1 FAN 1
20180731 14:37:42 sensorType = 5 - Tachometer
20180731 14:37:42 BaseUnits = 19
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 2700.000000
20180731 14:37:42 Lower Threshold Non Critical = 600.000000
20180731 14:37:42 Upper Threshold Non Critical = 18975.000000
20180731 14:37:42 Lower Threshold Critical = 450.000000
20180731 14:37:42 Upper Threshold Critical = 19050.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 2 Peripheral Temp
20180731 14:37:42 sensorType = 2 - Temperature
20180731 14:37:42 BaseUnits = 2
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 37.000000
20180731 14:37:42 Lower Threshold Non Critical = -5.000000
20180731 14:37:42 Upper Threshold Non Critical = 80.000000
20180731 14:37:42 Lower Threshold Critical = -7.000000
20180731 14:37:42 Upper Threshold Critical = 85.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Element Name = System Board 1 System Temp
20180731 14:37:42 sensorType = 2 - Temperature
20180731 14:37:42 BaseUnits = 2
20180731 14:37:42 Scaled by = 0.010000
20180731 14:37:42 Current Reading = 31.000000
20180731 14:37:42 Lower Threshold Non Critical = -5.000000
20180731 14:37:42 Upper Threshold Non Critical = 80.000000
20180731 14:37:42 Lower Threshold Critical = -7.000000
20180731 14:37:42 Upper Threshold Critical = 85.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:42 Check classe CIM_Memory
20180731 14:37:43 Element Name = CPU Level-1 Cache
20180731 14:37:43 Element Op Status = 0
20180731 14:37:43 Element Name = CPU Level-2 Cache
20180731 14:37:43 Element Op Status = 0
20180731 14:37:43 Element Name = CPU Level-3 Cache
20180731 14:37:43 Element Op Status = 0
20180731 14:37:43 Element Name = Memory
20180731 14:37:43 Check classe CIM_Processor
20180731 14:37:43 Element Name = CPU
20180731 14:37:43 Family = 179
20180731 14:37:43 CurrentClockSpeed = 3200MHz
20180731 14:37:43 Element Op Status = 2
20180731 14:37:43 Check classe CIM_RecordLog
20180731 14:37:44 Element Name = IPMI SEL
20180731 14:37:44 Element Op Status = 2
20180731 14:37:44 Check classe OMC_DiscreteSensor
20180731 14:37:44 Element Name = System Chassis 1 Chassis Intru: Unknown
20180731 14:37:44 Element Op Status = 2
20180731 14:37:44 Element Name = Processor 1 CPU Temp
20180731 14:37:44 Element Name = Power Supply 2 PS2 Status: Presence detected
20180731 14:37:44 Check classe OMC_Fan
20180731 14:37:45 Element Name = FAN A
20180731 14:37:45 Element Op Status = 2
20180731 14:37:45 Element Name = FAN 4
20180731 14:37:45 Element Op Status = 2
20180731 14:37:45 Element Name = FAN 3
20180731 14:37:45 Element Op Status = 2
20180731 14:37:45 Element Name = FAN 1
20180731 14:37:45 Element Op Status = 2
20180731 14:37:45 Check classe OMC_PowerSupply
20180731 14:37:45 Element Name = Power Supply 2
20180731 14:37:45 Element Op Status = 2
20180731 14:37:45 Check classe VMware_StorageExtent
20180731 14:37:45 Check classe VMware_Controller
20180731 14:37:46 Check classe VMware_StorageVolume
20180731 14:37:46 Check classe VMware_Battery
20180731 14:37:46 Check classe VMware_SASSATAPort
OK - Server: Supermicro X9SCL-II/X9SCM-II s/n: 0123456789 System BIOS: 2.0a 2012-09-17
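
The verbose output is easier to evaluate once it is grouped per element. The following is a minimal sketch, not part of the plugin, that assumes the output is available with one timestamped entry per line (as the plugin prints it on a terminal). The function names parse_verbose and non_ok_elements are hypothetical. Treating the status values 2 and 0 as unproblematic is an assumption of this sketch, based on the sample output, where 2 appears for healthy sensors and 0 for elements that report no status.

```python
import re

# Assumed line shape: "YYYYMMDD HH:MM:SS <message>", as in the sample output.
LINE_RE = re.compile(r"^\d{8} \d{2}:\d{2}:\d{2}\s+(.*\S)\s*$")

# Abridged lines taken from the ESXi 6.7 sample output.
SAMPLE = """\
20180731 14:37:41 Check classe CIM_NumericSensor
20180731 14:37:42 Element Name = System Board 1 System Temp
20180731 14:37:42 sensorType = 2 - Temperature
20180731 14:37:42 Current Reading = 31.000000
20180731 14:37:42 Element Op Status = 2
20180731 14:37:43 Check classe CIM_Processor
20180731 14:37:43 Element Name = CPU
20180731 14:37:43 CurrentClockSpeed = 3200MHz
20180731 14:37:43 Element Op Status = 2
OK - Server: Supermicro X9SCL-II/X9SCM-II s/n: 0123456789 System BIOS: 2.0a 2012-09-17
"""


def parse_verbose(text):
    """Group verbose plugin output into a list of per-element dicts."""
    elements = []
    current_class = None
    current = None
    for raw in text.splitlines():
        match = LINE_RE.match(raw)
        if not match:
            continue  # shell prompt, final status line, blank lines
        message = match.group(1)
        if message.startswith("Check classe "):
            current_class = message[len("Check classe "):]
            current = None
        elif " = " in message:
            key, value = message.split(" = ", 1)
            if key == "Element Name":
                # Every "Element Name" line starts a new element.
                current = {"class": current_class, "name": value}
                elements.append(current)
            elif current is not None:
                current[key] = value
    return elements


def non_ok_elements(elements):
    """Names of elements whose status is neither 2 nor 0 (sketch assumption)."""
    return [e["name"] for e in elements
            if e.get("Element Op Status", "0") not in ("0", "2")]


if __name__ == "__main__":
    for element in parse_verbose(SAMPLE):
        print(element["class"], element["name"], element.get("Element Op Status"))
```

For regular monitoring the plugin's own exit code and summary line remain the authoritative result; a parser like this is only useful for ad-hoc inspection of a verbose run.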

Workaround for restricted user privileges

Create a new user with the user name monitoring in the vSphere Client.

The user appears in the user list.

CIM queries on ESXi require root privileges.[1]

To nevertheless use a dedicated user with SSH access and vCenter access disabled, follow these steps:

  1. Create a new user (e.g. named monitoring) in the vSphere Client.
  2. Connect to the ESXi system via SSH as the root user.
  3. Add the new user to the root group. To do so, edit the file /etc/group (using vi):
    root:x:0:root,monitoring
  4. Set /sbin/nologin as the login shell for the new user. This blocks SSH access for this user. Edit the file /etc/passwd and set the entry as follows:
    monitoring:x:1000:1000:ESXi User:/:/sbin/nologin
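
The two file edits in steps 3 and 4 amount to simple changes in colon-separated records. The sketch below only illustrates those line transformations on strings. It does not touch any files, and the article itself performs the edits manually with vi. The helper names add_group_member and set_login_shell are hypothetical, and the previous shell /bin/sh in the usage example is an assumption.

```python
def add_group_member(group_text, group, user):
    """Return /etc/group content with `user` appended to the members of `group`."""
    out = []
    for line in group_text.splitlines():
        fields = line.split(":")
        # /etc/group records have four fields: name:password:gid:members
        if len(fields) == 4 and fields[0] == group:
            members = [m for m in fields[3].split(",") if m]
            if user not in members:
                members.append(user)
            fields[3] = ",".join(members)
            line = ":".join(fields)
        out.append(line)
    return "\n".join(out) + "\n"


def set_login_shell(passwd_text, user, shell="/sbin/nologin"):
    """Return /etc/passwd content with the login shell of `user` replaced."""
    out = []
    for line in passwd_text.splitlines():
        fields = line.split(":")
        # /etc/passwd records have seven fields; the last one is the shell.
        if len(fields) == 7 and fields[0] == user:
            fields[6] = shell
            line = ":".join(fields)
        out.append(line)
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    print(add_group_member("root:x:0:root\n", "root", "monitoring"), end="")
    # The original shell /bin/sh is an assumed placeholder.
    print(set_login_shell("monitoring:x:1000:1000:ESXi User:/:/bin/sh\n",
                          "monitoring"), end="")
```

Both helpers leave all other records unchanged, and add_group_member does not add the user a second time if it is already listed.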

In our tests (with ESXi 6.0), this procedure allowed the sensors to be read while preventing both SSH login and access through the vSphere Client.

Please note, however, that VMware does not officially support this procedure.

Integration into Icinga

There are several ways to define an Icinga command (commands.cfg). The simplest form under Debian/Ubuntu is:

# 'check_esxi_hardware' command definition
define command{
        command_name    check_esxi_hardware
        command_line    /usr/lib/nagios/plugins/check_esxi_hardware.py -H $HOSTADDRESS$ -U $ARG1$ -P $ARG2$
        }
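
In a Nagios or Icinga 1.x service definition, arguments are passed to such a command separated by exclamation marks, for example check_esxi_hardware!user!password, and are substituted for $ARG1$ and $ARG2$. The following sketch only illustrates that substitution. The function expand_command is hypothetical, the user monitoring and the password secret are placeholders, and special cases of real macro processing (such as escaping of $ characters) are deliberately not modelled.

```python
COMMAND_LINE = ("/usr/lib/nagios/plugins/check_esxi_hardware.py "
                "-H $HOSTADDRESS$ -U $ARG1$ -P $ARG2$")


def expand_command(command_line, host_address, check_command):
    """Substitute $HOSTADDRESS$ and $ARGn$ from a 'name!arg1!arg2' check_command."""
    _name, *args = check_command.split("!")
    expanded = command_line.replace("$HOSTADDRESS$", host_address)
    for index, value in enumerate(args, start=1):
        expanded = expanded.replace(f"$ARG{index}$", value)
    return expanded


if __name__ == "__main__":
    # Placeholder credentials, not taken from the article.
    print(expand_command(COMMAND_LINE, "10.1.102.52",
                         "check_esxi_hardware!monitoring!secret"))
```

The printed line corresponds to the short-option form of the plugin call that the monitoring system would execute for the host.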

Further variants can be found here.

References

Credit

Many thanks to Sascha Peters for this valuable tip!

Author: Christoph Mitasch

Christoph Mitasch works in the Web Operations & Knowledge Transfer department at Thomas-Krenn. He is responsible for maintaining and further developing the webshop infrastructure. Since a university project on high availability and data replication under Linux, he has worked intensively on this topic. After an internship at IBM Linz, he completed his diploma studies in "Computer- und Mediensicherheit" (Computer and Media Security) at FH Hagenberg. He lives near Linz and, outside of work, is an enthusiastic marathon runner and juggler who holds several world records in team juggling.
