Uploaded image for project: 'Observium'
  1. Observium
  2. OBS-762

Alert checks against storage_perc fail when using WMI

Details

    • Bug
    • Resolution: Won't Fix
    • Major
    • None
    • None
    • Alerting, Poller
    • Debian GNU/Linux 7 (wheezy), Observium r5159, Microsoft Active Directory, (Windows Server 2008, 2008 R2, 2012)

    Description

      When enabling WMI checking for windows hosts the "storage" poller module is automatically disabled by observium poller for individual devices that support WMI when they next get polled.

      This is because WMI is now fetching the disk stats?

      We have alert triggers that watch "storage_perc gt 90" and on devices that have WMI enabled they never scan/check/populate. If we manually enable "storage" poller module in that device they will scan, but this module is auto-disabled again on next poll.

      The disk graphs all seem to be populating ok, and the application support enabled by WMI is nice so we would rather not revert back to SNMP only based monitoring.

      Attachments

        Activity

          [OBS-762] Alert checks against storage_perc fail when using WMI

          Has this since started working again? I think Mike did some stuff to the WMI poller, but it's way out of my sphere

          adama Adam Armstrong added a comment - Has this since started working again? I think Mike did some stuff to the WMI poller, but it's way out of my sphere
          slm4996 Solomon Seal added a comment - - edited

          Adam the mssql view patch I was referring to is here http://jira.observium.org/browse/OBSERVIUM-768 in case you haven't seen it yet. It doesn't collect any more data, yet, but it does display all of the data currently collected that was just sitting in the database.

          slm4996 Solomon Seal added a comment - - edited Adam the mssql view patch I was referring to is here http://jira.observium.org/browse/OBSERVIUM-768 in case you haven't seen it yet. It doesn't collect any more data, yet, but it does display all of the data currently collected that was just sitting in the database.

          Feel free to continue the work. We'll not remove the WMI poller so long as it still functions within Observium and is useful, so if you want to try to maintain it for you own use, that's fine.

          We're actually technically still Open Source, as the community edition is still licensed as a derrivative of the QPL. We just introduced a lag between the code being written and it being made available under the QPL because we needed a carrot to encourage people to pay, because, well, food and rent isn't free

          adama Adam Armstrong added a comment - Feel free to continue the work. We'll not remove the WMI poller so long as it still functions within Observium and is useful, so if you want to try to maintain it for you own use, that's fine. We're actually technically still Open Source, as the community edition is still licensed as a derrivative of the QPL. We just introduced a lag between the code being written and it being made available under the QPL because we needed a carrot to encourage people to pay, because, well, food and rent isn't free

          Well, I've already added a patch to extend mssql views for wmi. Maybe I'll take a crack at this if I have time. I know its no longer open source but its still a nice project at a nice price.

          slm4996 Solomon Seal added a comment - Well, I've already added a patch to extend mssql views for wmi. Maybe I'll take a crack at this if I have time. I know its no longer open source but its still a nice project at a nice price.

          The author of the WMI poller has vanished, and it should now be considered abandoned.

          This is why open source sucks, guys.

          adama Adam Armstrong added a comment - The author of the WMI poller has vanished, and it should now be considered abandoned. This is why open source sucks, guys.

          Here is the output of wmic from the poller ran with -d (user,password,domain have been sanitized)

          /bin/wmic --user=user --password=password --workgroup=domain --delimiter=## --namespace='root\CIMV2' -d2 //app-dc "SELECT * FROM Win32_LogicalDisk WHERE Description='Local Fixed Disk'"
          [param/loadparm.c:2462:lp_load()] lp_load: refreshing parameters from /dev/null
          [auth/gensec/gensec.c:1205:gensec_register()] gensec subsystem fake_gssapi_krb5 is disabled
          [auth/gensec/gensec.c:1205:gensec_register()] gensec subsystem gssapi_spnego is disabled
          [librpc/rpc/dcerpc_connect.c:513:continue_map_binding()] Mapped to DCERPC endpoint 135
          [auth/kerberos/kerberos_util.c:236:kinit_to_ccache()] kinit for user@ failed (Cannot contact any KDC for requested realm: unable to reach any KDC in realm )
          [auth/credentials/credentials_krb5.c:300:cli_credentials_get_client_gss_creds()] Failed to get CCACHE for GSSAPI client: Cannot contact any KDC for requested realm
          [auth/gensec/gensec.c:606:gensec_start_mech()] Failed to start GENSEC client mech gssapi_krb5: NT_STATUS_INVALID_PARAMETER
          [lib/com/dcom/main.c:1172:bind_new_pipe()] lib/com/dcom/main.c:1172: dcom_get_pipe: host=app-dc, similar=App-DC[49154]
          [wmi/wmic.c:196:main()] OK   : Login to remote object.
          [wmi/wmic.c:200:main()] OK   : WMI query execute.
          [wmi/wmic.c:203:main()] OK   : Reset result of WMI query.
          [wmi/wmic.c:212:main()] OK   : Retrieve result data.
          CLASS: Win32_LogicalDisk
          Access##Availability##BlockSize##Caption##Compressed##ConfigManagerErrorCode##ConfigManagerUserConfig##CreationClassName##Description##DeviceID##DriveType##ErrorCleared##ErrorDescription##ErrorMethodology##FileSystem##FreeSpace##InstallDate##LastErrorCode##MaximumComponentLength##MediaType##Name##NumberOfBlocks##PNPDeviceID##PowerManagementCapabilities##PowerManagementSupported##ProviderName##Purpose##QuotasDisabled##QuotasIncomplete##QuotasRebuilding##Size##Status##StatusInfo##SupportsDiskQuotas##SupportsFileBasedCompression##SystemCreationClassName##SystemName##VolumeDirty##VolumeName##VolumeSerialNumber
          0##0##0##B:##False##0##False##Win32_LogicalDisk##Local Fixed Disk##B:##3##False##(null)##(null)##NTFS##73773056##(null)##0##255##12##B:##0##(null)##NULL##False##(null)##(null)##True##False##False##104853504##(null)##0##True##True##Win32_ComputerSystem##APP-DC##False##System Reserved##5C158907
          0##0##0##C:##False##0##False##Win32_LogicalDisk##Local Fixed Disk##C:##3##False##(null)##(null)##NTFS##10302267392##(null)##0##255##12##C:##0##(null)##NULL##False##(null)##(null)##True##False##False##48210374656##(null)##0##True##True##Win32_ComputerSystem##APP-DC##False####70221236
          

          slm4996 Solomon Seal added a comment - Here is the output of wmic from the poller ran with -d (user,password,domain have been sanitized) /bin/wmic --user=user --password=password --workgroup=domain --delimiter=## --namespace='root\CIMV2' -d2 //app-dc "SELECT * FROM Win32_LogicalDisk WHERE Description='Local Fixed Disk'" [param/loadparm.c:2462:lp_load()] lp_load: refreshing parameters from /dev/null [auth/gensec/gensec.c:1205:gensec_register()] gensec subsystem fake_gssapi_krb5 is disabled [auth/gensec/gensec.c:1205:gensec_register()] gensec subsystem gssapi_spnego is disabled [librpc/rpc/dcerpc_connect.c:513:continue_map_binding()] Mapped to DCERPC endpoint 135 [auth/kerberos/kerberos_util.c:236:kinit_to_ccache()] kinit for user@ failed (Cannot contact any KDC for requested realm: unable to reach any KDC in realm ) [auth/credentials/credentials_krb5.c:300:cli_credentials_get_client_gss_creds()] Failed to get CCACHE for GSSAPI client: Cannot contact any KDC for requested realm [auth/gensec/gensec.c:606:gensec_start_mech()] Failed to start GENSEC client mech gssapi_krb5: NT_STATUS_INVALID_PARAMETER [lib/com/dcom/main.c:1172:bind_new_pipe()] lib/com/dcom/main.c:1172: dcom_get_pipe: host=app-dc, similar=App-DC[49154] [wmi/wmic.c:196:main()] OK : Login to remote object. [wmi/wmic.c:200:main()] OK : WMI query execute. [wmi/wmic.c:203:main()] OK : Reset result of WMI query. [wmi/wmic.c:212:main()] OK : Retrieve result data. CLASS: Win32_LogicalDisk Access##Availability##BlockSize##Caption##Compressed##ConfigManagerErrorCode##ConfigManagerUserConfig##CreationClassName##Description##DeviceID##DriveType##ErrorCleared##ErrorDescription##ErrorMethodology##FileSystem##FreeSpace##InstallDate##LastErrorCode##MaximumComponentLength##MediaType##Name##NumberOfBlocks##PNPDeviceID##PowerManagementCapabilities##PowerManagementSupported##ProviderName##Purpose##QuotasDisabled##QuotasIncomplete##QuotasRebuilding##Size##Status##StatusInfo##SupportsDiskQuotas##SupportsFileBasedCompression##SystemCreationClassName##SystemName##VolumeDirty##VolumeName##VolumeSerialNumber 0##0##0##B:##False##0##False##Win32_LogicalDisk##Local Fixed Disk##B:##3##False##(null)##(null)##NTFS##73773056##(null)##0##255##12##B:##0##(null)##NULL##False##(null)##(null)##True##False##False##104853504##(null)##0##True##True##Win32_ComputerSystem##APP-DC##False##System Reserved##5C158907 0##0##0##C:##False##0##False##Win32_LogicalDisk##Local Fixed Disk##C:##3##False##(null)##(null)##NTFS##10302267392##(null)##0##255##12##C:##0##(null)##NULL##False##(null)##(null)##True##False##False##48210374656##(null)##0##True##True##Win32_ComputerSystem##APP-DC##False####70221236

          The credentials being used are domain admin credentials and do return disk info when the same server is queried from CLI and tools on other servers.

          slm4996 Solomon Seal added a comment - The credentials being used are domain admin credentials and do return disk info when the same server is queried from CLI and tools on other servers.

          After more research it seems the disk graphs are not updating. I checked the database and they have remained unaltered since the alert last checked dates. WMI seems to be unable to get disk info, for now I am disabling storage via WMI until I hear from you any debugging or other steps to take.

          slm4996 Solomon Seal added a comment - After more research it seems the disk graphs are not updating. I checked the database and they have remained unaltered since the alert last checked dates. WMI seems to be unable to get disk info, for now I am disabling storage via WMI until I hear from you any debugging or other steps to take.

          Correction, the alerts do not fail, they remain "OK" just never get updated. See attached screenshot.

          slm4996 Solomon Seal added a comment - Correction, the alerts do not fail, they remain "OK" just never get updated. See attached screenshot.

          People

            adama Adam Armstrong
            slm4996 Solomon Seal
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: