Closed
Labels
bug: unexpected problem or unintended behavior
Description
Relevant telegraf.conf:
interval = "60s"
metric_version = 1
timeout = "10s"
System info:
Any OS and any Telegraf version: this bug is caused by ipmi itself.
Steps to reproduce:
We have a server with a failed PSU. We know it has failed because we saw it: it shows an amber light instead of green and sounds an alarm. But ipmi and Telegraf report its status as OK:
...
PS1 Status | 0x03 | ok
PS2 Status | 0x01 | ok
However, the 0x03 flag means "failure":
https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-sg8039en_us&docLocale=en_US
Expected behavior:
PS1 Status is bad (0)
Actual behavior:
PS1 Status is OK (1)
Additional info:
AFAIK there are no readings that combine 0x03 with a genuine "OK" status. On old motherboards, for example, 0x03 for a CPU means overheating. But maybe the flag check is needed only for PSUs.
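A minimal sketch of the flag check suggested above, not Telegraf's actual plugin code. It assumes the sensor lines have the `name | value | status` shape shown in the report, that PSU sensor names start with `PS`, and that bit 1 (`0x02`) of the hex value marks a failure. That last assumption is only inferred from the two readings here (0x03 failed, 0x01 healthy). The names `statusCode` and `psuFailureMask` are hypothetical.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// psuFailureMask is an ASSUMPTION: bit 1 (0x02) marks a detected failure,
// which is consistent with 0x03 (failed) and 0x01 (healthy) in this report.
const psuFailureMask = 0x02

// statusCode parses one "name | value | status" line and returns the sensor
// name plus 1 (healthy) or 0 (bad). Hypothetical helper for illustration.
func statusCode(line string) (string, int, error) {
	fields := strings.Split(line, "|")
	if len(fields) != 3 {
		return "", 0, fmt.Errorf("expected 3 fields, got %d", len(fields))
	}
	name := strings.TrimSpace(fields[0])
	value := strings.TrimSpace(fields[1])
	status := strings.TrimSpace(fields[2])

	// A status other than "ok" is always reported as bad.
	if !strings.EqualFold(status, "ok") {
		return name, 0, nil
	}
	// Extra check, applied only to PSU sensors with a hex flag value:
	// do not trust "ok" when the failure bit is set.
	if strings.HasPrefix(name, "PS") && strings.HasPrefix(value, "0x") {
		flags, err := strconv.ParseUint(value[2:], 16, 8)
		if err != nil {
			return name, 0, err
		}
		if flags&psuFailureMask != 0 {
			return name, 0, nil
		}
	}
	return name, 1, nil
}

func main() {
	lines := []string{"PS1 Status | 0x03 | ok", "PS2 Status | 0x01 | ok"}
	for _, l := range lines {
		name, code, err := statusCode(l)
		if err != nil {
			fmt.Println("parse error:", err)
			continue
		}
		fmt.Printf("%s -> %d\n", name, code)
	}
	// Prints:
	// PS1 Status -> 0
	// PS2 Status -> 1
}
```

Restricting the check to PSU sensors follows the closing remark above that the flag check may only be needed for PSUs. Other sensor types keep the status string as the only signal.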