Charts: How do I set Prometheus rules in the stable/prometheus chart values.yaml?
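Using the official Prometheus chart stable/prometheus, I customized its values.yaml file to set the alertmanager.yml file and the serverFiles section. Under rules: {} there is only {}. How do I write real alerting rules, like the ones here?
For example, I tried: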
serverFiles:
  alerts: {}
  rules:
    # Alert for any instance that is unreachable for >5 minutes.
    - alert: InstanceDown
      expr: up == 0
      for: 5m
      labels:
        severity: page
      annotations:
        summary: "Instance {{ $labels.instance }} down"
        description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 5 minutes."
Then I ran $ helm install my_prometheus. The pod then reported this error:
PersistentVolumeClaim is not bound: "sweet-terrier-prometheus-server"
Back-off restarting failed container
Error syncing pod
rules is for recording rules; alerts is for alerting rules.
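As a minimal sketch of that split, assuming a chart version that still uses the serverFiles keys alerts and rules, and a Prometheus 2.x server that expects the groups format, the two sections might look like this. The group names and the recorded series name job:up:avg are hypothetical.

```yaml
serverFiles:
  # Alerting rules go under "alerts".
  alerts:
    groups:
      - name: example-alerts        # hypothetical group name
        rules:
          - alert: InstanceDown
            expr: up == 0
            for: 5m
            labels:
              severity: page
            annotations:
              summary: "Instance {{ $labels.instance }} down"
  # Recording rules go under "rules".
  rules:
    groups:
      - name: example-recordings    # hypothetical group name
        rules:
          - record: job:up:avg      # hypothetical recorded series name
            expr: avg by (job) (up)
```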
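Is there a way to "inject" rules during helm install or upgrade, rather than modifying values.yaml? For example by adding the rules in a file, or somehow?
@alex did you find a solution?
@MikeBevz I haven't done this in a long time. I don't think I did, but I didn't try hard.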
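One possible approach, offered as a sketch rather than a confirmed solution from this thread: Helm accepts several -f flags and merges the files, with later files taking precedence, so the rules could live in a separate values file. The file name alert-rules.yaml, the release name my-release, and the group name are hypothetical.

```yaml
# alert-rules.yaml (hypothetical file name), kept separate from the main values.yaml.
# Assumed invocation, with my-release as a placeholder release name:
#   helm upgrade --install my-release stable/prometheus -f values.yaml -f alert-rules.yaml
serverFiles:
  alerts:
    groups:
      - name: ExtraAlerts           # hypothetical group name
        rules:
          - alert: InstanceDown
            expr: up == 0
            for: 5m
```

The rules still reach the chart as values, so this only avoids editing the main values.yaml; it does not bypass the chart's serverFiles mechanism.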
serverFiles:
  alerts:
    groups:
      - name: NodeAlerts
        rules:
          - alert: NodeCPUUsage
            expr: (100 - (avg(irate(node_cpu{mode="idle"}[5m])) BY (instance) * 100)) > 75
            for: 2m
            labels:
              severity: alert
            annotations:
              description: '{{$labels.instance}}: CPU usage is above 75% (current value is:
                {{ $value }})'
              summary: '{{$labels.instance}}: High CPU usage detect