Charts 如何在stable/prometheus chart values.yaml中设置prometheus规则？_Charts_Configuration_Yaml_Rules_Prometheus

Charts 如何在stable/prometheus chart values.yaml中设置prometheus规则？

charts configuration yaml prometheus

Charts 如何在stable/prometheus chart values.yaml中设置prometheus规则？,charts,configuration,yaml,rules,prometheus,Charts,Configuration,Yaml,Rules,Prometheus,使用官方Prometheus图表stable/Prometheus 自定义其values.yaml文件以设置alertmanager.yml文件和serverFiles区域在规则：{}：这是{}。如何像这里那样编写真正的警报规则例如，我尝试： serverFiles: alerts: {} rules: # Alert for any instance that is unreachable for >5 minutes. - alert: In

使用官方

Prometheus

图表

stable/Prometheus

自定义其

values.yaml

文件以设置

alertmanager.yml

文件和

serverFiles

区域

在

规则：{}

：

这是

{}

。如何像这里那样编写真正的警报规则

例如，我尝试：

  serverFiles:
    alerts: {}
    rules:
    # Alert for any instance that is unreachable for >5 minutes.
    - alert: InstanceDown
      expr: up == 0
      for: 5m
      labels:
        severity: page
      annotations:
        summary: "Instance {{ $labels.instance }} down"
      description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 5 minutes."

然后运行

$helm安装我的_prometheus

。然后pod得到了这个错误：

PersistentVolumeClaim is not bound: "sweet-terrier-prometheus-server"
Back-off restarting failed container
Error syncing pod

规则用于记录规则，警报用于警报规则

在头盔安装或升级过程中，有没有办法“注入”规则，而不是修改

值。yaml

？比如在文件中添加规则，或者不知何故？@alex你找到解决方案了吗？@MikeBevz我已经很长时间没有这样做了-我想我没有，但我没有努力

serverFiles:
  alerts:
    groups:
    - name: NodeAlerts
      rules:
      - alert: NodeCPUUsage
        expr: (100 - (avg(irate(node_cpu{mode="idle"}[5m])) BY (instance) * 100)) > 75
        for: 2m
        labels:
          severity: alert
        annotations:
          description: '{{$labels.instance}}: CPU usage is above 75% (current value is:
            {{ $value }})'
          summary: '{{$labels.instance}}: High CPU usage detect