How do I set Prometheus rules in the values.yaml of the stable/prometheus chart?

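I am using the official Prometheus chart, stable/prometheus, and customizing its values.yaml file to set up the alertmanager.yml file and the serverFiles section.

Under serverFiles, the default is rules: {}, that is, an empty {}. How do I write real alerting rules there, like the ones here?

For example, I tried: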

  serverFiles:
    alerts: {}
    rules:
    # Alert for any instance that is unreachable for >5 minutes.
    - alert: InstanceDown
      expr: up == 0
      for: 5m
      labels:
        severity: page
      annotations:
        summary: "Instance {{ $labels.instance }} down"
        description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 5 minutes."

Then I ran $ helm install my_prometheus, and the pod reported this error:

PersistentVolumeClaim is not bound: "sweet-terrier-prometheus-server"
Back-off restarting failed container
Error syncing pod
The rules key is for recording rules; the alerts key is for alerting rules.
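
To illustrate that split, here is a minimal sketch of a values.yaml fragment that places the InstanceDown alert from the question under alerts and a recording rule under rules. It assumes a chart version that renders serverFiles.alerts and serverFiles.rules as Prometheus 2.x rule files with a top-level groups list; the group names and the recorded metric name job:up:avg are hypothetical.

```yaml
serverFiles:
  # Alerting rules go under "alerts".
  alerts:
    groups:
    - name: InstanceAlerts            # hypothetical group name
      rules:
      # Alert for any instance that is unreachable for >5 minutes.
      - alert: InstanceDown
        expr: up == 0
        for: 5m
        labels:
          severity: page
        annotations:
          summary: "Instance {{ $labels.instance }} down"
          description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 5 minutes."
  # Recording rules go under "rules".
  rules:
    groups:
    - name: InstanceRecordings        # hypothetical group name
      rules:
      - record: job:up:avg            # hypothetical metric name
        expr: avg by (job) (up)
```

Whether the keys are named alerts and rules depends on the chart version, so it is worth checking the chart's default values.yaml before copying this.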


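Is there a way to "inject" rules during helm install or upgrade instead of modifying values.yaml? For example, by adding the rules in a separate file, or some other way?

@alex did you find a solution?

@MikeBevz I haven't worked on this in a long time. I don't think I did, but I didn't try very hard.
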
serverFiles:
  alerts:
    groups:
    - name: NodeAlerts
      rules:
      - alert: NodeCPUUsage
        expr: (100 - (avg(irate(node_cpu{mode="idle"}[5m])) BY (instance) * 100)) > 75
        for: 2m
        labels:
          severity: alert
        annotations:
          description: '{{$labels.instance}}: CPU usage is above 75% (current value is:
            {{ $value }})'
          summary: '{{$labels.instance}}: High CPU usage detect