zoukankan      html  css  js  c++  java
  • prometheus linux系统告警规则 实例

    #prometheus linux系统告警规则 实例

    根据实际情况修改参数

     #rules.linux.yml
    groups:
    - name: linux
      rules:
      - alert: Node-Down #告警名称
        expr: up == 0
        for: 1m #持续多久后发送
        labels:
          severity: warning
        annotations: #信息
          summary: "Node has been down"
          description: "has been down "
    
      - alert: "内存使用率过高"
        expr: round(100- node_memory_MemAvailable_bytes/node_memory_MemTotal_bytes*100) > 80
        for: 1m
        labels:
          severity: warning
        annotations:
          summary: "内存使用率过高"
          description: "当前使用率{{ $value }}%"
    
      - alert: "CPU使用率过高"
        expr: round(100 - ((avg by (instance,job)(irate(node_cpu_seconds_total{mode="idle",instance!~'bac-.*'}[5m]))) *100)) > 80
        for: 2m
        labels:
          severity: warning
        annotations:
          summary: "CPU使用率过高"
          description: "当前使用率{{ $value }}%"
    
      - alert: "磁盘使用率过高"
        expr: round(100-100*(node_filesystem_avail_bytes{fstype=~"ext4|xfs"} / node_filesystem_size_bytes{fstype=~"ext4|xfs"})) > 80
        for: 15s
        labels:
          severity: warning
        annotations:
          summary: "磁盘使用率过高"
          description: "当前磁盘{{$labels.mountpoint}} 使用率{{ $value }}%"
    
      - alert: "分区容量过低"
        expr: round(node_filesystem_avail_bytes{fstype=~"ext4|xfs",instance!~"testnode",mountpoint!~"/boot.*"}/1024/1024/1024) < 10
        for: 15s
        labels:
          severity: warning
        annotations:
          summary: "分区容量过低"
          description: "当前分区{{$labels.mountpoint}} 容量{{ $value }}GB"
    
      - alert: "网络流出速率过高"
        expr: round(irate(node_network_receive_bytes_total{instance!~"data.*",device!~'tap.*|veth.*|br.*|docker.*|vir.*|lo.*|vnet.*'}[1m])/1024) > 2048
        for: 1m
        labels:
          severity: warning
        annotations:
          summary: "网络流出速率过高"
          description: "当前速率{{ $value }}KB/s"
    
    
  • 相关阅读:
    ABAP ole方式对EXCEL进行操作
    ABAP 通过视图取数到内表函数
    ABAP 数值类型转换
    ABAP 供应商、工厂对应公里数维护
    deb包转换为rpm包格式
    Linux统计即时网速
    RedHat可用的几处软件源
    linux 技巧:使用 screen 管理你的远程会话
    国内常用Linux镜像站点
    telnet访问出现telnet:Unable to connect to remote host: No route to host
  • 原文地址:https://www.cnblogs.com/elvi/p/11444271.html
Copyright © 2011-2022 走看看