zoukankan      html  css  js  c++  java
  • Prometheus Node_exporter 之 Basic CPU / Mem / Disk Gauge

    1. CPU Busy :收集所有 cpu 内核 busy 状态占比


    type: Singlestat
    Unit: perent(0-100)
    (所有 cpu使用情况 - 5分钟内 cpu 空闲的平均值) / 所有 cpu使用情况
    metrics:

    (((count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu))) - avg(sum by (mode)(irate(node_cpu_seconds_total{mode='idle',instance=~"$node:$port",job=~"$job"}[5m])))) * 100) / count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu))

    最大值: 100%

    2. Used RAM Memory free -m


    type: Singlestat
    Unit: perent(0-100)
    已使用的内存占比(包括Buffer缓存和Cached缓存)
    metrics:

    ((node_memory_MemTotal_bytes{instance=~"$node:$port",job=~"$job"} - node_memory_MemFree_bytes{instance=~"$node:$port",job=~"$job"}) / (node_memory_MemTotal_bytes{instance=~"$node:$port",job=~"$job"} )) * 100


    node_memory_MemFree_bytes 空闲内存

    已使用的内存占比(不包括Buffer缓存和Cached缓存)
    metrics:

    100 - ((node_memory_MemAvailable_bytes{instance=~"$node:$port",job=~"$job"} * 100) / node_memory_MemTotal_bytes{instance=~"$node:$port",job=~"$job"})

    MemAvailable: Free + Buffers + Cached - 不可回收的部分。不可回收部分包括:共享内存段,tmpfs,ramfs等

    3. Used SWAP: 交换分区使用率


    type: Singlestat
    Unit: perent(0-100)
    metrics:

    ((node_memory_SwapTotal_bytes{instance=~"$node:$port",job=~"$job"} - node_memory_SwapFree_bytes{instance=~"$node:$port",job=~"$job"}) / (node_memory_SwapTotal_bytes{instance=~"$node:$port",job=~"$job"} )) * 100

    node_memory_SwapFree_bytes 交换分区的空闲大小

    4. Used Root FS 根文件系统使用率


    type: Singlestat
    Unit: perent(0-100)
    metrics:

    100 - ((node_filesystem_avail_bytes{instance=~"$node:$port",job=~"$job",mountpoint="/",fstype!="rootfs"} * 100) / node_filesystem_size_bytes{instance=~"$node:$port",job=~"$job",mountpoint="/",fstype!="rootfs"})

    node_filesystem_avail_bytes 文件系统可用空间

    5. CPU System Load (1m avg) 一分钟内 CPU 所有内核的平均负载率


    type: Singlestat
    Unit: perent(0-100)
    metrics:

    avg(node_load1{instance=~"$node:$port",job=~"$job"}) / count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu)) * 100

    node_load1 : 系统一分钟内的负载

    6. CPU System Load (5m avg) 五分钟内 CPU 所有内核的平均负载率


    type: Singlestat
    Unit: perent(0-100)
    metrics:

    avg(node_load5{instance=~"$node:$port",job=~"$job"}) / count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu)) * 100

    node_load5 : 指5分钟内cpu的负载

    1. CPU Busy :收集所有 cpu 内核 busy 状态占比



    type: Singlestat
    Unit: perent(0-100)
    (所有 cpu使用情况 - 5分钟内 cpu 空闲的平均值) / 所有 cpu使用情况
    metrics:

    (((count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu))) - avg(sum by (mode)(irate(node_cpu_seconds_total{mode='idle',instance=~"$node:$port",job=~"$job"}[5m])))) * 100) / count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu))


    最大值: 100%

    2. Used RAM Memory free -m



    type: Singlestat
    Unit: perent(0-100)
    已使用的内存占比(包括Buffer缓存和Cached缓存)
    metrics:

    ((node_memory_MemTotal_bytes{instance=~"$node:$port",job=~"$job"} - node_memory_MemFree_bytes{instance=~"$node:$port",job=~"$job"}) / (node_memory_MemTotal_bytes{instance=~"$node:$port",job=~"$job"} )) * 100


    node_memory_MemFree_bytes 空闲内存

    已使用的内存占比(不包括Buffer缓存和Cached缓存)
    metrics:

    100 - ((node_memory_MemAvailable_bytes{instance=~"$node:$port",job=~"$job"} * 100) / node_memory_MemTotal_bytes{instance=~"$node:$port",job=~"$job"})


    MemAvailable: Free + Buffers + Cached - 不可回收的部分。不可回收部分包括:共享内存段,tmpfs,ramfs等

    3. Used SWAP: 交换分区使用率



    type: Singlestat
    Unit: perent(0-100)
    metrics:

    ((node_memory_SwapTotal_bytes{instance=~"$node:$port",job=~"$job"} - node_memory_SwapFree_bytes{instance=~"$node:$port",job=~"$job"}) / (node_memory_SwapTotal_bytes{instance=~"$node:$port",job=~"$job"} )) * 100


    node_memory_SwapFree_bytes 交换分区的空闲大小

    4. Used Root FS 根文件系统使用率



    type: Singlestat
    Unit: perent(0-100)
    metrics:

    100 - ((node_filesystem_avail_bytes{instance=~"$node:$port",job=~"$job",mountpoint="/",fstype!="rootfs"} * 100) / node_filesystem_size_bytes{instance=~"$node:$port",job=~"$job",mountpoint="/",fstype!="rootfs"})


    node_filesystem_avail_bytes 文件系统可用空间

    5. CPU System Load (1m avg) 一分钟内 CPU 所有内核的平均负载率



    type: Singlestat
    Unit: perent(0-100)
    metrics:

    avg(node_load1{instance=~"$node:$port",job=~"$job"}) / count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu)) * 100


    node_load1 : 系统一分钟内的负载

    6. CPU System Load (5m avg) 五分钟内 CPU 所有内核的平均负载率



    type: Singlestat
    Unit: perent(0-100)
    metrics:

    avg(node_load5{instance=~"$node:$port",job=~"$job"}) / count(count(node_cpu_seconds_total{instance=~"$node:$port",job=~"$job"}) by (cpu)) * 100


    node_load5 : 指5分钟内cpu的负载

  • 相关阅读:
    Java中的static关键字解析
    Hadoop记录-metastore jmx配置
    Hadoop记录-hadoop jmx配置
    Hadoop记录-yarn ResourceManager Active频繁易主问题排查(转载)
    Hadoop记录-hive merge小文件
    Linux记录-salt命令
    Hadoop记录-Hadoop监控指标汇总
    Hadoop记录-日常运维操作
    Hadoop记录-技术网站
    Hadoop记录-Hadoop jmx
  • 原文地址:https://www.cnblogs.com/qianyuliang/p/10479553.html
Copyright © 2011-2022 走看看