zoukankan      html  css  js  c++  java
  • prometheus operator监控coredns

    1、创建service

    apiVersion: v1
    kind: Service
    metadata:
      labels:
        app: prometheus-operator-coredns
        chart: prometheus-operator-7.4.0
        heritage: Tiller
        jobLabel: coredns
        release: pharos-prometheus-operator
      name: pharos-prometheus-operator-coredns
      namespace: kube-system
    spec:
      clusterIP: None
      ports:
      - name: http-metrics
        port: 9153
        protocol: TCP
        targetPort: 9153
      selector:
        k8s-app: kube-dns
      sessionAffinity: None
      type: ClusterIP
    

     2、创建serviceMonitor

    apiVersion: monitoring.coreos.com/v1
    kind: ServiceMonitor
    metadata:
      labels:
        app: prometheus-operator-coredns
        chart: prometheus-operator-7.4.0
        heritage: Tiller
        release: pharos-prometheus-operator
      name: pharos-prometheus-operator-coredns
      namespace: cattle-prometheus
    spec:
      endpoints:
      - bearerTokenFile: /var/run/secrets/kubernetes.io/serviceaccount/token
        port: http-metrics
      jobLabel: jobLabel
      namespaceSelector:
        matchNames:
        - kube-system
      selector:
        matchLabels:
          app: prometheus-operator-coredns
          release: pharos-prometheus-operator
    

    3、创建 prometheusrules

    apiVersion: monitoring.coreos.com/v1
    kind: PrometheusRule
    metadata:
      annotations:
        prometheus-operator-validated: "true"
      labels:
        app: prometheus-operator
        release: pharos-prometheus-operator
      name: coredns-panic
      namespace: cattle-prometheus
    spec:
      groups:
      - name: coredns-panic
        rules:
        - alert: coredns-panic
          annotations:
            description: Coredns panic count is {{ $value }},please check!
          expr: sum by(instance, job, namespace) (coredns_panic_count_total) >= 10
          for: 15m
          labels:
            alertname: coredns-panic
            severity: warning
    ---
    apiVersion: monitoring.coreos.com/v1
    kind: PrometheusRule
    metadata:
      annotations:
        prometheus-operator-validated: "true"
      labels:
        app: prometheus-operator
        release: pharos-prometheus-operator
      name: coredns-responses
      namespace: cattle-prometheus
    spec:
      groups:
      - name: coredns-responses
        rules:
        - alert: coredns-responses
          annotations:
            description: Coredns has longer response time, please check !
          expr: histogram_quantile(0.99, sum by(le, job) (rate(coredns_dns_request_duration_seconds_bucket[5m])))  *
            10 >= 3
          for: 15m
          labels:
            alertname: coredns-responses
            severity: warning
    
  • 相关阅读:
    opencv计算机视觉学习笔记一
    opencv_python学习笔记十八
    opencv_python学习笔记十六
    Android-锁屏功能
    Android-加载透明PNG图片变黑的问题
    Android-AttributeSet详解
    Android-UGallery
    Android-相册效果(图片缩放 自由滑动)
    Android进阶篇-内存管理
    Android进阶篇-时间滑动控件
  • 原文地址:https://www.cnblogs.com/Wshile/p/13268510.html
Copyright © 2011-2022 走看看