zoukankan      html  css  js  c++  java
  • Flume监控之Ganglia

    4.1 Ganglia的安装与部署

    1) 安装httpd服务与php

    [FLY@hadoop102 flume]$ sudo yum -y install httpd php 

    2) 安装其他依赖

    [FLY@hadoop102 flume]$ sudo yum -y install rrdtool perl-rrdtool rrdtool-devel
    [FLY@hadoop102 flume]$ sudo yum -y install apr-devel

    3) 安装ganglia

    [FLY@hadoop102 flume]$ sudo rpm -Uvh http://dl.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm
    [FLY@hadoop102 flume]$ sudo yum -y install ganglia-gmetad 
    [FLY@hadoop102 flume]$ sudo yum -y install ganglia-web
    [FLY@hadoop102 flume]$ sudo yum install -y ganglia-gmond 

    4) 修改配置文件/etc/httpd/conf.d/ganglia.conf

    [FLY@hadoop102 flume]$ sudo vim /etc/httpd/conf.d/ganglia.conf

    # Ganglia monitoring system php web frontend

    Alias /ganglia /usr/share/ganglia

    <Location /ganglia>

      Order deny,allow

      Deny from all

      Allow from all

      # Allow from 127.0.0.1

      # Allow from ::1

      # Allow from .example.com

    </Location>

     

    5) 修改配置文件/etc/ganglia/gmetad.conf

    [FLY@hadoop102 flume]$ sudo vim /etc/ganglia/gmetad.conf
    

    修改为:

    data_source "hadoop102" 192.168.1.102
    

      

    6) 修改配置文件/etc/ganglia/gmond.conf

    [FLY@hadoop102 flume]$ sudo vim /etc/ganglia/gmond.conf 

     

    修改为:

    cluster {
      name = "hadoop102"
      owner = "unspecified"
      latlong = "unspecified"
      url = "unspecified"
    }
    udp_send_channel {
      #bind_hostname = yes # Highly recommended, soon to be default.
                           # This option tells gmond to use a source address
                           # that resolves to the machine's hostname.  Without
                           # this, the metrics may appear to come from any
                           # interface and the DNS names associated with
                           # those IPs will be used to create the RRDs.
      # mcast_join = 239.2.11.71
      host = 192.168.1.102
      port = 8649
      ttl = 1
    }
    udp_recv_channel {
      # mcast_join = 239.2.11.71
      port = 8649
      bind = 192.168.1.102
      retry_bind = true
      # Size of the UDP buffer. If you are handling lots of metrics you really
      # should bump it up to e.g. 10MB or even higher.
      # buffer = 10485760
    }

     

    7) 修改配置文件/etc/selinux/config

    [FLY@hadoop102 flume]$ sudo vim /etc/selinux/config
    

    修改为:

    # This file controls the state of SELinux on the system.
    # SELINUX= can take one of these three values:
    #     enforcing - SELinux security policy is enforced.
    #     permissive - SELinux prints warnings instead of enforcing.
    #     disabled - No SELinux policy is loaded.
    SELINUX=disabled
    # SELINUXTYPE= can take one of these two values:
    #     targeted - Targeted processes are protected,
    #     mls - Multi Level Security protection.
    SELINUXTYPE=targeted
    

     

    尖叫提示:selinux本次生效关闭必须重启,如果此时不想重启,可以临时生效之:

    [FLY@hadoop102 flume]$ sudo setenforce 0

     

    5) 启动ganglia

      

    [FLY@hadoop102 flume]$ sudo service httpd start
    [FLY@hadoop102 flume]$ sudo service gmetad start
    [FLY@hadoop102 flume]$ sudo service gmond start

     

    6) 打开网页浏览ganglia页面

    http://192.168.1.102/ganglia

    尖叫提示:如果完成以上操作依然出现权限不足错误,请修改/var/lib/ganglia目录的权限:

     [FLY@hadoop102 flume]$ sudo chmod -R 777 /var/lib/ganglia
    

    4.2 操作Flume测试监控

    1) 修改/opt/module/flume/conf目录下的flume-env.sh配置:

    JAVA_OPTS="-Dflume.monitoring.type=ganglia
    -Dflume.monitoring.hosts=192.168.1.102:8649
    -Xms100m
    -Xmx200m"

      

    2) 启动Flume任务

    [FLY@hadoop102 flume]$ bin/flume-ng agent 
    --conf conf/ 
    --name a1 
    --conf-file job/flume-telnet-logger.conf 
    -Dflume.root.logger==INFO,console 
    -Dflume.monitoring.type=ganglia 
    -Dflume.monitoring.hosts=192.168.1.102:8649

    3) 发送数据观察ganglia监测图

    [FLY@hadoop102 flume]$ telnet localhost 44444
    

     

    样式如图:

    图例说明:

    字段(图表名称)

    字段含义

    EventPutAttemptCount

    source尝试写入channel的事件总数量

    EventPutSuccessCount

    成功写入channel且提交的事件总数量

    EventTakeAttemptCount

    sink尝试从channel拉取事件的总数量。这不意味着每次事件都被返回,因为sink拉取的时候channel可能没有任何数据。

    EventTakeSuccessCount

    sink成功读取的事件的总数量

    StartTime

    channel启动的时间(毫秒)

    StopTime

    channel停止的时间(毫秒)

    ChannelSize

    目前channel中事件的总数量

    ChannelFillPercentage

    channel占用百分比

    ChannelCapacity

    channel的容量

  • 相关阅读:
    POJ1201 Intervals & TYVJ 1589 西瓜种植 差分约束
    洛谷 P3385 【模板】负环 SPFA
    Test 2018-07-19 二中集训
    《一出好戏》一中高三学子誓师大会后记
    [NOI2012] 迷失游乐园 概率 期望 基环树DP
    codeforces CF983E NN country 树上倍增
    「Luogu P3521 [POI2011]ROT-Tree Rotations」
    「高等数学学习笔记 DAY18」
    「高等数学学习笔记 DAY17」
    「高等数学学习笔记 DAY16」
  • 原文地址:https://www.cnblogs.com/tesla-turing/p/11668151.html
Copyright © 2011-2022 走看看