  • Monitoring a RabbitMQ cluster with Zabbix 3.2

    Monitoring template and scripts on GitHub: https://github.com/jasonmcintosh/rabbitmq-zabbix/tree/master/scripts/rabbitmq
    
    
    1. Copy the RabbitMQ monitoring files into the /usr/local/zabbix_agents_3.2.0/scripts directory
    
    [root@eus_pe_web02:/usr/local/zabbix_agents_3.2.0/scripts]# ls -la rabbitmq/
    total 44
    drwxr-xr-x 2 root root  4096 Oct 23 04:42 .
    drwxr-xr-x 3 root root  4096 Oct 23 04:40 ..
    -rwxr-xr-x 1 root root 13763 Oct 23 04:40 api.py
    -rwxr-xr-x 1 root root   424 Oct 23 04:40 list_rabbit_nodes.sh
    -rwxr-xr-x 1 root root   426 Oct 23 04:40 list_rabbit_queues.sh
    -rwxr-xr-x 1 root root   430 Oct 23 04:40 list_rabbit_shovels.sh
    -rw-r--r-- 1 root root   172 Oct 23 04:42 .rab.auth
    -rwxr-xr-x 1 root root   782 Oct 23 04:40 rabbitmq-status.sh
    
    2. Add the API configuration file (RabbitMQ account credentials)
    .rab.auth
    
    USERNAME=admin
    PASSWORD=admin
    CONF=/usr/local/zabbix_agents_3.2.0/conf/zabbix_agentd.conf
    LOGLEVEL=INFO
    LOGFILE=/var/log/zabbix/rabbitmq_zabbix.log
    PORT=15672
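
    The .rab.auth file is a plain list of KEY=VALUE pairs. As a minimal sketch of how such a file can be read (the function name parse_auth_file is hypothetical and not part of the upstream scripts, which presumably read the file from the shell wrappers), the following Python 3 snippet parses the same content into a dictionary:

```python
def parse_auth_file(text):
    """Parse KEY=VALUE lines into a dict, skipping blank lines and # comments."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith('#') or '=' not in line:
            continue
        # Split on the first '=' only, so values may themselves contain '='.
        key, value = line.split('=', 1)
        settings[key.strip()] = value.strip()
    return settings


# Same content as the .rab.auth file shown above.
SAMPLE = """\
USERNAME=admin
PASSWORD=admin
CONF=/usr/local/zabbix_agents_3.2.0/conf/zabbix_agentd.conf
LOGLEVEL=INFO
LOGFILE=/var/log/zabbix/rabbitmq_zabbix.log
PORT=15672
"""

settings = parse_auth_file(SAMPLE)
print(settings['USERNAME'], int(settings['PORT']))
```

    All values come back as strings, so numeric settings such as PORT need an explicit int() conversion.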
    
    3. Add the agent configuration file
    [root@eus_pe_web02:/usr/local/zabbix_agents_3.2.0/scripts]# cat ../conf/zabbix_agentd/zabbix-rabbitmq.conf 
    UserParameter=rabbitmq.discovery_queues,/usr/local/zabbix_agents_3.2.0/scripts/rabbitmq/list_rabbit_queues.sh
    UserParameter=rabbitmq.discovery_shovels,/usr/local/zabbix_agents_3.2.0/scripts/rabbitmq/list_rabbit_shovels.sh
    UserParameter=rabbitmq.discovery_nodes,/usr/local/zabbix_agents_3.2.0/scripts/rabbitmq/list_rabbit_nodes.sh
    UserParameter=rabbitmq[*],/usr/local/zabbix_agents_3.2.0/scripts/rabbitmq/rabbitmq-status.sh $1 $2 $3
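
    The last line is a flexible user parameter: the agent replaces $1, $2 and $3 with the comma-separated parameters given inside the brackets of the item key. The Python 3 sketch below imitates that substitution for illustration only. It is a simplification (it ignores quoted parameters), the function name expand_user_parameter is hypothetical, and the item key used in the example is an assumed one.

```python
import re

TEMPLATE = ('/usr/local/zabbix_agents_3.2.0/scripts/rabbitmq/'
            'rabbitmq-status.sh $1 $2 $3')


def expand_user_parameter(command_template, item_key):
    """Substitute $1..$9 in the command with the parameters of the item key.

    Simplified: parameters are split on commas; quoting is not handled.
    """
    match = re.match(r'^[^\[]+\[(.*)\]$', item_key)
    params = [p.strip() for p in match.group(1).split(',')] if match else []

    def replace(m):
        index = int(m.group(1)) - 1
        # Positions without a matching parameter expand to an empty string.
        return params[index] if index < len(params) else ''

    return re.sub(r'\$([1-9])', replace, command_template)


# Assumed example key: check type, metric name, node name.
print(expand_user_parameter(TEMPLATE, 'rabbitmq[server,message_count_total,node1]'))
```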
    
    4. Import the RabbitMQ template

    5. Restart zabbix-agent, then link the monitoring template to the monitored host
    
    # The main API script
    # cat api.py
    
    #!/usr/bin/env /usr/bin/python
    '''Python module to query the RabbitMQ Management Plugin REST API and get
    results that can then be used by Zabbix.
    https://github.com/jasonmcintosh/rabbitmq-zabbix
    '''
    from __future__ import unicode_literals
    
    import io
    import json
    import optparse
    import socket
    import urllib2
    import subprocess
    import os
    import logging
    
    
    class RabbitMQAPI(object):
        '''Class for RabbitMQ Management API'''
    
        def __init__(self, user_name='guest', password='guest', host_name='',
                     port=15672, conf='/usr/local/zabbix_agents_3.2.0/conf/zabbix_agentd.conf', senderhostname=None, protocol='http'):
            self.user_name = user_name
            self.password = password
            self.host_name = host_name or socket.gethostname()
            self.port = port
            self.conf = conf or '/usr/local/zabbix_agents_3.2.0/conf/zabbix_agentd.conf'
            self.senderhostname = senderhostname or socket.gethostname()
            self.protocol = protocol or 'http'
    
        def call_api(self, path):
            '''Call the REST API and convert the results into JSON.'''
            url = '{0}://{1}:{2}/api/{3}'.format(self.protocol, self.host_name, self.port, path)
            password_mgr = urllib2.HTTPPasswordMgrWithDefaultRealm()
            password_mgr.add_password(None, url, self.user_name, self.password)
            handler = urllib2.HTTPBasicAuthHandler(password_mgr)
            logging.debug('Issue a rabbit API call to get data on ' + path + " against " + self.host_name)
            logging.debug('Full URL:' + url)
            return json.loads(urllib2.build_opener(handler).open(url).read())
    
        def list_queues(self, filters=None):
            '''
            List all of the RabbitMQ queues, filtered against the filters provided
            in .rab.auth. See README.md for more information.
            '''
            queues = []
            if not filters:
                filters = [{}]
            for queue in self.call_api('queues'):
                logging.debug("Discovered queue " + queue['name'] + ", checking to see if it's filtered...")
                for _filter in filters:
                    check = [(x, y) for x, y in queue.items() if x in _filter]
                    shared_items = set(_filter.items()).intersection(check)
                    if len(shared_items) == len(_filter):
                        element = {'{#NODENAME}': queue['node'],
                                   '{#VHOSTNAME}': queue['vhost'],
                                   '{#QUEUENAME}': queue['name']}
                        queues.append(element)
                        logging.debug('Discovered queue '+queue['vhost']+'/'+queue['name'])
                        break
            return queues
    
        def list_shovels(self, filters=None):
            '''
            List all of the RabbitMQ shovels, filtered against the filters provided
            in .rab.auth. See README.md for more information.
            '''
            shovels = []
            if not filters:
                filters = [{}]
            try:
                for shovel in self.call_api('shovels'):
                    logging.debug("Discovered shovel " + shovel['name'] + ", checking to see if it's filtered...")
                    for _filter in filters:
                        check = [(x, y) for x, y in shovel.items() if x in _filter]
                        shared_items = set(_filter.items()).intersection(check)
                        if len(shared_items) == len(_filter):
                            element = {'{#VHOSTNAME}': shovel['vhost'],
                                       '{#SHOVELNAME}': shovel['name']}
                            shovels.append(element)
                            logging.debug('Discovered shovel '+shovel['vhost']+'/'+shovel['name'])
                            break
                return shovels
            except urllib2.HTTPError as err:
                if err.code == 404:
                    return shovels
                else:
                    raise err
    
        def list_nodes(self):
            '''Lists all rabbitMQ nodes in the cluster'''
            nodes = []
            for node in self.call_api('nodes'):
                # We need to return the node name, because Zabbix
                # does not support @ as an item parameter
                name = node['name'].split('@')[1]
                element = {'{#NODENAME}': name,
                           '{#NODETYPE}': node['type']}
                nodes.append(element)
                logging.debug('Discovered nodes '+name+'/'+node['type'])
            return nodes
    
        def check_queue(self, filters=None):
            '''Return the value for a specific item in a queue's details.'''
            return_code = 0
            if not filters:
                filters = [{}]
    
            buffer = io.StringIO()
    
            try:
                for queue in self.call_api('queues'):
                    success = False
                    logging.debug("Filtering out by " + str(filters))
                    for _filter in filters:
                        check = [(x, y) for x, y in queue.items() if x in _filter]
                        shared_items = set(_filter.items()).intersection(check)
                        if len(shared_items) == len(_filter):
                            success = True
                            break
                    if success:
                        self._prepare_data(queue, buffer)
            except urllib2.HTTPError as err:
                if err.code == 404:
                    buffer.close()
                    return return_code
                else:
                    raise err
    
            return_code = self._send_data(buffer)
            buffer.close()
            return return_code
    
        def check_shovel(self, filters=None):
            '''Return the value for a specific item in a shovel's details.'''
            return_code = 0
            if not filters:
                filters = [{}]
    
            buffer = io.StringIO()
    
            try:
                for shovel in self.call_api('shovels'):
                    success = False
                    logging.debug("Filtering out by " + str(filters))
                    for _filter in filters:
                        check = [(x, y) for x, y in shovel.items() if x in _filter]
                        shared_items = set(_filter.items()).intersection(check)
                        if len(shared_items) == len(_filter):
                            success = True
                            break
                    if success:
                        key = '"rabbitmq.shovels[{0},shovel_{1},{2}]"'
                        key = key.format(shovel['vhost'], 'state', shovel['name'])
                        value = shovel.get('state', 0)
                        logging.debug("SENDER_DATA: - %s %s" % (key,value))
                        buffer.write("- %s %s\n" % (key, value))
            except urllib2.HTTPError as err:
                if err.code == 404:
                    buffer.close()
                    return return_code
                else:
                    raise err
    
            return_code = self._send_data(buffer)
            buffer.close()
            return return_code
    
        def _prepare_data(self, queue, file):
            '''Prepare the queue data for sending'''
            for item in ['memory', 'messages', 'messages_unacknowledged',
                         'consumers']:
                key = '"rabbitmq.queues[{0},queue_{1},{2}]"'
                key = key.format(queue['vhost'], item, queue['name'])
                value = queue.get(item, 0)
                logging.debug("SENDER_DATA: - %s %s" % (key,value))
                file.write("- %s %s\n" % (key, value))
            ##  This is a non standard bit of information added after the standard items
            for item in ['deliver_get', 'publish', 'ack']:
                key = '"rabbitmq.queues[{0},queue_message_stats_{1},{2}]"'
                key = key.format(queue['vhost'], item, queue['name'])
                value = queue.get('message_stats', {}).get(item, 0)
                logging.debug("SENDER_DATA: - %s %s" % (key,value))
                file.write("- %s %s\n" % (key, value))
    
        def _send_data(self, file):
            '''Send the queue data to Zabbix.'''
            args = 'zabbix_sender -vv -c {0} -i -'
            if self.senderhostname:
                args = args + " -s " + self.senderhostname
            return_code = 0
            process = subprocess.Popen(args.format(self.conf),
                                               shell=True,
                                               stdin=subprocess.PIPE,
                                               stdout=subprocess.PIPE,
                                               stderr=subprocess.PIPE)
            out, err = process.communicate(input=file.getvalue())
            logging.debug("Finished sending data")
            return_code = process.wait()
            logging.info("Found return code of " + str(return_code))
            if return_code == 1:
                logging.error(out)
                logging.error(err)
            else:
                logging.debug(err)
                logging.debug(out)
            return return_code
    
        def check_aliveness(self):
            '''Check the aliveness status of a given vhost.'''
            return self.call_api('aliveness-test/%2f')['status']
    
        def check_server(self, item, node_name):
            '''First, check the overview specific items'''
            if item == 'message_stats_deliver_get':
              return self.call_api('overview').get('message_stats', {}).get('deliver_get_details', {}).get('rate',0)
            elif item == 'message_stats_publish':
              return self.call_api('overview').get('message_stats', {}).get('publish_details', {}).get('rate',0)
            elif item == 'message_stats_ack':
              return self.call_api('overview').get('message_stats', {}).get('ack_details', {}).get('rate',0)
            elif item == 'message_count_total':
              return self.call_api('overview').get('queue_totals', {}).get('messages',0)
            elif item == 'message_count_ready':
              return self.call_api('overview').get('queue_totals', {}).get('messages_ready',0)
            elif item == 'message_count_unacknowledged':
              return self.call_api('overview').get('queue_totals', {}).get('messages_unacknowledged',0)
            elif item == 'rabbitmq_version':
              return self.call_api('overview').get('rabbitmq_version', 'None')
            '''Return the value for a specific item in a node's details.'''
            node_name = node_name.split('.')[0]
            nodeInfo = self.call_api('nodes')
            for nodeData in nodeInfo:
                logging.debug("Checking to see if node name {0} is in {1} for item {2} found {3} nodes".format(node_name, nodeData['name'], item, len(nodeInfo)))
                if node_name in nodeData['name'] or len(nodeInfo) == 1:
                    logging.debug("Got data from node {0} of {1} ".format(node_name, nodeData.get(item)))
                    return nodeData.get(item)
            return 'Not Found'
    
    
    def main():
        '''Command-line parameters and decoding for Zabbix use/consumption.'''
        choices = ['list_queues', 'list_shovels', 'list_nodes', 'queues', 'shovels', 'check_aliveness',
                   'server']
        parser = optparse.OptionParser()
        parser.add_option('--username', help='RabbitMQ API username', default='esignnew')
        parser.add_option('--password', help='RabbitMQ API password', default='tAjFxsjEE4JXzXp')
        parser.add_option('--hostname', help='RabbitMQ API host', default=socket.gethostname())
        parser.add_option('--protocol', help='Use http or https', default='http')
        parser.add_option('--port', help='RabbitMQ API port', type='int', default=15672)
        parser.add_option('--check', type='choice', choices=choices, help='Type of check')
        parser.add_option('--metric', help='Which metric to evaluate', default='')
        parser.add_option('--filters', help='Filter used queues (see README)')
        parser.add_option('--node', help='Which node to check (valid for --check=server)')
        parser.add_option('--conf', default='/usr/local/zabbix_agents_3.2.0/conf/zabbix_agentd.conf')
        parser.add_option('--senderhostname', default='', help='Allows including a sender parameter on calls to zabbix_sender')
        parser.add_option('--logfile', help='File to log errors (defaults to /var/log/zabbix/rabbitmq_zabbix.log)', default='/var/log/zabbix/rabbitmq_zabbix.log')
        parser.add_option('--loglevel', help='Defaults to INFO', default='INFO')
        (options, args) = parser.parse_args()
        if not options.check:
            parser.error('At least one check should be specified')
        logging.basicConfig(filename=options.logfile or "/var/log/zabbix/rabbitmq_zabbix.log", level=logging.getLevelName(options.loglevel or "INFO"), format='%(asctime)s %(levelname)s: %(message)s')
    
        logging.debug("Started trying to process data")
        api = RabbitMQAPI(user_name=options.username, password=options.password,
                          host_name=options.hostname, port=options.port,
                          conf=options.conf, senderhostname=options.senderhostname,
                          protocol=options.protocol)
        if options.filters:
            try:
                filters = json.loads(options.filters)
            except ValueError:  # json.loads raises ValueError on malformed JSON
                parser.error('Invalid filters object.')
        else:
            filters = [{}]
        if not isinstance(filters, (list, tuple)):
            filters = [filters]
        if options.check == 'list_queues':
            print json.dumps({'data': api.list_queues(filters)})
        elif options.check == 'list_nodes':
            print json.dumps({'data': api.list_nodes()})
        elif options.check == 'list_shovels':
            print json.dumps({'data': api.list_shovels()})
        elif options.check == 'queues':
            print api.check_queue(filters)
        elif options.check == 'shovels':
            print api.check_shovel(filters)
        elif options.check == 'check_aliveness':
            print api.check_aliveness()
        elif options.check == 'server':
            if not options.metric:
                parser.error('Missing required parameter: "metric"')
            else:
                if options.node:
                    print api.check_server(options.metric, options.node)
                else:
                    print api.check_server(options.metric, api.host_name)
    
    if __name__ == '__main__':
        main()
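
    The core of list_queues and check_queue is the filter test: an object is kept when, for at least one filter, every key/value pair in the filter equals the corresponding field of the object. The Python 3 sketch below restates that logic in a compact form and shows the low-level discovery JSON that Zabbix receives. The sample queue data is made up for illustration, and the helper names matches and discover_queues are hypothetical.

```python
import json


def matches(item, filters):
    """True if the item satisfies at least one filter.

    An empty filter {} matches everything, mirroring the default [{}].
    """
    for _filter in filters or [{}]:
        if all(key in item and item[key] == value
               for key, value in _filter.items()):
            return True
    return False


def discover_queues(queues, filters=None):
    """Build the low-level discovery payload from a list of queue dicts."""
    return {'data': [
        {'{#NODENAME}': q['node'],
         '{#VHOSTNAME}': q['vhost'],
         '{#QUEUENAME}': q['name']}
        for q in queues if matches(q, filters)
    ]}


# Made-up sample shaped like entries returned by /api/queues.
SAMPLE_QUEUES = [
    {'name': 'orders', 'vhost': '/', 'node': 'rabbit@node1', 'durable': True},
    {'name': 'tmp', 'vhost': '/', 'node': 'rabbit@node2', 'durable': False},
]

print(json.dumps(discover_queues(SAMPLE_QUEUES, [{'durable': True}])))
```

    Narrowing the filters is one way to reduce the number of discovered items when a cluster has many queues.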

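    With the approach above, if RabbitMQ has many queues, auto-discovery creates hundreds of items and puts heavy load on the server. There is a simpler way to get the queue backlog: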

    # Get the total number of backlogged messages across the RabbitMQ queues
    UserParameter=queues_num,/usr/sbin/rabbitmqctl list_queues | grep -v delay | grep -v queues > /tmp/rabbitmq.txt && awk '{total+=$2}END{print total}' /tmp/rabbitmq.txt

    You can configure a trigger to fire when the value exceeds 30, or another threshold that suits your workload.

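    However, this method fails when queried with zabbix_get. As the error message shows, rabbitmqctl runs on the Erlang runtime, which needs the HOME environment variable, and that variable is not set when the agent runs the command: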

    [root~]# zabbix_get -s 1.1.1.1 -k 'queues_num'
    erlexec: HOME must be set
    0

    Use the following approach to collect the queue data instead:

    1. Configure a cron job

    # Fetch the RabbitMQ queues every 3 minutes
    */3 * * * * /usr/sbin/rabbitmqctl list_queues | grep -v delay | grep -v queues > /tmp/rabbitmq.txt

    2. Configure the item

    UserParameter=queues_num,awk '{total+=$2}END{print total}' /tmp/rabbitmq.txt
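
    The awk one-liner adds up the second column (the message count) of every line in the file. For anyone who wants to extend that step, here is a rough Python 3 equivalent. It is only a sketch: the function name sum_second_column is hypothetical and the sample lines are made up. Unlike the awk version, which prints an empty line for an empty file, it returns 0 in that case.

```python
def sum_second_column(lines):
    """Sum the second whitespace-separated field of each line.

    Lines with fewer than two fields or a non-numeric second field are skipped.
    """
    total = 0
    for line in lines:
        fields = line.split()
        if len(fields) >= 2 and fields[1].isdigit():
            total += int(fields[1])
    return total


# Made-up sample in the "name<TAB>messages" layout written by the cron job.
SAMPLE_LINES = ["orders\t12", "emails\t3", "audit\t0"]
print(sum_second_column(SAMPLE_LINES))

# To read the real file instead:
# with open('/tmp/rabbitmq.txt') as handle:
#     print(sum_second_column(handle))
```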

    # Sometimes rabbitmqctl returns no data. In that case, use the rabbitmqadmin tool, which takes a username and password, to fetch the details
    # cd /usr/bin/
    # wget https://raw.githubusercontent.com/rabbitmq/rabbitmq-management/v3.7.8/bin/rabbitmqadmin

    /usr/bin/rabbitmqadmin -H 1.1.1.1 -P 15672 -u rabbit -p 'pass' -f tsv -q list queues > /tmp/rabbitmq.txt && awk '{total+=$2}END{print total}' /tmp/rabbitmq.txt

    # Get the total number of backlogged messages across the RabbitMQ queues
    UserParameter=queues_num,/usr/bin/rabbitmqadmin -H 1.1.1.1 -P 15672 -u rabbit -p 'pass' -f tsv -q list queues > /tmp/rabbitmq.txt && awk '{total+=$2}END{print total}' /tmp/rabbitmq.txt

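    You can configure a trigger to fire when the value exceeds 30, or another threshold that suits your workload.

    Alternatively, collect the queue data through cron as before:

    1. Configure a cron job

    # Fetch the RabbitMQ queues every 3 minutes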
    */3 * * * * /usr/bin/rabbitmqadmin -H 172.20.73.208 -P 15672 -u rabbit -p 'pass' -f tsv -q list queues > /tmp/rabbitmq.txt

    2. Configure the item
    UserParameter=queues_num,awk '{total+=$2}END{print total}' /tmp/rabbitmq.txt
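
    rabbitmqadmin talks to the same management HTTP API that api.py uses, so the backlog can also be computed directly from /api/queues without a temporary file. The Python 3 sketch below is one possible way to do that, not the author's method: fetch_queues and total_backlog are hypothetical names, the sample data is made up, and the optional exclude_substring argument is an assumption that mimics the `grep -v delay` filter from the rabbitmqctl variant.

```python
import base64
import json
import urllib.request


def fetch_queues(host, port, user, password, protocol='http'):
    """Return the parsed JSON list from the management API's /api/queues."""
    url = '{0}://{1}:{2}/api/queues'.format(protocol, host, port)
    token = base64.b64encode('{0}:{1}'.format(user, password).encode()).decode()
    request = urllib.request.Request(
        url, headers={'Authorization': 'Basic ' + token})
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode('utf-8'))


def total_backlog(queues, exclude_substring=None):
    """Sum the 'messages' field, optionally skipping queues by name."""
    return sum(
        q.get('messages', 0) for q in queues
        if not exclude_substring or exclude_substring not in q['name'])


# Made-up sample shaped like the /api/queues response.
SAMPLE = [
    {'name': 'orders', 'messages': 12},
    {'name': 'orders.delay', 'messages': 100},
    {'name': 'idle'},
]
print(total_backlog(SAMPLE, exclude_substring='delay'))

# Against a live broker (host and credentials as in the commands above):
# print(total_backlog(fetch_queues('1.1.1.1', 15672, 'rabbit', 'pass')))
```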

  • Original article: https://www.cnblogs.com/reblue520/p/9860257.html