zoukankan      html  css  js  c++  java
  • flume kafka 配置指南

    1、官方网站也有配置:

     https://flume.apache.org/FlumeUserGuide.html#kafka-source

    2、clodera 官方配置

    https://www.cloudera.com/documentation/kafka/2-0-x/topics/kafka_flume.html

    1

     tier1.sources  = source1
     tier1.channels = channel1
     tier1.sinks = sink1
     
     tier1.sources.source1.type = org.apache.flume.source.kafka.KafkaSource
     tier1.sources.source1.zookeeperConnect = zk01.example.com:2181
     tier1.sources.source1.topic = weblogs
     tier1.sources.source1.groupId = flume
     tier1.sources.source1.channels = channel1
     tier1.sources.source1.interceptors = i1
     tier1.sources.source1.interceptors.i1.type = timestamp
     tier1.sources.source1.kafka.consumer.timeout.ms = 100
     
     tier1.channels.channel1.type = memory
     tier1.channels.channel1.capacity = 10000
     tier1.channels.channel1.transactionCapacity = 1000
     
     tier1.sinks.sink1.type = hdfs
     tier1.sinks.sink1.hdfs.path = /tmp/kafka/%{topic}/%y-%m-%d
     tier1.sinks.sink1.hdfs.rollInterval = 5
     tier1.sinks.sink1.hdfs.rollSize = 0
     tier1.sinks.sink1.hdfs.rollCount = 0
     tier1.sinks.sink1.hdfs.fileType = DataStream
     tier1.sinks.sink1.channel = channel1

    2

    tier1.sources  = source1
     tier1.channels = channel1
     tier1.sinks = sink1
     
     tier1.sources.source1.type = exec
     tier1.sources.source1.command = /usr/bin/vmstat 1
     tier1.sources.source1.channels = channel1
     
     tier1.channels.channel1.type = memory
     tier1.channels.channel1.capacity = 10000
     tier1.channels.channel1.transactionCapacity = 1000
     
     tier1.sinks.sink1.type = org.apache.flume.sink.kafka.KafkaSink
     tier1.sinks.sink1.topic = sink1
     tier1.sinks.sink1.brokerList = kafka01.example.com:9092,kafka02.example.com:9092
     tier1.sinks.sink1.channel = channel1
     tier1.sinks.sink1.batchSize = 20

    3

    tier1.sources = source1
    tier1.channels = channel1
    tier1.sinks = sink1
    
    tier1.sources.source1.type = exec
    tier1.sources.source1.command = /usr/bin/vmstat 1
    tier1.sources.source1.channels = channel1
    
    tier1.channels.channel1.type = org.apache.flume.channel.kafka.KafkaChannel
    tier1.channels.channel1.capacity = 10000
    tier1.channels.channel1.transactionCapacity = 1000
    tier1.channels.channel1.brokerList = kafka02.example.com:9092,kafka03.example.com:9092
    tier1.channels.channel1.topic = channel2
    tier1.channels.channel1.zookeeperConnect = zk01.example.com:2181
    tier1.channels.channel1.parseAsFlumeEvent = true
    
    tier1.sinks.sink1.type = hdfs
    tier1.sinks.sink1.hdfs.path = /tmp/kafka/channel
    tier1.sinks.sink1.hdfs.rollInterval = 5
    tier1.sinks.sink1.hdfs.rollSize = 0
    tier1.sinks.sink1.hdfs.rollCount = 0
    tier1.sinks.sink1.hdfs.fileType = DataStream
    tier1.sinks.sink1.channel = channel1

    总结:

     这样配置以后我们可以通过监控,或者消费kafka console打印效果

    /bin/kafka-console-consumer.sh --bootstrap-server master:9092 --topic topicTest2 --from-beginning

  • 相关阅读:
    linux基础学习-8.1-无法远程连接服务器常见原因
    schema约束文档 根元素的写法
    用双重for循环生成九九乘法表
    Random类、String类的一些常用方法
    Random()方法结合Scanner类实现猜数游戏
    使用System类和Date类来计算自己从出生到现在度过了多少时间
    获取map集合中键和值的三种方式
    斗地主实现洗牌发牌功能
    集合框架<一>
    动手动脑2
  • 原文地址:https://www.cnblogs.com/cbugs/p/9672462.html
Copyright © 2011-2022 走看看