zoukankan      html  css  js  c++  java
  • flume kafka 配置指南

    1、官方网站也有配置:

     https://flume.apache.org/FlumeUserGuide.html#kafka-source

    2、clodera 官方配置

    https://www.cloudera.com/documentation/kafka/2-0-x/topics/kafka_flume.html

    1

     tier1.sources  = source1
     tier1.channels = channel1
     tier1.sinks = sink1
     
     tier1.sources.source1.type = org.apache.flume.source.kafka.KafkaSource
     tier1.sources.source1.zookeeperConnect = zk01.example.com:2181
     tier1.sources.source1.topic = weblogs
     tier1.sources.source1.groupId = flume
     tier1.sources.source1.channels = channel1
     tier1.sources.source1.interceptors = i1
     tier1.sources.source1.interceptors.i1.type = timestamp
     tier1.sources.source1.kafka.consumer.timeout.ms = 100
     
     tier1.channels.channel1.type = memory
     tier1.channels.channel1.capacity = 10000
     tier1.channels.channel1.transactionCapacity = 1000
     
     tier1.sinks.sink1.type = hdfs
     tier1.sinks.sink1.hdfs.path = /tmp/kafka/%{topic}/%y-%m-%d
     tier1.sinks.sink1.hdfs.rollInterval = 5
     tier1.sinks.sink1.hdfs.rollSize = 0
     tier1.sinks.sink1.hdfs.rollCount = 0
     tier1.sinks.sink1.hdfs.fileType = DataStream
     tier1.sinks.sink1.channel = channel1

    2

    tier1.sources  = source1
     tier1.channels = channel1
     tier1.sinks = sink1
     
     tier1.sources.source1.type = exec
     tier1.sources.source1.command = /usr/bin/vmstat 1
     tier1.sources.source1.channels = channel1
     
     tier1.channels.channel1.type = memory
     tier1.channels.channel1.capacity = 10000
     tier1.channels.channel1.transactionCapacity = 1000
     
     tier1.sinks.sink1.type = org.apache.flume.sink.kafka.KafkaSink
     tier1.sinks.sink1.topic = sink1
     tier1.sinks.sink1.brokerList = kafka01.example.com:9092,kafka02.example.com:9092
     tier1.sinks.sink1.channel = channel1
     tier1.sinks.sink1.batchSize = 20

    3

    tier1.sources = source1
    tier1.channels = channel1
    tier1.sinks = sink1
    
    tier1.sources.source1.type = exec
    tier1.sources.source1.command = /usr/bin/vmstat 1
    tier1.sources.source1.channels = channel1
    
    tier1.channels.channel1.type = org.apache.flume.channel.kafka.KafkaChannel
    tier1.channels.channel1.capacity = 10000
    tier1.channels.channel1.transactionCapacity = 1000
    tier1.channels.channel1.brokerList = kafka02.example.com:9092,kafka03.example.com:9092
    tier1.channels.channel1.topic = channel2
    tier1.channels.channel1.zookeeperConnect = zk01.example.com:2181
    tier1.channels.channel1.parseAsFlumeEvent = true
    
    tier1.sinks.sink1.type = hdfs
    tier1.sinks.sink1.hdfs.path = /tmp/kafka/channel
    tier1.sinks.sink1.hdfs.rollInterval = 5
    tier1.sinks.sink1.hdfs.rollSize = 0
    tier1.sinks.sink1.hdfs.rollCount = 0
    tier1.sinks.sink1.hdfs.fileType = DataStream
    tier1.sinks.sink1.channel = channel1

    总结:

     这样配置以后我们可以通过监控,或者消费kafka console打印效果

    /bin/kafka-console-consumer.sh --bootstrap-server master:9092 --topic topicTest2 --from-beginning

  • 相关阅读:
    poj3411
    2241 排序二叉树
    1004 四子连棋
    Poj1482
    poj2046
    Poj3087
    poj3414
    php使用flock堵塞写入文件和非堵塞写入文件
    HTML样式以及使用
    高效程序猿的狂暴之路
  • 原文地址:https://www.cnblogs.com/cbugs/p/9672462.html
Copyright © 2011-2022 走看看