zoukankan      html  css  js  c++  java
  • 使用kafka作为生产者生产数据到hdfs(单节点)

    关键:查看kafka官网的userguide

    agent.sources = kafkaSource
    agent.channels = memoryChannel
    agent.sinks = hdfsSink

    agent.sources.kafkaSource.type = org.apache.flume.source.kafka.KafkaSource
    agent.sources.kafkaSource.zookeeperConnect = 192.168.57.11:2181
    agent.sources.kafkaSource.topic = test_pan
    agent.sources.kafkaSource.groupId = test-consumer-group
    agent.sources.kafkaSource.kafka.consumer.timeout.ms = 100

    agent.channels.memoryChannel.type = memory
    agent.channels.memoryChannel.capacity=100
    agent.channels.memoryChannel.transactionCapacity=100

    agent.sinks.hdfsSink.type = hdfs
    agent.sinks.hdfsSink.hdfs.path = hdfs://beicai/test/pan
    agent.sinks.hdfsSink.hdfs.writeFormat = Text
    agent.sinks.hdfsSink.hdfs.fileType = DataStream


    agent.sinks.hdfsSink.hdfs.rollSize = 1024
    agent.sinks.hdfsSink.hdfs.rollCount = 0
    agent.sinks.hdfsSink.hdfs.rollInterval = 60

    agent.sinks.hdfsSink.hdfs.filePrefix=test
    agent.sinks.hdfsSink.hdfs.fileSuffix=.data

    agent.sinks.hdfsSink.hdfs.inUserPrefix=_
    agent.sinks.hdfsSink.hdfs.inUserSuffix=
    agent.sinks.hdfsSink.hdfs.fileType = DataStream
    agent.sinks.hdfsSink.hdfs.writeFormat = TEXT
    agent.sinks.hdfsSink.hdfs.rollInterval = 1
    agent.sinks.sink1.hdfs.filePrefix =A

    agent.sources.kafkaSource.channels = memoryChannel
    agent.sinks.hdfsSink.channel = memoryChannel

    成就人
  • 相关阅读:
    springBoot 与 springMVC的区别
    spring的IOC和AOP
    实现同步的三种方法
    台阶积水问题
    requsets模块和beautifulsoup模块
    爬虫
    rabbitMQ 消息队列
    Django框架
    mysql
    jQuery
  • 原文地址:https://www.cnblogs.com/pingzizhuanshu/p/9102602.html
Copyright © 2011-2022 走看看