  • PySpark Kafka: the difference between createDirectStream and createStream

    from pyspark.streaming.kafka import KafkaUtils

    # Receiver-based approach: connects through ZooKeeper
    kafkaStream = KafkaUtils.createStream(streamingContext,
        [ZK quorum], [consumer group id], [per-topic number of Kafka partitions to consume])

    # Direct approach: connects to the Kafka brokers themselves
    directKafkaStream = KafkaUtils.createDirectStream(ssc, [topic], {"metadata.broker.list": brokers})

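    From the caller's side, the difference is in the arguments. createStream takes a ZK quorum, that is, the ZooKeeper address (port 2181 by default), while createDirectStream takes the address of the Kafka broker process (port 9092 by default).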
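    As a minimal sketch of how the two argument sets differ, the helpers below build the arguments for each call. The helper names (receiver_args, direct_args, build_streams), the group id "example-group", and the localhost addresses are assumptions made for illustration; the KafkaUtils import assumes the Kafka 0.8 integration for Spark 2.x used elsewhere in this post.

```python
def receiver_args(zk_quorum, group_id, topic, num_partitions=1):
    """Arguments for KafkaUtils.createStream (hypothetical helper).

    createStream expects the ZooKeeper address, a consumer group id,
    and a dict mapping each topic to the number of partitions to consume.
    """
    return (zk_quorum, group_id, {topic: num_partitions})


def direct_args(brokers, topic):
    """Arguments for KafkaUtils.createDirectStream (hypothetical helper).

    createDirectStream expects a list of topics and a Kafka parameter
    dict that names the brokers directly.
    """
    return ([topic], {"metadata.broker.list": brokers})


def build_streams(ssc, zk_quorum="localhost:2181",
                  brokers="localhost:9092", topic="test"):
    """Create one stream of each kind on an existing StreamingContext."""
    # Imported lazily so the helpers above work without Spark installed.
    from pyspark.streaming.kafka import KafkaUtils

    # Receiver-based stream: talks to ZooKeeper (assumed on port 2181).
    zk, group, topics = receiver_args(zk_quorum, "example-group", topic)
    receiver_stream = KafkaUtils.createStream(ssc, zk, group, topics)

    # Direct stream: talks to the Kafka broker (assumed on port 9092).
    topic_list, kafka_params = direct_args(brokers, topic)
    direct_stream = KafkaUtils.createDirectStream(ssc, topic_list, kafka_params)

    return receiver_stream, direct_stream
```

    Passing the ZooKeeper address to createDirectStream, or the broker address to createStream, is the mistake this split is meant to make harder.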

    On this machine, the Kafka process has PID 9300 and listens on port 9092.

    QuorumPeerMain is the corresponding ZooKeeper instance; it has PID 6379 and listens on port 2181.
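    To confirm that both services are reachable before submitting a job, a quick check along these lines may help. It is only a sketch: is_listening is a hypothetical helper, and localhost with ports 9092 and 2181 is assumed from the setup described here.

```python
import socket


def is_listening(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # Assumed addresses: Kafka broker on 9092, ZooKeeper on 2181.
    for name, port in (("Kafka broker", 9092), ("ZooKeeper", 2181)):
        state = "listening" if is_listening("localhost", port) else "not reachable"
        print(f"{name} on localhost:{port}: {state}")
```

    This only shows that something accepts TCP connections on the port, not that it is actually Kafka or ZooKeeper.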

    So when running the official examples, the direct-stream example is given the Kafka broker address:

    ./bin/spark-submit --jars ~/spark-streaming-kafka-0-8-assembly_2.11-2.2.0.jar examples/src/main/python/streaming/direct_kafka_wordcount.py localhost:9092 test

    The other, the receiver-based example (kafka_wordcount.py), is given the ZooKeeper address:

    ./bin/spark-submit --jars ~/spark-streaming-kafka-0-8-assembly_2.11-2.2.0.jar examples/src/main/python/streaming/kafka_wordcount.py localhost:2181 test

    References:

    https://spark.apache.org/docs/1.6.1/streaming-kafka-integration.html

    http://zhangfengzhe.blog.51cto.com/8855103/1556650

  • Original article: https://www.cnblogs.com/bonelee/p/7443026.html