  • A Tour of Spark Streaming

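    1. Open spark-shell (when running locally, use a master with at least two threads, e.g. local[2], because the socket receiver occupies one of them)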

    2. Create a StreamingContext
    import org.apache.spark.streaming._
    // Only needed before Spark 1.3, where it brings in pair operations such as reduceByKey
    import org.apache.spark.streaming.StreamingContext._
    // Reuse the shell's existing SparkContext (sc) and process the stream in 1-second batches
    val ssc = new StreamingContext(sc, Seconds(1))

    3. Create a DStream from the StreamingContext

    val lines = ssc.socketTextStream("localhost", 9999)

    4. Define the computation

    // Split each line into words
    val words = lines.flatMap(_.split(" "))
    
    // Count each word in each batch
    val pairs = words.map(word => (word, 1))
    val wordCounts = pairs.reduceByKey(_ + _)
    
    // Print a few of the counts to the console
    wordCounts.print()
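
    To see what a single batch produces, the same flatMap / map / reduceByKey steps can be modeled on plain Scala collections, with no Spark involved. This is only a sketch: countWords is a hypothetical helper name, not a Spark API, and it treats one Seq[String] as the lines received within one 1-second batch.

```scala
// Hypothetical helper (not part of Spark): models the word count for ONE batch
// using plain Scala collections instead of a DStream.
def countWords(batch: Seq[String]): Map[String, Int] =
  batch
    .flatMap(_.split(" "))    // split each line into words, as the stream does
    .map(word => (word, 1))   // pair each word with a count of 1
    .groupBy(_._1)            // gather the pairs that share a word
    .map { case (word, pairs) => (word, pairs.map(_._2).sum) } // add up the 1s per word

// The first sample line from step 7, treated as a batch of its own.
val sample = Seq("hello world how are you hello are you there")
val counts = countWords(sample)

// Sorted rendering, for easy comparison with what the console shows.
val rendered = counts.toSeq.sortBy(_._1).mkString(" ")
// rendered == "(are,2) (hello,2) (how,1) (there,1) (world,1) (you,2)"
```

    Lines typed in different seconds fall into different batches, so each batch is counted on its own rather than accumulated. Note also that split(" ") yields empty-string tokens when words are separated by more than one space.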

    5. Start the stream

    ssc.start()             // Start the computation
    ssc.awaitTermination()  // Wait for the computation to terminate
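
    Outside spark-shell there is no predefined sc, so the context has to be built from a SparkConf. The following is a minimal sketch of steps 2 to 5 as a standalone program, assuming Spark 1.3 or later is on the classpath (so the StreamingContext._ import is not required). The object name NetworkWordCount and the application name are illustrative choices, and local[2] is an assumption for a local run, chosen because the socket receiver occupies one thread.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

// Illustrative object name; assumes Spark 1.3+ is available on the classpath.
object NetworkWordCount {
  def main(args: Array[String]): Unit = {
    // Two local threads: one for the socket receiver, one for processing.
    val conf = new SparkConf().setMaster("local[2]").setAppName("NetworkWordCount")
    val ssc = new StreamingContext(conf, Seconds(1)) // 1-second batches

    // Same pipeline as in the shell session: lines -> words -> (word, 1) -> counts.
    val lines = ssc.socketTextStream("localhost", 9999)
    val wordCounts = lines
      .flatMap(_.split(" "))
      .map(word => (word, 1))
      .reduceByKey(_ + _)

    wordCounts.print()      // print a few counts from each batch

    ssc.start()             // start the computation
    ssc.awaitTermination()  // block until the computation is stopped
  }
}
```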

    6. Open a new terminal and run the following command

    $ nc -lk 9999

    7. Type any sentences into the new terminal and watch the output in the spark-shell window

    hello world how are you hello are you there

    you are the son of a bitch

  • Original article: https://www.cnblogs.com/littlesuccess/p/3794225.html