zoukankan      html  css  js  c++  java
  • 基于IDEA环境下的Spark2.X程序开发

     

    我们选择在线安装

    这个是windows下的scala,直接双击安装就可以了

    安装好之后可以验证一下

     

    这个是我本地的jdk1.8安装包,直接双击安装

    安装完成后可以验证一下

    https://archive.apache.org/dist/maven/maven-3/3.3.9/binaries/

     

    解压

     

     我的本地是win10系统

     

    配置好环境变量我们可以验证一下

    修改这个文件

    这个是默认的

    改成这样子

    把本地的maven配置进来

     

     

    接下来就是等待自动把相应的架包下载下来

     

     

    把scala添加进来了

     接下来我们创建目录

     

     

    在scala目录下建包

     

     在这个包里面创建一个scala的类

    输入以下代码

    配置maven的 pom.xml文件

    <project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
    <modelVersion>4.0.0</modelVersion>
    <groupId>com.spark</groupId>
    <artifactId>sparkStu</artifactId>
    <packaging>war</packaging>
    <version>1.0-SNAPSHOT</version>
    <name>sparkStu Maven Webapp</name>
    <url>http://maven.apache.org</url>

    <properties>
    <hadoop.version>2.6.0</hadoop.version>
    <scala.binary.version>2.11</scala.binary.version>
    <spark.version>2.2.0</spark.version>
    </properties>

    <dependencies>
    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_${scala.binary.version}</artifactId>
    <version>${spark.version}</version>
    </dependency>

    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-sql_${scala.binary.version}</artifactId>
    <version>${spark.version}</version>
    </dependency>

    <!--
    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_2.11</artifactId>
    <version>2.2.0</version>
    </dependency>
    -->
    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-streaming_${scala.binary.version}</artifactId>
    <version>${spark.version}</version>
    </dependency>

    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-hive_${scala.binary.version}</artifactId>
    <version>${spark.version}</version>
    </dependency>

    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-streaming-kafka-0-10_${scala.binary.version}</artifactId>
    <version>${spark.version}</version>
    </dependency>

    <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-sql-kafka-0-10_${scala.binary.version}</artifactId>
    <version>${spark.version}</version>
    </dependency>

    <dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-client</artifactId>
    <version>${hadoop.version}</version>
    </dependency>

    <dependency>
    <groupId>junit</groupId>
    <artifactId>junit</artifactId>
    <version>3.8.1</version>
    <scope>test</scope>
    </dependency>



    </dependencies>
    <build>
    <finalName>sparkStu</finalName>
    </build>
    </project>

     在Test.scala里加上这段内容

     我们编写一个简单的代码

    package com.spark.test
    
    import org.apache.spark.sql.SparkSession
    
    object Test {
    
      def main(args: Array[String]): Unit = {
       val spark= SparkSession
           .builder
             .appName("HdfsTest")
               .getOrCreate()
         val filePart = "E://Mycode/datas/stu.txt"
         val rdd= spark.sparkContext.textFile(filePart)
    
         val lines= rdd.flatMap(x => x.split(" ")).collect().toList
         println(lines)
      }
    }

     运行一下

     结果报错了

    E:softwarejdk1.8injava "-javaagent:E:softwareIDEAIntelliJ IDEA 2017.2.6libidea_rt.jar=59010:E:softwareIDEAIntelliJ IDEA 2017.2.6in" -Dfile.encoding=UTF-8 -classpath E:softwarejdk1.8jrelibcharsets.jar;E:softwarejdk1.8jrelibdeploy.jar;E:softwarejdk1.8jrelibextaccess-bridge-64.jar;E:softwarejdk1.8jrelibextcldrdata.jar;E:softwarejdk1.8jrelibextdnsns.jar;E:softwarejdk1.8jrelibextjaccess.jar;E:softwarejdk1.8jrelibextjfxrt.jar;E:softwarejdk1.8jrelibextlocaledata.jar;E:softwarejdk1.8jrelibext
    ashorn.jar;E:softwarejdk1.8jrelibextsunec.jar;E:softwarejdk1.8jrelibextsunjce_provider.jar;E:softwarejdk1.8jrelibextsunmscapi.jar;E:softwarejdk1.8jrelibextsunpkcs11.jar;E:softwarejdk1.8jrelibextzipfs.jar;E:softwarejdk1.8jrelibjavaws.jar;E:softwarejdk1.8jrelibjce.jar;E:softwarejdk1.8jrelibjfr.jar;E:softwarejdk1.8jrelibjfxswt.jar;E:softwarejdk1.8jrelibjsse.jar;E:softwarejdk1.8jrelibmanagement-agent.jar;E:softwarejdk1.8jrelibplugin.jar;E:softwarejdk1.8jrelib
    esources.jar;E:softwarejdk1.8jrelib
    t.jar;E:MycodeSparkStu	argetclasses;E:softwareScalalibscala-actors-2.11.0.jar;E:softwareScalalibscala-actors-migration_2.11-1.1.0.jar;E:softwareScalalibscala-library.jar;E:softwareScalalibscala-parser-combinators_2.11-1.0.4.jar;E:softwareScalalibscala-reflect.jar;E:softwareScalalibscala-swing_2.11-1.0.2.jar;E:softwareScalalibscala-xml_2.11-1.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-core_2.112.2.0spark-core_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro1.7.7avro-1.7.7.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-core-asl1.9.13jackson-core-asl-1.9.13.jar;E:softwaremaven3.3.9
    epositorycom	houghtworksparanamerparanamer2.3paranamer-2.3.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-compress1.4.1commons-compress-1.4.1.jar;E:softwaremaven3.3.9
    epositoryorg	ukaanixz1.0xz-1.0.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-mapred1.7.7avro-mapred-1.7.7-hadoop2.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-ipc1.7.7avro-ipc-1.7.7.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-ipc1.7.7avro-ipc-1.7.7-tests.jar;E:softwaremaven3.3.9
    epositorycom	witterchill_2.110.8.0chill_2.11-0.8.0.jar;E:softwaremaven3.3.9
    epositorycomesotericsoftwarekryo-shaded3.0.3kryo-shaded-3.0.3.jar;E:softwaremaven3.3.9
    epositorycomesotericsoftwareminlog1.3.0minlog-1.3.0.jar;E:softwaremaven3.3.9
    epositoryorgobjenesisobjenesis2.1objenesis-2.1.jar;E:softwaremaven3.3.9
    epositorycom	witterchill-java0.8.0chill-java-0.8.0.jar;E:softwaremaven3.3.9
    epositoryorgapachexbeanxbean-asm5-shaded4.4xbean-asm5-shaded-4.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-launcher_2.112.2.0spark-launcher_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-network-common_2.112.2.0spark-network-common_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgfusesourceleveldbjnileveldbjni-all1.8leveldbjni-all-1.8.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-annotations2.6.5jackson-annotations-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-network-shuffle_2.112.2.0spark-network-shuffle_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-unsafe_2.112.2.0spark-unsafe_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epository
    etjavadevjets3tjets3t0.9.3jets3t-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapachehttpcomponentshttpcore4.3.3httpcore-4.3.3.jar;E:softwaremaven3.3.9
    epositoryjavaxactivationactivation1.1.1activation-1.1.1.jar;E:softwaremaven3.3.9
    epositorymx4jmx4j3.0.2mx4j-3.0.2.jar;E:softwaremaven3.3.9
    epositoryjavaxmailmail1.4.7mail-1.4.7.jar;E:softwaremaven3.3.9
    epositoryorgouncycastlecprov-jdk15on1.51cprov-jdk15on-1.51.jar;E:softwaremaven3.3.9
    epositorycomjamesmurtyutilsjava-xmlbuilder1.0java-xmlbuilder-1.0.jar;E:softwaremaven3.3.9
    epository
    etiharderase642.3.8ase64-2.3.8.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-recipes2.6.0curator-recipes-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-framework2.6.0curator-framework-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachezookeeperzookeeper3.4.6zookeeper-3.4.6.jar;E:softwaremaven3.3.9
    epositorycomgoogleguavaguava16.0.1guava-16.0.1.jar;E:softwaremaven3.3.9
    epositoryjavaxservletjavax.servlet-api3.1.0javax.servlet-api-3.1.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-lang33.5commons-lang3-3.5.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-math33.4.1commons-math3-3.4.1.jar;E:softwaremaven3.3.9
    epositorycomgooglecodefindbugsjsr3051.3.9jsr305-1.3.9.jar;E:softwaremaven3.3.9
    epositoryorgslf4jslf4j-api1.7.16slf4j-api-1.7.16.jar;E:softwaremaven3.3.9
    epositoryorgslf4jjul-to-slf4j1.7.16jul-to-slf4j-1.7.16.jar;E:softwaremaven3.3.9
    epositoryorgslf4jjcl-over-slf4j1.7.16jcl-over-slf4j-1.7.16.jar;E:softwaremaven3.3.9
    epositorylog4jlog4j1.2.17log4j-1.2.17.jar;E:softwaremaven3.3.9
    epositoryorgslf4jslf4j-log4j121.7.16slf4j-log4j12-1.7.16.jar;E:softwaremaven3.3.9
    epositorycom
    ingcompress-lzf1.0.3compress-lzf-1.0.3.jar;E:softwaremaven3.3.9
    epositoryorgxerialsnappysnappy-java1.1.2.6snappy-java-1.1.2.6.jar;E:softwaremaven3.3.9
    epository
    etjpountzlz4lz41.3.0lz4-1.3.0.jar;E:softwaremaven3.3.9
    epositoryorg
    oaringbitmapRoaringBitmap0.5.11RoaringBitmap-0.5.11.jar;E:softwaremaven3.3.9
    epositorycommons-netcommons-net2.2commons-net-2.2.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-library2.11.8scala-library-2.11.8.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-jackson_2.113.2.11json4s-jackson_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-core_2.113.2.11json4s-core_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-ast_2.113.2.11json4s-ast_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscalap2.11.0scalap-2.11.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-compiler2.11.0scala-compiler-2.11.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langmodulesscala-xml_2.111.0.1scala-xml_2.11-1.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-client2.22.2jersey-client-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxws
    sjavax.ws.rs-api2.0.1javax.ws.rs-api-2.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-api2.4.0-b34hk2-api-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-utils2.4.0-b34hk2-utils-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2externalaopalliance-repackaged2.4.0-b34aopalliance-repackaged-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2externaljavax.inject2.4.0-b34javax.inject-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-locator2.4.0-b34hk2-locator-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgjavassistjavassist3.18.1-GAjavassist-3.18.1-GA.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-common2.22.2jersey-common-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxannotationjavax.annotation-api1.2javax.annotation-api-1.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseyundles
    epackagedjersey-guava2.22.2jersey-guava-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2osgi-resource-locator1.0.1osgi-resource-locator-1.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-server2.22.2jersey-server-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseymediajersey-media-jaxb2.22.2jersey-media-jaxb-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxvalidationvalidation-api1.1.0.Finalvalidation-api-1.1.0.Final.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycontainersjersey-container-servlet2.22.2jersey-container-servlet-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycontainersjersey-container-servlet-core2.22.2jersey-container-servlet-core-2.22.2.jar;E:softwaremaven3.3.9
    epositoryio
    etty
    etty-all4.0.43.Final
    etty-all-4.0.43.Final.jar;E:softwaremaven3.3.9
    epositoryio
    etty
    etty3.9.9.Final
    etty-3.9.9.Final.jar;E:softwaremaven3.3.9
    epositorycomclearspringanalyticsstream2.7.0stream-2.7.0.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-core3.1.2metrics-core-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-jvm3.1.2metrics-jvm-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-json3.1.2metrics-json-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-graphite3.1.2metrics-graphite-3.1.2.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-databind2.6.5jackson-databind-2.6.5.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-core2.6.5jackson-core-2.6.5.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksonmodulejackson-module-scala_2.112.6.5jackson-module-scala_2.11-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-reflect2.11.7scala-reflect-2.11.7.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksonmodulejackson-module-paranamer2.6.5jackson-module-paranamer-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgapacheivyivy2.4.0ivy-2.4.0.jar;E:softwaremaven3.3.9
    epositoryorooro2.0.8oro-2.0.8.jar;E:softwaremaven3.3.9
    epository
    et
    azorvinepyrolite4.13pyrolite-4.13.jar;E:softwaremaven3.3.9
    epository
    etsfpy4jpy4j0.10.4py4j-0.10.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-tags_2.112.2.0spark-tags_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-crypto1.0.0commons-crypto-1.0.0.jar;E:softwaremaven3.3.9
    epositoryorgspark-projectsparkunused1.0.0unused-1.0.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sql_2.112.2.0spark-sql_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositorycomunivocityunivocity-parsers2.2.1univocity-parsers-2.2.1.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sketch_2.112.2.0spark-sketch_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-catalyst_2.112.2.0spark-catalyst_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjaninojanino3.0.0janino-3.0.0.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjaninocommons-compiler3.0.0commons-compiler-3.0.0.jar;E:softwaremaven3.3.9
    epositoryorgantlrantlr4-runtime4.5.3antlr4-runtime-4.5.3.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-column1.8.2parquet-column-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-common1.8.2parquet-common-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-encoding1.8.2parquet-encoding-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-hadoop1.8.2parquet-hadoop-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-format2.3.1parquet-format-2.3.1.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-jackson1.8.2parquet-jackson-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-streaming_2.112.2.0spark-streaming_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-hive_2.112.2.0spark-hive_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositorycom	witterparquet-hadoop-bundle1.6.0parquet-hadoop-bundle-1.6.0.jar;E:softwaremaven3.3.9
    epositoryorgspark-projecthivehive-exec1.2.1.spark2hive-exec-1.2.1.spark2.jar;E:softwaremaven3.3.9
    epositorycommons-iocommons-io2.4commons-io-2.4.jar;E:softwaremaven3.3.9
    epositorycommons-langcommons-lang2.6commons-lang-2.6.jar;E:softwaremaven3.3.9
    epositoryjavolutionjavolution5.5.1javolution-5.5.1.jar;E:softwaremaven3.3.9
    epositorylog4japache-log4j-extras1.2.17apache-log4j-extras-1.2.17.jar;E:softwaremaven3.3.9
    epositoryorgantlrantlr-runtime3.4antlr-runtime-3.4.jar;E:softwaremaven3.3.9
    epositoryorgantlrstringtemplate3.2.1stringtemplate-3.2.1.jar;E:softwaremaven3.3.9
    epositoryantlrantlr2.7.7antlr-2.7.7.jar;E:softwaremaven3.3.9
    epositoryorgantlrST44.0.4ST4-4.0.4.jar;E:softwaremaven3.3.9
    epositorycomgooglecodejavaewahJavaEWAH0.3.2JavaEWAH-0.3.2.jar;E:softwaremaven3.3.9
    epositoryorgiq80snappysnappy0.2snappy-0.2.jar;E:softwaremaven3.3.9
    epositorystaxstax-api1.0.1stax-api-1.0.1.jar;E:softwaremaven3.3.9
    epository
    etsfopencsvopencsv2.3opencsv-2.3.jar;E:softwaremaven3.3.9
    epositoryorgspark-projecthivehive-metastore1.2.1.spark2hive-metastore-1.2.1.spark2.jar;E:softwaremaven3.3.9
    epositorycomjolboxonecp0.8.0.RELEASEonecp-0.8.0.RELEASE.jar;E:softwaremaven3.3.9
    epositorycommons-clicommons-cli1.2commons-cli-1.2.jar;E:softwaremaven3.3.9
    epositorycommons-loggingcommons-logging1.1.3commons-logging-1.1.3.jar;E:softwaremaven3.3.9
    epositoryorgapachederbyderby10.10.2.0derby-10.10.2.0.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-api-jdo3.2.6datanucleus-api-jdo-3.2.6.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-rdbms3.2.9datanucleus-rdbms-3.2.9.jar;E:softwaremaven3.3.9
    epositorycommons-poolcommons-pool1.5.4commons-pool-1.5.4.jar;E:softwaremaven3.3.9
    epositorycommons-dbcpcommons-dbcp1.4commons-dbcp-1.4.jar;E:softwaremaven3.3.9
    epositoryjavaxjdojdo-api3.0.1jdo-api-3.0.1.jar;E:softwaremaven3.3.9
    epositoryjavax	ransactionjta1.1jta-1.1.jar;E:softwaremaven3.3.9
    epositorycommons-httpclientcommons-httpclient3.1commons-httpclient-3.1.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-avatica1.2.0-incubatingcalcite-avatica-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-core1.2.0-incubatingcalcite-core-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-linq4j1.2.0-incubatingcalcite-linq4j-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epository
    ethydromaticeigenbase-properties1.1.5eigenbase-properties-1.1.5.jar;E:softwaremaven3.3.9
    epositoryorgapachehttpcomponentshttpclient4.5.2httpclient-4.5.2.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-mapper-asl1.9.13jackson-mapper-asl-1.9.13.jar;E:softwaremaven3.3.9
    epositorycommons-codeccommons-codec1.10commons-codec-1.10.jar;E:softwaremaven3.3.9
    epositoryjoda-timejoda-time2.9.3joda-time-2.9.3.jar;E:softwaremaven3.3.9
    epositoryorgjoddjodd-core3.5.2jodd-core-3.5.2.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-core3.2.10datanucleus-core-3.2.10.jar;E:softwaremaven3.3.9
    epositoryorgapache	hriftlibthrift0.9.3libthrift-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapache	hriftlibfb3030.9.3libfb303-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-streaming-kafka-0-10_2.112.2.0spark-streaming-kafka-0-10_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachekafkakafka_2.110.10.0.1kafka_2.11-0.10.0.1.jar;E:softwaremaven3.3.9
    epositorycom101teczkclient0.8zkclient-0.8.jar;E:softwaremaven3.3.9
    epositorycomyammermetricsmetrics-core2.2.0metrics-core-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langmodulesscala-parser-combinators_2.111.0.4scala-parser-combinators_2.11-1.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sql-kafka-0-10_2.112.2.0spark-sql-kafka-0-10_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachekafkakafka-clients0.10.0.1kafka-clients-0.10.0.1.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-client2.6.0hadoop-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-common2.6.0hadoop-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryxmlencxmlenc0.52xmlenc-0.52.jar;E:softwaremaven3.3.9
    epositorycommons-collectionscommons-collections3.2.1commons-collections-3.2.1.jar;E:softwaremaven3.3.9
    epositorycommons-configurationcommons-configuration1.6commons-configuration-1.6.jar;E:softwaremaven3.3.9
    epositorycommons-digestercommons-digester1.8commons-digester-1.8.jar;E:softwaremaven3.3.9
    epositorycommons-beanutilscommons-beanutils1.7.0commons-beanutils-1.7.0.jar;E:softwaremaven3.3.9
    epositorycommons-beanutilscommons-beanutils-core1.8.0commons-beanutils-core-1.8.0.jar;E:softwaremaven3.3.9
    epositorycomgoogleprotobufprotobuf-java2.5.0protobuf-java-2.5.0.jar;E:softwaremaven3.3.9
    epositorycomgooglecodegsongson2.2.4gson-2.2.4.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-auth2.6.0hadoop-auth-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryserverapacheds-kerberos-codec2.0.0-M15apacheds-kerberos-codec-2.0.0-M15.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryserverapacheds-i18n2.0.0-M15apacheds-i18n-2.0.0-M15.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryapiapi-asn1-api1.0.0-M20api-asn1-api-1.0.0-M20.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryapiapi-util1.0.0-M20api-util-1.0.0-M20.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-client2.6.0curator-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorghtracehtrace-core3.0.4htrace-core-3.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-hdfs2.6.0hadoop-hdfs-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgmortbayjettyjetty-util6.1.26jetty-util-6.1.26.jar;E:softwaremaven3.3.9
    epositoryxercesxercesImpl2.9.1xercesImpl-2.9.1.jar;E:softwaremaven3.3.9
    epositoryxml-apisxml-apis1.3.04xml-apis-1.3.04.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-app2.6.0hadoop-mapreduce-client-app-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-common2.6.0hadoop-mapreduce-client-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-client2.6.0hadoop-yarn-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-server-common2.6.0hadoop-yarn-server-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-shuffle2.6.0hadoop-mapreduce-client-shuffle-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-api2.6.0hadoop-yarn-api-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-core2.6.0hadoop-mapreduce-client-core-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-common2.6.0hadoop-yarn-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryjavaxxmlindjaxb-api2.2.2jaxb-api-2.2.2.jar;E:softwaremaven3.3.9
    epositoryjavaxxmlstreamstax-api1.0-2stax-api-1.0-2.jar;E:softwaremaven3.3.9
    epositoryjavaxservletservlet-api2.5servlet-api-2.5.jar;E:softwaremaven3.3.9
    epositorycomsunjerseyjersey-core1.9jersey-core-1.9.jar;E:softwaremaven3.3.9
    epositorycomsunjerseyjersey-client1.9jersey-client-1.9.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-jaxrs1.9.13jackson-jaxrs-1.9.13.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-xc1.9.13jackson-xc-1.9.13.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-jobclient2.6.0hadoop-mapreduce-client-jobclient-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-annotations2.6.0hadoop-annotations-2.6.0.jar com.spark.test.Test
    Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
    18/03/14 17:01:07 INFO SparkContext: Running Spark version 2.2.0
    18/03/14 17:01:07 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
    18/03/14 17:01:08 ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
    java.io.IOException: Could not locate executable nullinwinutils.exe in the Hadoop binaries.
        at org.apache.hadoop.util.Shell.getQualifiedBinPath(Shell.java:355)
        at org.apache.hadoop.util.Shell.getWinUtilsPath(Shell.java:370)
        at org.apache.hadoop.util.Shell.<clinit>(Shell.java:363)
        at org.apache.hadoop.util.StringUtils.<clinit>(StringUtils.java:79)
        at org.apache.hadoop.security.Groups.parseStaticMapping(Groups.java:104)
        at org.apache.hadoop.security.Groups.<init>(Groups.java:86)
        at org.apache.hadoop.security.Groups.<init>(Groups.java:66)
        at org.apache.hadoop.security.Groups.getUserToGroupsMappingService(Groups.java:280)
        at org.apache.hadoop.security.UserGroupInformation.initialize(UserGroupInformation.java:271)
        at org.apache.hadoop.security.UserGroupInformation.ensureInitialized(UserGroupInformation.java:248)
        at org.apache.hadoop.security.UserGroupInformation.loginUserFromSubject(UserGroupInformation.java:763)
        at org.apache.hadoop.security.UserGroupInformation.getLoginUser(UserGroupInformation.java:748)
        at org.apache.hadoop.security.UserGroupInformation.getCurrentUser(UserGroupInformation.java:621)
        at org.apache.spark.util.Utils$$anonfun$getCurrentUserName$1.apply(Utils.scala:2430)
        at org.apache.spark.util.Utils$$anonfun$getCurrentUserName$1.apply(Utils.scala:2430)
        at scala.Option.getOrElse(Option.scala:121)
        at org.apache.spark.util.Utils$.getCurrentUserName(Utils.scala:2430)
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:295)
        at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2509)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:909)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:901)
        at scala.Option.getOrElse(Option.scala:121)
        at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:901)
        at com.spark.test.Test$.main(Test.scala:11)
        at com.spark.test.Test.main(Test.scala)
    18/03/14 17:01:08 ERROR SparkContext: Error initializing SparkContext.
    org.apache.spark.SparkException: A master URL must be set in your configuration
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:376)
        at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2509)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:909)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:901)
        at scala.Option.getOrElse(Option.scala:121)
        at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:901)
        at com.spark.test.Test$.main(Test.scala:11)
        at com.spark.test.Test.main(Test.scala)
    18/03/14 17:01:08 INFO SparkContext: Successfully stopped SparkContext
    Exception in thread "main" org.apache.spark.SparkException: A master URL must be set in your configuration
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:376)
        at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2509)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:909)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:901)
        at scala.Option.getOrElse(Option.scala:121)
        at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:901)
        at com.spark.test.Test$.main(Test.scala:11)
        at com.spark.test.Test.main(Test.scala)
    
    Process finished with exit code 1

     这是因为我本地没有配置好hadoop,现在我们配一个

    这个是我本地的hadoop/bin

     下面把本地win10的环境变量配置一下

     

     

     再重启一下idea,再运行一下程序

    报了另外一个错误,但是可以确定的是前面的错误我们解决了

    E:softwarejdk1.8injava "-javaagent:E:softwareIDEAIntelliJ IDEA 2017.2.6libidea_rt.jar=60011:E:softwareIDEAIntelliJ IDEA 2017.2.6in" -Dfile.encoding=UTF-8 -classpath E:softwarejdk1.8jrelibcharsets.jar;E:softwarejdk1.8jrelibdeploy.jar;E:softwarejdk1.8jrelibextaccess-bridge-64.jar;E:softwarejdk1.8jrelibextcldrdata.jar;E:softwarejdk1.8jrelibextdnsns.jar;E:softwarejdk1.8jrelibextjaccess.jar;E:softwarejdk1.8jrelibextjfxrt.jar;E:softwarejdk1.8jrelibextlocaledata.jar;E:softwarejdk1.8jrelibext
    ashorn.jar;E:softwarejdk1.8jrelibextsunec.jar;E:softwarejdk1.8jrelibextsunjce_provider.jar;E:softwarejdk1.8jrelibextsunmscapi.jar;E:softwarejdk1.8jrelibextsunpkcs11.jar;E:softwarejdk1.8jrelibextzipfs.jar;E:softwarejdk1.8jrelibjavaws.jar;E:softwarejdk1.8jrelibjce.jar;E:softwarejdk1.8jrelibjfr.jar;E:softwarejdk1.8jrelibjfxswt.jar;E:softwarejdk1.8jrelibjsse.jar;E:softwarejdk1.8jrelibmanagement-agent.jar;E:softwarejdk1.8jrelibplugin.jar;E:softwarejdk1.8jrelib
    esources.jar;E:softwarejdk1.8jrelib
    t.jar;E:MycodeSparkStu	argetclasses;E:softwareScalalibscala-actors-2.11.0.jar;E:softwareScalalibscala-actors-migration_2.11-1.1.0.jar;E:softwareScalalibscala-library.jar;E:softwareScalalibscala-parser-combinators_2.11-1.0.4.jar;E:softwareScalalibscala-reflect.jar;E:softwareScalalibscala-swing_2.11-1.0.2.jar;E:softwareScalalibscala-xml_2.11-1.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-core_2.112.2.0spark-core_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro1.7.7avro-1.7.7.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-core-asl1.9.13jackson-core-asl-1.9.13.jar;E:softwaremaven3.3.9
    epositorycom	houghtworksparanamerparanamer2.3paranamer-2.3.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-compress1.4.1commons-compress-1.4.1.jar;E:softwaremaven3.3.9
    epositoryorg	ukaanixz1.0xz-1.0.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-mapred1.7.7avro-mapred-1.7.7-hadoop2.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-ipc1.7.7avro-ipc-1.7.7.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-ipc1.7.7avro-ipc-1.7.7-tests.jar;E:softwaremaven3.3.9
    epositorycom	witterchill_2.110.8.0chill_2.11-0.8.0.jar;E:softwaremaven3.3.9
    epositorycomesotericsoftwarekryo-shaded3.0.3kryo-shaded-3.0.3.jar;E:softwaremaven3.3.9
    epositorycomesotericsoftwareminlog1.3.0minlog-1.3.0.jar;E:softwaremaven3.3.9
    epositoryorgobjenesisobjenesis2.1objenesis-2.1.jar;E:softwaremaven3.3.9
    epositorycom	witterchill-java0.8.0chill-java-0.8.0.jar;E:softwaremaven3.3.9
    epositoryorgapachexbeanxbean-asm5-shaded4.4xbean-asm5-shaded-4.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-launcher_2.112.2.0spark-launcher_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-network-common_2.112.2.0spark-network-common_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgfusesourceleveldbjnileveldbjni-all1.8leveldbjni-all-1.8.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-annotations2.6.5jackson-annotations-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-network-shuffle_2.112.2.0spark-network-shuffle_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-unsafe_2.112.2.0spark-unsafe_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epository
    etjavadevjets3tjets3t0.9.3jets3t-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapachehttpcomponentshttpcore4.3.3httpcore-4.3.3.jar;E:softwaremaven3.3.9
    epositoryjavaxactivationactivation1.1.1activation-1.1.1.jar;E:softwaremaven3.3.9
    epositorymx4jmx4j3.0.2mx4j-3.0.2.jar;E:softwaremaven3.3.9
    epositoryjavaxmailmail1.4.7mail-1.4.7.jar;E:softwaremaven3.3.9
    epositoryorgouncycastlecprov-jdk15on1.51cprov-jdk15on-1.51.jar;E:softwaremaven3.3.9
    epositorycomjamesmurtyutilsjava-xmlbuilder1.0java-xmlbuilder-1.0.jar;E:softwaremaven3.3.9
    epository
    etiharderase642.3.8ase64-2.3.8.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-recipes2.6.0curator-recipes-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-framework2.6.0curator-framework-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachezookeeperzookeeper3.4.6zookeeper-3.4.6.jar;E:softwaremaven3.3.9
    epositorycomgoogleguavaguava16.0.1guava-16.0.1.jar;E:softwaremaven3.3.9
    epositoryjavaxservletjavax.servlet-api3.1.0javax.servlet-api-3.1.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-lang33.5commons-lang3-3.5.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-math33.4.1commons-math3-3.4.1.jar;E:softwaremaven3.3.9
    epositorycomgooglecodefindbugsjsr3051.3.9jsr305-1.3.9.jar;E:softwaremaven3.3.9
    epositoryorgslf4jslf4j-api1.7.16slf4j-api-1.7.16.jar;E:softwaremaven3.3.9
    epositoryorgslf4jjul-to-slf4j1.7.16jul-to-slf4j-1.7.16.jar;E:softwaremaven3.3.9
    epositoryorgslf4jjcl-over-slf4j1.7.16jcl-over-slf4j-1.7.16.jar;E:softwaremaven3.3.9
    epositorylog4jlog4j1.2.17log4j-1.2.17.jar;E:softwaremaven3.3.9
    epositoryorgslf4jslf4j-log4j121.7.16slf4j-log4j12-1.7.16.jar;E:softwaremaven3.3.9
    epositorycom
    ingcompress-lzf1.0.3compress-lzf-1.0.3.jar;E:softwaremaven3.3.9
    epositoryorgxerialsnappysnappy-java1.1.2.6snappy-java-1.1.2.6.jar;E:softwaremaven3.3.9
    epository
    etjpountzlz4lz41.3.0lz4-1.3.0.jar;E:softwaremaven3.3.9
    epositoryorg
    oaringbitmapRoaringBitmap0.5.11RoaringBitmap-0.5.11.jar;E:softwaremaven3.3.9
    epositorycommons-netcommons-net2.2commons-net-2.2.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-library2.11.8scala-library-2.11.8.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-jackson_2.113.2.11json4s-jackson_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-core_2.113.2.11json4s-core_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-ast_2.113.2.11json4s-ast_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscalap2.11.0scalap-2.11.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-compiler2.11.0scala-compiler-2.11.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langmodulesscala-xml_2.111.0.1scala-xml_2.11-1.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-client2.22.2jersey-client-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxws
    sjavax.ws.rs-api2.0.1javax.ws.rs-api-2.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-api2.4.0-b34hk2-api-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-utils2.4.0-b34hk2-utils-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2externalaopalliance-repackaged2.4.0-b34aopalliance-repackaged-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2externaljavax.inject2.4.0-b34javax.inject-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-locator2.4.0-b34hk2-locator-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgjavassistjavassist3.18.1-GAjavassist-3.18.1-GA.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-common2.22.2jersey-common-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxannotationjavax.annotation-api1.2javax.annotation-api-1.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseyundles
    epackagedjersey-guava2.22.2jersey-guava-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2osgi-resource-locator1.0.1osgi-resource-locator-1.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-server2.22.2jersey-server-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseymediajersey-media-jaxb2.22.2jersey-media-jaxb-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxvalidationvalidation-api1.1.0.Finalvalidation-api-1.1.0.Final.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycontainersjersey-container-servlet2.22.2jersey-container-servlet-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycontainersjersey-container-servlet-core2.22.2jersey-container-servlet-core-2.22.2.jar;E:softwaremaven3.3.9
    epositoryio
    etty
    etty-all4.0.43.Final
    etty-all-4.0.43.Final.jar;E:softwaremaven3.3.9
    epositoryio
    etty
    etty3.9.9.Final
    etty-3.9.9.Final.jar;E:softwaremaven3.3.9
    epositorycomclearspringanalyticsstream2.7.0stream-2.7.0.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-core3.1.2metrics-core-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-jvm3.1.2metrics-jvm-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-json3.1.2metrics-json-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-graphite3.1.2metrics-graphite-3.1.2.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-databind2.6.5jackson-databind-2.6.5.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-core2.6.5jackson-core-2.6.5.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksonmodulejackson-module-scala_2.112.6.5jackson-module-scala_2.11-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-reflect2.11.7scala-reflect-2.11.7.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksonmodulejackson-module-paranamer2.6.5jackson-module-paranamer-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgapacheivyivy2.4.0ivy-2.4.0.jar;E:softwaremaven3.3.9
    epositoryorooro2.0.8oro-2.0.8.jar;E:softwaremaven3.3.9
    epository
    et
    azorvinepyrolite4.13pyrolite-4.13.jar;E:softwaremaven3.3.9
    epository
    etsfpy4jpy4j0.10.4py4j-0.10.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-tags_2.112.2.0spark-tags_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-crypto1.0.0commons-crypto-1.0.0.jar;E:softwaremaven3.3.9
    epositoryorgspark-projectsparkunused1.0.0unused-1.0.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sql_2.112.2.0spark-sql_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositorycomunivocityunivocity-parsers2.2.1univocity-parsers-2.2.1.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sketch_2.112.2.0spark-sketch_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-catalyst_2.112.2.0spark-catalyst_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjaninojanino3.0.0janino-3.0.0.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjaninocommons-compiler3.0.0commons-compiler-3.0.0.jar;E:softwaremaven3.3.9
    epositoryorgantlrantlr4-runtime4.5.3antlr4-runtime-4.5.3.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-column1.8.2parquet-column-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-common1.8.2parquet-common-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-encoding1.8.2parquet-encoding-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-hadoop1.8.2parquet-hadoop-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-format2.3.1parquet-format-2.3.1.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-jackson1.8.2parquet-jackson-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-streaming_2.112.2.0spark-streaming_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-hive_2.112.2.0spark-hive_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositorycom	witterparquet-hadoop-bundle1.6.0parquet-hadoop-bundle-1.6.0.jar;E:softwaremaven3.3.9
    epositoryorgspark-projecthivehive-exec1.2.1.spark2hive-exec-1.2.1.spark2.jar;E:softwaremaven3.3.9
    epositorycommons-iocommons-io2.4commons-io-2.4.jar;E:softwaremaven3.3.9
    epositorycommons-langcommons-lang2.6commons-lang-2.6.jar;E:softwaremaven3.3.9
    epositoryjavolutionjavolution5.5.1javolution-5.5.1.jar;E:softwaremaven3.3.9
    epositorylog4japache-log4j-extras1.2.17apache-log4j-extras-1.2.17.jar;E:softwaremaven3.3.9
    epositoryorgantlrantlr-runtime3.4antlr-runtime-3.4.jar;E:softwaremaven3.3.9
    epositoryorgantlrstringtemplate3.2.1stringtemplate-3.2.1.jar;E:softwaremaven3.3.9
    epositoryantlrantlr2.7.7antlr-2.7.7.jar;E:softwaremaven3.3.9
    epositoryorgantlrST44.0.4ST4-4.0.4.jar;E:softwaremaven3.3.9
    epositorycomgooglecodejavaewahJavaEWAH0.3.2JavaEWAH-0.3.2.jar;E:softwaremaven3.3.9
    epositoryorgiq80snappysnappy0.2snappy-0.2.jar;E:softwaremaven3.3.9
    epositorystaxstax-api1.0.1stax-api-1.0.1.jar;E:softwaremaven3.3.9
    epository
    etsfopencsvopencsv2.3opencsv-2.3.jar;E:softwaremaven3.3.9
    epositoryorgspark-projecthivehive-metastore1.2.1.spark2hive-metastore-1.2.1.spark2.jar;E:softwaremaven3.3.9
    epositorycomjolboxonecp0.8.0.RELEASEonecp-0.8.0.RELEASE.jar;E:softwaremaven3.3.9
    epositorycommons-clicommons-cli1.2commons-cli-1.2.jar;E:softwaremaven3.3.9
    epositorycommons-loggingcommons-logging1.1.3commons-logging-1.1.3.jar;E:softwaremaven3.3.9
    epositoryorgapachederbyderby10.10.2.0derby-10.10.2.0.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-api-jdo3.2.6datanucleus-api-jdo-3.2.6.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-rdbms3.2.9datanucleus-rdbms-3.2.9.jar;E:softwaremaven3.3.9
    epositorycommons-poolcommons-pool1.5.4commons-pool-1.5.4.jar;E:softwaremaven3.3.9
    epositorycommons-dbcpcommons-dbcp1.4commons-dbcp-1.4.jar;E:softwaremaven3.3.9
    epositoryjavaxjdojdo-api3.0.1jdo-api-3.0.1.jar;E:softwaremaven3.3.9
    epositoryjavax	ransactionjta1.1jta-1.1.jar;E:softwaremaven3.3.9
    epositorycommons-httpclientcommons-httpclient3.1commons-httpclient-3.1.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-avatica1.2.0-incubatingcalcite-avatica-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-core1.2.0-incubatingcalcite-core-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-linq4j1.2.0-incubatingcalcite-linq4j-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epository
    ethydromaticeigenbase-properties1.1.5eigenbase-properties-1.1.5.jar;E:softwaremaven3.3.9
    epositoryorgapachehttpcomponentshttpclient4.5.2httpclient-4.5.2.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-mapper-asl1.9.13jackson-mapper-asl-1.9.13.jar;E:softwaremaven3.3.9
    epositorycommons-codeccommons-codec1.10commons-codec-1.10.jar;E:softwaremaven3.3.9
    epositoryjoda-timejoda-time2.9.3joda-time-2.9.3.jar;E:softwaremaven3.3.9
    epositoryorgjoddjodd-core3.5.2jodd-core-3.5.2.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-core3.2.10datanucleus-core-3.2.10.jar;E:softwaremaven3.3.9
    epositoryorgapache	hriftlibthrift0.9.3libthrift-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapache	hriftlibfb3030.9.3libfb303-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-streaming-kafka-0-10_2.112.2.0spark-streaming-kafka-0-10_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachekafkakafka_2.110.10.0.1kafka_2.11-0.10.0.1.jar;E:softwaremaven3.3.9
    epositorycom101teczkclient0.8zkclient-0.8.jar;E:softwaremaven3.3.9
    epositorycomyammermetricsmetrics-core2.2.0metrics-core-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langmodulesscala-parser-combinators_2.111.0.4scala-parser-combinators_2.11-1.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sql-kafka-0-10_2.112.2.0spark-sql-kafka-0-10_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachekafkakafka-clients0.10.0.1kafka-clients-0.10.0.1.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-client2.6.0hadoop-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-common2.6.0hadoop-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryxmlencxmlenc0.52xmlenc-0.52.jar;E:softwaremaven3.3.9
    epositorycommons-collectionscommons-collections3.2.1commons-collections-3.2.1.jar;E:softwaremaven3.3.9
    epositorycommons-configurationcommons-configuration1.6commons-configuration-1.6.jar;E:softwaremaven3.3.9
    epositorycommons-digestercommons-digester1.8commons-digester-1.8.jar;E:softwaremaven3.3.9
    epositorycommons-beanutilscommons-beanutils1.7.0commons-beanutils-1.7.0.jar;E:softwaremaven3.3.9
    epositorycommons-beanutilscommons-beanutils-core1.8.0commons-beanutils-core-1.8.0.jar;E:softwaremaven3.3.9
    epositorycomgoogleprotobufprotobuf-java2.5.0protobuf-java-2.5.0.jar;E:softwaremaven3.3.9
    epositorycomgooglecodegsongson2.2.4gson-2.2.4.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-auth2.6.0hadoop-auth-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryserverapacheds-kerberos-codec2.0.0-M15apacheds-kerberos-codec-2.0.0-M15.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryserverapacheds-i18n2.0.0-M15apacheds-i18n-2.0.0-M15.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryapiapi-asn1-api1.0.0-M20api-asn1-api-1.0.0-M20.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryapiapi-util1.0.0-M20api-util-1.0.0-M20.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-client2.6.0curator-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorghtracehtrace-core3.0.4htrace-core-3.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-hdfs2.6.0hadoop-hdfs-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgmortbayjettyjetty-util6.1.26jetty-util-6.1.26.jar;E:softwaremaven3.3.9
    epositoryxercesxercesImpl2.9.1xercesImpl-2.9.1.jar;E:softwaremaven3.3.9
    epositoryxml-apisxml-apis1.3.04xml-apis-1.3.04.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-app2.6.0hadoop-mapreduce-client-app-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-common2.6.0hadoop-mapreduce-client-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-client2.6.0hadoop-yarn-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-server-common2.6.0hadoop-yarn-server-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-shuffle2.6.0hadoop-mapreduce-client-shuffle-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-api2.6.0hadoop-yarn-api-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-core2.6.0hadoop-mapreduce-client-core-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-common2.6.0hadoop-yarn-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryjavaxxmlindjaxb-api2.2.2jaxb-api-2.2.2.jar;E:softwaremaven3.3.9
    epositoryjavaxxmlstreamstax-api1.0-2stax-api-1.0-2.jar;E:softwaremaven3.3.9
    epositoryjavaxservletservlet-api2.5servlet-api-2.5.jar;E:softwaremaven3.3.9
    epositorycomsunjerseyjersey-core1.9jersey-core-1.9.jar;E:softwaremaven3.3.9
    epositorycomsunjerseyjersey-client1.9jersey-client-1.9.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-jaxrs1.9.13jackson-jaxrs-1.9.13.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-xc1.9.13jackson-xc-1.9.13.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-jobclient2.6.0hadoop-mapreduce-client-jobclient-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-annotations2.6.0hadoop-annotations-2.6.0.jar com.spark.test.Test
    Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
    18/03/14 17:34:56 INFO SparkContext: Running Spark version 2.2.0
    18/03/14 17:34:57 ERROR SparkContext: Error initializing SparkContext.
    org.apache.spark.SparkException: A master URL must be set in your configuration
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:376)
        at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2509)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:909)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:901)
        at scala.Option.getOrElse(Option.scala:121)
        at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:901)
        at com.spark.test.Test$.main(Test.scala:11)
        at com.spark.test.Test.main(Test.scala)
    18/03/14 17:34:57 INFO SparkContext: Successfully stopped SparkContext
    Exception in thread "main" org.apache.spark.SparkException: A master URL must be set in your configuration
        at org.apache.spark.SparkContext.<init>(SparkContext.scala:376)
        at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2509)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:909)
        at org.apache.spark.sql.SparkSession$Builder$$anonfun$6.apply(SparkSession.scala:901)
        at scala.Option.getOrElse(Option.scala:121)
        at org.apache.spark.sql.SparkSession$Builder.getOrCreate(SparkSession.scala:901)
        at com.spark.test.Test$.main(Test.scala:11)
        at com.spark.test.Test.main(Test.scala)
    
    Process finished with exit code 1

    这里的错误是说要指明你的程序运行在什么地方

    在程序里加上这一句,指明我们现在在本地运行

     我们再运行一次,可以看到没问题了

    我们继续修改程序,加上这一句

     再次运行看看结果

     把相同的单词进行累加

     我们看看运行结果

    刚刚我们使用的是rdd的方式,接下来我们使用dataSet的方式

    dataSet可以近似的理解为数据库的一张张表

    我们运行的结果

     用空格切分单词

     

     运行结果

     

    E:softwarejdk1.8injava "-javaagent:E:softwareIDEAIntelliJ IDEA 2017.2.6libidea_rt.jar=62232:E:softwareIDEAIntelliJ IDEA 2017.2.6in" -Dfile.encoding=UTF-8 -classpath E:softwarejdk1.8jrelibcharsets.jar;E:softwarejdk1.8jrelibdeploy.jar;E:softwarejdk1.8jrelibextaccess-bridge-64.jar;E:softwarejdk1.8jrelibextcldrdata.jar;E:softwarejdk1.8jrelibextdnsns.jar;E:softwarejdk1.8jrelibextjaccess.jar;E:softwarejdk1.8jrelibextjfxrt.jar;E:softwarejdk1.8jrelibextlocaledata.jar;E:softwarejdk1.8jrelibext
    ashorn.jar;E:softwarejdk1.8jrelibextsunec.jar;E:softwarejdk1.8jrelibextsunjce_provider.jar;E:softwarejdk1.8jrelibextsunmscapi.jar;E:softwarejdk1.8jrelibextsunpkcs11.jar;E:softwarejdk1.8jrelibextzipfs.jar;E:softwarejdk1.8jrelibjavaws.jar;E:softwarejdk1.8jrelibjce.jar;E:softwarejdk1.8jrelibjfr.jar;E:softwarejdk1.8jrelibjfxswt.jar;E:softwarejdk1.8jrelibjsse.jar;E:softwarejdk1.8jrelibmanagement-agent.jar;E:softwarejdk1.8jrelibplugin.jar;E:softwarejdk1.8jrelib
    esources.jar;E:softwarejdk1.8jrelib
    t.jar;E:MycodeSparkStu	argetclasses;E:softwareScalalibscala-actors-2.11.0.jar;E:softwareScalalibscala-actors-migration_2.11-1.1.0.jar;E:softwareScalalibscala-library.jar;E:softwareScalalibscala-parser-combinators_2.11-1.0.4.jar;E:softwareScalalibscala-reflect.jar;E:softwareScalalibscala-swing_2.11-1.0.2.jar;E:softwareScalalibscala-xml_2.11-1.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-core_2.112.2.0spark-core_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro1.7.7avro-1.7.7.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-core-asl1.9.13jackson-core-asl-1.9.13.jar;E:softwaremaven3.3.9
    epositorycom	houghtworksparanamerparanamer2.3paranamer-2.3.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-compress1.4.1commons-compress-1.4.1.jar;E:softwaremaven3.3.9
    epositoryorg	ukaanixz1.0xz-1.0.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-mapred1.7.7avro-mapred-1.7.7-hadoop2.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-ipc1.7.7avro-ipc-1.7.7.jar;E:softwaremaven3.3.9
    epositoryorgapacheavroavro-ipc1.7.7avro-ipc-1.7.7-tests.jar;E:softwaremaven3.3.9
    epositorycom	witterchill_2.110.8.0chill_2.11-0.8.0.jar;E:softwaremaven3.3.9
    epositorycomesotericsoftwarekryo-shaded3.0.3kryo-shaded-3.0.3.jar;E:softwaremaven3.3.9
    epositorycomesotericsoftwareminlog1.3.0minlog-1.3.0.jar;E:softwaremaven3.3.9
    epositoryorgobjenesisobjenesis2.1objenesis-2.1.jar;E:softwaremaven3.3.9
    epositorycom	witterchill-java0.8.0chill-java-0.8.0.jar;E:softwaremaven3.3.9
    epositoryorgapachexbeanxbean-asm5-shaded4.4xbean-asm5-shaded-4.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-launcher_2.112.2.0spark-launcher_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-network-common_2.112.2.0spark-network-common_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgfusesourceleveldbjnileveldbjni-all1.8leveldbjni-all-1.8.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-annotations2.6.5jackson-annotations-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-network-shuffle_2.112.2.0spark-network-shuffle_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-unsafe_2.112.2.0spark-unsafe_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epository
    etjavadevjets3tjets3t0.9.3jets3t-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapachehttpcomponentshttpcore4.3.3httpcore-4.3.3.jar;E:softwaremaven3.3.9
    epositoryjavaxactivationactivation1.1.1activation-1.1.1.jar;E:softwaremaven3.3.9
    epositorymx4jmx4j3.0.2mx4j-3.0.2.jar;E:softwaremaven3.3.9
    epositoryjavaxmailmail1.4.7mail-1.4.7.jar;E:softwaremaven3.3.9
    epositoryorgouncycastlecprov-jdk15on1.51cprov-jdk15on-1.51.jar;E:softwaremaven3.3.9
    epositorycomjamesmurtyutilsjava-xmlbuilder1.0java-xmlbuilder-1.0.jar;E:softwaremaven3.3.9
    epository
    etiharderase642.3.8ase64-2.3.8.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-recipes2.6.0curator-recipes-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-framework2.6.0curator-framework-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachezookeeperzookeeper3.4.6zookeeper-3.4.6.jar;E:softwaremaven3.3.9
    epositorycomgoogleguavaguava16.0.1guava-16.0.1.jar;E:softwaremaven3.3.9
    epositoryjavaxservletjavax.servlet-api3.1.0javax.servlet-api-3.1.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-lang33.5commons-lang3-3.5.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-math33.4.1commons-math3-3.4.1.jar;E:softwaremaven3.3.9
    epositorycomgooglecodefindbugsjsr3051.3.9jsr305-1.3.9.jar;E:softwaremaven3.3.9
    epositoryorgslf4jslf4j-api1.7.16slf4j-api-1.7.16.jar;E:softwaremaven3.3.9
    epositoryorgslf4jjul-to-slf4j1.7.16jul-to-slf4j-1.7.16.jar;E:softwaremaven3.3.9
    epositoryorgslf4jjcl-over-slf4j1.7.16jcl-over-slf4j-1.7.16.jar;E:softwaremaven3.3.9
    epositorylog4jlog4j1.2.17log4j-1.2.17.jar;E:softwaremaven3.3.9
    epositoryorgslf4jslf4j-log4j121.7.16slf4j-log4j12-1.7.16.jar;E:softwaremaven3.3.9
    epositorycom
    ingcompress-lzf1.0.3compress-lzf-1.0.3.jar;E:softwaremaven3.3.9
    epositoryorgxerialsnappysnappy-java1.1.2.6snappy-java-1.1.2.6.jar;E:softwaremaven3.3.9
    epository
    etjpountzlz4lz41.3.0lz4-1.3.0.jar;E:softwaremaven3.3.9
    epositoryorg
    oaringbitmapRoaringBitmap0.5.11RoaringBitmap-0.5.11.jar;E:softwaremaven3.3.9
    epositorycommons-netcommons-net2.2commons-net-2.2.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-library2.11.8scala-library-2.11.8.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-jackson_2.113.2.11json4s-jackson_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-core_2.113.2.11json4s-core_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgjson4sjson4s-ast_2.113.2.11json4s-ast_2.11-3.2.11.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscalap2.11.0scalap-2.11.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-compiler2.11.0scala-compiler-2.11.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langmodulesscala-xml_2.111.0.1scala-xml_2.11-1.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-client2.22.2jersey-client-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxws
    sjavax.ws.rs-api2.0.1javax.ws.rs-api-2.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-api2.4.0-b34hk2-api-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-utils2.4.0-b34hk2-utils-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2externalaopalliance-repackaged2.4.0-b34aopalliance-repackaged-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2externaljavax.inject2.4.0-b34javax.inject-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2hk2-locator2.4.0-b34hk2-locator-2.4.0-b34.jar;E:softwaremaven3.3.9
    epositoryorgjavassistjavassist3.18.1-GAjavassist-3.18.1-GA.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-common2.22.2jersey-common-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxannotationjavax.annotation-api1.2javax.annotation-api-1.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseyundles
    epackagedjersey-guava2.22.2jersey-guava-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishhk2osgi-resource-locator1.0.1osgi-resource-locator-1.0.1.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycorejersey-server2.22.2jersey-server-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseymediajersey-media-jaxb2.22.2jersey-media-jaxb-2.22.2.jar;E:softwaremaven3.3.9
    epositoryjavaxvalidationvalidation-api1.1.0.Finalvalidation-api-1.1.0.Final.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycontainersjersey-container-servlet2.22.2jersey-container-servlet-2.22.2.jar;E:softwaremaven3.3.9
    epositoryorgglassfishjerseycontainersjersey-container-servlet-core2.22.2jersey-container-servlet-core-2.22.2.jar;E:softwaremaven3.3.9
    epositoryio
    etty
    etty-all4.0.43.Final
    etty-all-4.0.43.Final.jar;E:softwaremaven3.3.9
    epositoryio
    etty
    etty3.9.9.Final
    etty-3.9.9.Final.jar;E:softwaremaven3.3.9
    epositorycomclearspringanalyticsstream2.7.0stream-2.7.0.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-core3.1.2metrics-core-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-jvm3.1.2metrics-jvm-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-json3.1.2metrics-json-3.1.2.jar;E:softwaremaven3.3.9
    epositoryiodropwizardmetricsmetrics-graphite3.1.2metrics-graphite-3.1.2.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-databind2.6.5jackson-databind-2.6.5.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksoncorejackson-core2.6.5jackson-core-2.6.5.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksonmodulejackson-module-scala_2.112.6.5jackson-module-scala_2.11-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgscala-langscala-reflect2.11.7scala-reflect-2.11.7.jar;E:softwaremaven3.3.9
    epositorycomfasterxmljacksonmodulejackson-module-paranamer2.6.5jackson-module-paranamer-2.6.5.jar;E:softwaremaven3.3.9
    epositoryorgapacheivyivy2.4.0ivy-2.4.0.jar;E:softwaremaven3.3.9
    epositoryorooro2.0.8oro-2.0.8.jar;E:softwaremaven3.3.9
    epository
    et
    azorvinepyrolite4.13pyrolite-4.13.jar;E:softwaremaven3.3.9
    epository
    etsfpy4jpy4j0.10.4py4j-0.10.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-tags_2.112.2.0spark-tags_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachecommonscommons-crypto1.0.0commons-crypto-1.0.0.jar;E:softwaremaven3.3.9
    epositoryorgspark-projectsparkunused1.0.0unused-1.0.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sql_2.112.2.0spark-sql_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositorycomunivocityunivocity-parsers2.2.1univocity-parsers-2.2.1.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sketch_2.112.2.0spark-sketch_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-catalyst_2.112.2.0spark-catalyst_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjaninojanino3.0.0janino-3.0.0.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjaninocommons-compiler3.0.0commons-compiler-3.0.0.jar;E:softwaremaven3.3.9
    epositoryorgantlrantlr4-runtime4.5.3antlr4-runtime-4.5.3.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-column1.8.2parquet-column-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-common1.8.2parquet-common-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-encoding1.8.2parquet-encoding-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-hadoop1.8.2parquet-hadoop-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-format2.3.1parquet-format-2.3.1.jar;E:softwaremaven3.3.9
    epositoryorgapacheparquetparquet-jackson1.8.2parquet-jackson-1.8.2.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-streaming_2.112.2.0spark-streaming_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-hive_2.112.2.0spark-hive_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositorycom	witterparquet-hadoop-bundle1.6.0parquet-hadoop-bundle-1.6.0.jar;E:softwaremaven3.3.9
    epositoryorgspark-projecthivehive-exec1.2.1.spark2hive-exec-1.2.1.spark2.jar;E:softwaremaven3.3.9
    epositorycommons-iocommons-io2.4commons-io-2.4.jar;E:softwaremaven3.3.9
    epositorycommons-langcommons-lang2.6commons-lang-2.6.jar;E:softwaremaven3.3.9
    epositoryjavolutionjavolution5.5.1javolution-5.5.1.jar;E:softwaremaven3.3.9
    epositorylog4japache-log4j-extras1.2.17apache-log4j-extras-1.2.17.jar;E:softwaremaven3.3.9
    epositoryorgantlrantlr-runtime3.4antlr-runtime-3.4.jar;E:softwaremaven3.3.9
    epositoryorgantlrstringtemplate3.2.1stringtemplate-3.2.1.jar;E:softwaremaven3.3.9
    epositoryantlrantlr2.7.7antlr-2.7.7.jar;E:softwaremaven3.3.9
    epositoryorgantlrST44.0.4ST4-4.0.4.jar;E:softwaremaven3.3.9
    epositorycomgooglecodejavaewahJavaEWAH0.3.2JavaEWAH-0.3.2.jar;E:softwaremaven3.3.9
    epositoryorgiq80snappysnappy0.2snappy-0.2.jar;E:softwaremaven3.3.9
    epositorystaxstax-api1.0.1stax-api-1.0.1.jar;E:softwaremaven3.3.9
    epository
    etsfopencsvopencsv2.3opencsv-2.3.jar;E:softwaremaven3.3.9
    epositoryorgspark-projecthivehive-metastore1.2.1.spark2hive-metastore-1.2.1.spark2.jar;E:softwaremaven3.3.9
    epositorycomjolboxonecp0.8.0.RELEASEonecp-0.8.0.RELEASE.jar;E:softwaremaven3.3.9
    epositorycommons-clicommons-cli1.2commons-cli-1.2.jar;E:softwaremaven3.3.9
    epositorycommons-loggingcommons-logging1.1.3commons-logging-1.1.3.jar;E:softwaremaven3.3.9
    epositoryorgapachederbyderby10.10.2.0derby-10.10.2.0.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-api-jdo3.2.6datanucleus-api-jdo-3.2.6.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-rdbms3.2.9datanucleus-rdbms-3.2.9.jar;E:softwaremaven3.3.9
    epositorycommons-poolcommons-pool1.5.4commons-pool-1.5.4.jar;E:softwaremaven3.3.9
    epositorycommons-dbcpcommons-dbcp1.4commons-dbcp-1.4.jar;E:softwaremaven3.3.9
    epositoryjavaxjdojdo-api3.0.1jdo-api-3.0.1.jar;E:softwaremaven3.3.9
    epositoryjavax	ransactionjta1.1jta-1.1.jar;E:softwaremaven3.3.9
    epositorycommons-httpclientcommons-httpclient3.1commons-httpclient-3.1.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-avatica1.2.0-incubatingcalcite-avatica-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-core1.2.0-incubatingcalcite-core-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epositoryorgapachecalcitecalcite-linq4j1.2.0-incubatingcalcite-linq4j-1.2.0-incubating.jar;E:softwaremaven3.3.9
    epository
    ethydromaticeigenbase-properties1.1.5eigenbase-properties-1.1.5.jar;E:softwaremaven3.3.9
    epositoryorgapachehttpcomponentshttpclient4.5.2httpclient-4.5.2.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-mapper-asl1.9.13jackson-mapper-asl-1.9.13.jar;E:softwaremaven3.3.9
    epositorycommons-codeccommons-codec1.10commons-codec-1.10.jar;E:softwaremaven3.3.9
    epositoryjoda-timejoda-time2.9.3joda-time-2.9.3.jar;E:softwaremaven3.3.9
    epositoryorgjoddjodd-core3.5.2jodd-core-3.5.2.jar;E:softwaremaven3.3.9
    epositoryorgdatanucleusdatanucleus-core3.2.10datanucleus-core-3.2.10.jar;E:softwaremaven3.3.9
    epositoryorgapache	hriftlibthrift0.9.3libthrift-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapache	hriftlibfb3030.9.3libfb303-0.9.3.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-streaming-kafka-0-10_2.112.2.0spark-streaming-kafka-0-10_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachekafkakafka_2.110.10.0.1kafka_2.11-0.10.0.1.jar;E:softwaremaven3.3.9
    epositorycom101teczkclient0.8zkclient-0.8.jar;E:softwaremaven3.3.9
    epositorycomyammermetricsmetrics-core2.2.0metrics-core-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgscala-langmodulesscala-parser-combinators_2.111.0.4scala-parser-combinators_2.11-1.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachesparkspark-sql-kafka-0-10_2.112.2.0spark-sql-kafka-0-10_2.11-2.2.0.jar;E:softwaremaven3.3.9
    epositoryorgapachekafkakafka-clients0.10.0.1kafka-clients-0.10.0.1.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-client2.6.0hadoop-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-common2.6.0hadoop-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryxmlencxmlenc0.52xmlenc-0.52.jar;E:softwaremaven3.3.9
    epositorycommons-collectionscommons-collections3.2.1commons-collections-3.2.1.jar;E:softwaremaven3.3.9
    epositorycommons-configurationcommons-configuration1.6commons-configuration-1.6.jar;E:softwaremaven3.3.9
    epositorycommons-digestercommons-digester1.8commons-digester-1.8.jar;E:softwaremaven3.3.9
    epositorycommons-beanutilscommons-beanutils1.7.0commons-beanutils-1.7.0.jar;E:softwaremaven3.3.9
    epositorycommons-beanutilscommons-beanutils-core1.8.0commons-beanutils-core-1.8.0.jar;E:softwaremaven3.3.9
    epositorycomgoogleprotobufprotobuf-java2.5.0protobuf-java-2.5.0.jar;E:softwaremaven3.3.9
    epositorycomgooglecodegsongson2.2.4gson-2.2.4.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-auth2.6.0hadoop-auth-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryserverapacheds-kerberos-codec2.0.0-M15apacheds-kerberos-codec-2.0.0-M15.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryserverapacheds-i18n2.0.0-M15apacheds-i18n-2.0.0-M15.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryapiapi-asn1-api1.0.0-M20api-asn1-api-1.0.0-M20.jar;E:softwaremaven3.3.9
    epositoryorgapachedirectoryapiapi-util1.0.0-M20api-util-1.0.0-M20.jar;E:softwaremaven3.3.9
    epositoryorgapachecuratorcurator-client2.6.0curator-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorghtracehtrace-core3.0.4htrace-core-3.0.4.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-hdfs2.6.0hadoop-hdfs-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgmortbayjettyjetty-util6.1.26jetty-util-6.1.26.jar;E:softwaremaven3.3.9
    epositoryxercesxercesImpl2.9.1xercesImpl-2.9.1.jar;E:softwaremaven3.3.9
    epositoryxml-apisxml-apis1.3.04xml-apis-1.3.04.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-app2.6.0hadoop-mapreduce-client-app-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-common2.6.0hadoop-mapreduce-client-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-client2.6.0hadoop-yarn-client-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-server-common2.6.0hadoop-yarn-server-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-shuffle2.6.0hadoop-mapreduce-client-shuffle-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-api2.6.0hadoop-yarn-api-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-core2.6.0hadoop-mapreduce-client-core-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-yarn-common2.6.0hadoop-yarn-common-2.6.0.jar;E:softwaremaven3.3.9
    epositoryjavaxxmlindjaxb-api2.2.2jaxb-api-2.2.2.jar;E:softwaremaven3.3.9
    epositoryjavaxxmlstreamstax-api1.0-2stax-api-1.0-2.jar;E:softwaremaven3.3.9
    epositoryjavaxservletservlet-api2.5servlet-api-2.5.jar;E:softwaremaven3.3.9
    epositorycomsunjerseyjersey-core1.9jersey-core-1.9.jar;E:softwaremaven3.3.9
    epositorycomsunjerseyjersey-client1.9jersey-client-1.9.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-jaxrs1.9.13jackson-jaxrs-1.9.13.jar;E:softwaremaven3.3.9
    epositoryorgcodehausjacksonjackson-xc1.9.13jackson-xc-1.9.13.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-mapreduce-client-jobclient2.6.0hadoop-mapreduce-client-jobclient-2.6.0.jar;E:softwaremaven3.3.9
    epositoryorgapachehadoophadoop-annotations2.6.0hadoop-annotations-2.6.0.jar com.spark.test.Test
    Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
    18/03/14 20:41:05 INFO SparkContext: Running Spark version 2.2.0
    18/03/14 20:41:06 INFO SparkContext: Submitted application: HdfsTest
    18/03/14 20:41:06 INFO SecurityManager: Changing view acls to: Brave
    18/03/14 20:41:06 INFO SecurityManager: Changing modify acls to: Brave
    18/03/14 20:41:06 INFO SecurityManager: Changing view acls groups to: 
    18/03/14 20:41:06 INFO SecurityManager: Changing modify acls groups to: 
    18/03/14 20:41:06 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users  with view permissions: Set(Brave); groups with view permissions: Set(); users  with modify permissions: Set(Brave); groups with modify permissions: Set()
    18/03/14 20:41:07 INFO Utils: Successfully started service 'sparkDriver' on port 62269.
    18/03/14 20:41:07 INFO SparkEnv: Registering MapOutputTracker
    18/03/14 20:41:07 INFO SparkEnv: Registering BlockManagerMaster
    18/03/14 20:41:07 INFO BlockManagerMasterEndpoint: Using org.apache.spark.storage.DefaultTopologyMapper for getting topology information
    18/03/14 20:41:07 INFO BlockManagerMasterEndpoint: BlockManagerMasterEndpoint up
    18/03/14 20:41:07 INFO DiskBlockManager: Created local directory at C:UsersBraveAppDataLocalTemplockmgr-2ad95228-3532-4a24-b6b6-b09973c4a4ff
    18/03/14 20:41:07 INFO MemoryStore: MemoryStore started with capacity 1998.3 MB
    18/03/14 20:41:07 INFO SparkEnv: Registering OutputCommitCoordinator
    18/03/14 20:41:07 INFO Utils: Successfully started service 'SparkUI' on port 4040.
    18/03/14 20:41:07 INFO SparkUI: Bound SparkUI to 0.0.0.0, and started at http://192.168.56.1:4040
    18/03/14 20:41:07 INFO Executor: Starting executor ID driver on host localhost
    18/03/14 20:41:07 INFO Utils: Successfully started service 'org.apache.spark.network.netty.NettyBlockTransferService' on port 62282.
    18/03/14 20:41:07 INFO NettyBlockTransferService: Server created on 192.168.56.1:62282
    18/03/14 20:41:07 INFO BlockManager: Using org.apache.spark.storage.RandomBlockReplicationPolicy for block replication policy
    18/03/14 20:41:07 INFO BlockManagerMaster: Registering BlockManager BlockManagerId(driver, 192.168.56.1, 62282, None)
    18/03/14 20:41:07 INFO BlockManagerMasterEndpoint: Registering block manager 192.168.56.1:62282 with 1998.3 MB RAM, BlockManagerId(driver, 192.168.56.1, 62282, None)
    18/03/14 20:41:07 INFO BlockManagerMaster: Registered BlockManager BlockManagerId(driver, 192.168.56.1, 62282, None)
    18/03/14 20:41:07 INFO BlockManager: Initialized BlockManager: BlockManagerId(driver, 192.168.56.1, 62282, None)
    18/03/14 20:41:08 INFO SharedState: Setting hive.metastore.warehouse.dir ('null') to the value of spark.sql.warehouse.dir ('file:/E:/Mycode/SparkStu/spark-warehouse/').
    18/03/14 20:41:08 INFO SharedState: Warehouse path is 'file:/E:/Mycode/SparkStu/spark-warehouse/'.
    18/03/14 20:41:09 INFO StateStoreCoordinatorRef: Registered StateStoreCoordinator endpoint
    18/03/14 20:41:11 INFO FileSourceStrategy: Pruning directories with: 
    18/03/14 20:41:11 INFO FileSourceStrategy: Post-Scan Filters: 
    18/03/14 20:41:11 INFO FileSourceStrategy: Output Data Schema: struct<value: string>
    18/03/14 20:41:11 INFO FileSourceScanExec: Pushed Filters: 
    18/03/14 20:41:12 INFO CodeGenerator: Code generated in 321.911944 ms
    18/03/14 20:41:12 INFO CodeGenerator: Code generated in 9.798824 ms
    18/03/14 20:41:12 INFO MemoryStore: Block broadcast_0 stored as values in memory (estimated size 213.6 KB, free 1998.1 MB)
    18/03/14 20:41:12 INFO MemoryStore: Block broadcast_0_piece0 stored as bytes in memory (estimated size 20.2 KB, free 1998.1 MB)
    18/03/14 20:41:12 INFO BlockManagerInfo: Added broadcast_0_piece0 in memory on 192.168.56.1:62282 (size: 20.2 KB, free: 1998.3 MB)
    18/03/14 20:41:12 INFO SparkContext: Created broadcast 0 from show at Test.scala:17
    18/03/14 20:41:13 INFO FileSourceScanExec: Planning scan with bin packing, max size: 4194417 bytes, open cost is considered as scanning 4194304 bytes.
    18/03/14 20:41:13 INFO SparkContext: Starting job: show at Test.scala:17
    18/03/14 20:41:13 INFO DAGScheduler: Got job 0 (show at Test.scala:17) with 1 output partitions
    18/03/14 20:41:13 INFO DAGScheduler: Final stage: ResultStage 0 (show at Test.scala:17)
    18/03/14 20:41:13 INFO DAGScheduler: Parents of final stage: List()
    18/03/14 20:41:13 INFO DAGScheduler: Missing parents: List()
    18/03/14 20:41:13 INFO DAGScheduler: Submitting ResultStage 0 (MapPartitionsRDD[5] at show at Test.scala:17), which has no missing parents
    18/03/14 20:41:13 INFO MemoryStore: Block broadcast_1 stored as values in memory (estimated size 13.0 KB, free 1998.1 MB)
    18/03/14 20:41:13 INFO MemoryStore: Block broadcast_1_piece0 stored as bytes in memory (estimated size 6.1 KB, free 1998.1 MB)
    18/03/14 20:41:13 INFO BlockManagerInfo: Added broadcast_1_piece0 in memory on 192.168.56.1:62282 (size: 6.1 KB, free: 1998.3 MB)
    18/03/14 20:41:13 INFO SparkContext: Created broadcast 1 from broadcast at DAGScheduler.scala:1006
    18/03/14 20:41:13 INFO DAGScheduler: Submitting 1 missing tasks from ResultStage 0 (MapPartitionsRDD[5] at show at Test.scala:17) (first 15 tasks are for partitions Vector(0))
    18/03/14 20:41:13 INFO TaskSchedulerImpl: Adding task set 0.0 with 1 tasks
    18/03/14 20:41:13 INFO TaskSetManager: Starting task 0.0 in stage 0.0 (TID 0, localhost, executor driver, partition 0, PROCESS_LOCAL, 5268 bytes)
    18/03/14 20:41:13 INFO Executor: Running task 0.0 in stage 0.0 (TID 0)
    18/03/14 20:41:13 INFO CodeGenerator: Code generated in 13.617205 ms
    18/03/14 20:41:13 INFO FileScanRDD: Reading File path: file:///E:/Mycode/datas/stu.txt, range: 0-113, partition values: [empty row]
    18/03/14 20:41:13 INFO CodeGenerator: Code generated in 11.971125 ms
    18/03/14 20:41:13 INFO Executor: Finished task 0.0 in stage 0.0 (TID 0). 1745 bytes result sent to driver
    18/03/14 20:41:13 INFO TaskSetManager: Finished task 0.0 in stage 0.0 (TID 0) in 258 ms on localhost (executor driver) (1/1)
    18/03/14 20:41:13 INFO TaskSchedulerImpl: Removed TaskSet 0.0, whose tasks have all completed, from pool 
    18/03/14 20:41:13 INFO DAGScheduler: ResultStage 0 (show at Test.scala:17) finished in 0.284 s
    18/03/14 20:41:13 INFO DAGScheduler: Job 0 finished: show at Test.scala:17, took 0.483521 s
    18/03/14 20:41:13 INFO CodeGenerator: Code generated in 23.334109 ms
    +------+
    | value|
    +------+
    |hadoop|
    |hadoop|
    |  java|
    |  java|
    | spark|
    | spark|
    |  hive|
    | hbase|
    | sqoop|
    | sqoop|
    | mysql|
    | redit|
    | flume|
    | flume|
    |  join|
    |   hue|
    | scala|
    |python|
    +------+
    
    18/03/14 20:41:13 INFO SparkContext: Invoking stop() from shutdown hook
    18/03/14 20:41:13 INFO SparkUI: Stopped Spark web UI at http://192.168.56.1:4040
    18/03/14 20:41:13 INFO MapOutputTrackerMasterEndpoint: MapOutputTrackerMasterEndpoint stopped!
    18/03/14 20:41:13 INFO MemoryStore: MemoryStore cleared
    18/03/14 20:41:13 INFO BlockManager: BlockManager stopped
    18/03/14 20:41:14 INFO BlockManagerMaster: BlockManagerMaster stopped
    18/03/14 20:41:14 INFO OutputCommitCoordinator$OutputCommitCoordinatorEndpoint: OutputCommitCoordinator stopped!
    18/03/14 20:41:14 INFO SparkContext: Successfully stopped SparkContext
    18/03/14 20:41:14 INFO ShutdownHookManager: Shutdown hook called
    18/03/14 20:41:14 INFO ShutdownHookManager: Deleting directory C:UsersBraveAppDataLocalTempspark-2c920b38-6a7f-4914-a2ef-9ee345492414
    
    Process finished with exit code 0

    增加这个字段

     运行结果

    分组统计

     

    运行结果

     把最后的代码放上来

    package com.spark.test
    
    import org.apache.spark.sql.SparkSession
    import org.apache.spark.{SparkConf, SparkContext}
    object Test {
    
      def main(args: Array[String]): Unit = {
       val spark= SparkSession
           .builder
             .master("local")
             .appName("HdfsTest")
               .getOrCreate()
         val filePart = "E://Mycode/datas/stu.txt"
    //     val rdd= spark.sparkContext.textFile(filePart)
    //     val lines= rdd.flatMap(x => x.split(" ")).map(x=>(x,1)).reduceByKey((a,b)=>(a+b)).collect().toList
        //     println(lines)
        import spark.implicits._
        val dataSet= spark.read.textFile(filePart)
          .flatMap(x => x.split(" "))
          .map(x=>(x,1)).groupBy("_1").count()
          .show()
        
      }
    }

    现在我们把程序打包

    我们把代码稍微改一下

    package com.spark.test
    
    import org.apache.spark.sql.SparkSession
    import org.apache.spark.{SparkConf, SparkContext}
    object Test {
    
      def main(args: Array[String]): Unit = {
       val spark= SparkSession
           .builder
             .master("local")
             .appName("HdfsTest")
               .getOrCreate()
    
        val filePart = args(0)
        //  val filePart = "E://Mycode/datas/stu.txt"
    //     val rdd= spark.sparkContext.textFile(filePart)
    //     val lines= rdd.flatMap(x => x.split(" ")).map(x=>(x,1)).reduceByKey((a,b)=>(a+b)).collect().toList
        //     println(lines)
        import spark.implicits._
        val dataSet= spark.read.textFile(filePart)
          .flatMap(x => x.split(" "))
          .map(x=>(x,1)).groupBy("_1").count()
          .show()
    
      }
    }

     

     

     

     

    把这些都剔除掉

     剩下这两个

     

     

    打包完成了

    把这个包上传到我们的集群上

     

     这个是我们的数据文件

     

    我们把数据文件上传的hdfs上面去,先启动hdfs

     

     

     

     同时记得把zookeeper也启动了,不然会出问题的

     

    现在hdfs上创建一个目录

     

     把本地的文件上传

     

     我们在集群上跑一下

    bin/spark-submit --master local[2] /opt/jars/sparkStu.jar hdfs://bigdata-pro01.kfk.com:9000/user/datas/stu.txt

    可以看到跑下来的结果

     

  • 相关阅读:
    NanUI for Winform发布,让Winform界面设计拥有无限可能
    新浪微博.Net SDK第三版源代码和示例【最后一次更新了】
    写个C#命令行参数解析的小工具
    Mac安装Windows 10的简明教程
    自己动手,让Entity Framework Power Tools在VS2015重放光彩
    C++CLI使用.net委托,*Callback注意"this"
    【转】IIS上的反向代理
    asp.net mvc 验证码
    win2008R2 下解决关于mysql odbc无法正常工作问题
    中国健康医学教育网
  • 原文地址:https://www.cnblogs.com/braveym/p/8562917.html
Copyright © 2011-2022 走看看