  • Java Spark: problem with remote submission

    When an application is submitted from IntelliJ IDEA, the Spark web UI shows the cluster continuously adding and removing executors, and the job never runs. The code and logs are as follows:

     Code:

    SparkConf conf = new SparkConf()
            .setSparkHome(sparkHome)
            .setAppName(appName)
            .setMaster(master);
    conf.set("spark.serializer", "org.apache.spark.serializer.KryoSerializer");
    or
    SparkSession spark = SparkSession.builder().master("spark://server01:7077").appName("HBASEDATA")
            .getOrCreate();

    Console output:

    20/07/21 10:29:06 INFO SparkUI: Bound SparkUI to 0.0.0.0, and started at http://DESKTOP-A56927L:4040
    20/07/21 10:29:06 INFO StandaloneAppClient$ClientEndpoint: Connecting to master spark://server01:7077...
    20/07/21 10:29:06 INFO TransportClientFactory: Successfully created connection to bikini-bottom/192.168.0.91:7077 after 149 ms (0 ms spent in bootstraps)
    ... ...
    20/07/21 10:29:34 INFO CoarseGrainedSchedulerBackend$DriverEndpoint: Asked to remove non-existent executor 13
    20/07/21 10:29:34 INFO StandaloneSchedulerBackend: Granted executor ID app-20190721102906-0002/14 on hostPort 192.168.0.91:46381 with 1 core(s), 800.0 MB RAM
    20/07/21 10:29:34 INFO StandaloneAppClient$ClientEndpoint: Executor updated: app-20190721102906-0002/14 is now RUNNING
    20/07/21 10:29:34 INFO StandaloneAppClient$ClientEndpoint: Executor updated: app-20190721102906-0002/12 is now EXITED (Command exited with code 1)
    20/07/21 10:29:34 INFO StandaloneSchedulerBackend: Executor app-20190721102906-0002/12 removed: Command exited with code 1
    20/07/21 10:29:34 INFO BlockManagerMaster: Removal of executor 12 requested

    Executor allocation on the Spark cluster:

     

     Error reported by the executor:

     ---------------------------------------------------------------------------------------------------------------------

    Solution:

    Configure the host:
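
    One plausible reading of this step, stated here as an assumption because the original screenshot is not available, is that the driver's hostname has to resolve to a reachable IP address on every worker node, for example through an entry in the hosts file. The following is a minimal sketch, using only the JDK, for checking that resolution on a given machine. The class name HostResolutionCheck is hypothetical, and DESKTOP-T5HC2II is simply the driver hostname used in the configuration in this post.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.Optional;

// Hypothetical helper, not part of Spark: checks whether a hostname resolves on this machine.
final class HostResolutionCheck {

    // Returns the resolved IP address, or an empty Optional if the name cannot be resolved.
    static Optional<String> resolve(String host) {
        try {
            return Optional.of(InetAddress.getByName(host).getHostAddress());
        } catch (UnknownHostException e) {
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        // Pass the driver's hostname as the first argument; the default is the one used in this post.
        String host = args.length > 0 ? args[0] : "DESKTOP-T5HC2II";
        Optional<String> ip = resolve(host);
        System.out.println(ip.map(a -> host + " resolves to " + a)
                             .orElse(host + " does not resolve; check the hosts file"));
    }
}
```

    If the check is run on a worker node and the name does not resolve, executors started on that node would have no way to connect back to the driver by that name.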

     

    Modify the code:

    SparkConf conf = new SparkConf()
            .setSparkHome(sparkHome)
            .setAppName(appName)
            // Hostname of the driver
            .set("spark.driver.host", "DESKTOP-T5HC2II")
            // Port the driver listens on
            .set("spark.driver.port", "9095")
            // Memory per executor
            .set("spark.executor.memory", "800m")
            // Number of CPU cores for the driver
            .set("spark.driver.cores", "1")
            .setMaster(master);
    conf.set("spark.serializer", "org.apache.spark.serializer.KryoSerializer");
    or
    SparkSession spark = SparkSession.builder().master("spark://server01:7077").appName("HBASEDATA")
            // Hostname of the driver
            .config("spark.driver.host", "DESKTOP-T5HC2II")
            // Port the driver listens on
            .config("spark.driver.port", "9092")
            .getOrCreate();
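
    A fixed driver port only helps if nothing else on the development machine is already listening on it. The sketch below, which uses only the JDK, validates the two driver settings and checks whether the chosen port can currently be bound. The class DriverNetworkSettings and its methods are hypothetical helpers and not part of the Spark API. The host and port values are the ones from the code above.

```java
import java.io.IOException;
import java.net.ServerSocket;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Spark: prepares the two driver network settings.
final class DriverNetworkSettings {

    // Returns true if a server socket can currently be bound to the given port on this machine.
    static boolean isPortFree(int port) {
        try (ServerSocket socket = new ServerSocket(port)) {
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    // Builds the settings map after basic validation of the arguments.
    static Map<String, String> build(String driverHost, int driverPort) {
        if (driverHost == null || driverHost.trim().isEmpty()) {
            throw new IllegalArgumentException("driver host must not be empty");
        }
        if (driverPort < 1 || driverPort > 65535) {
            throw new IllegalArgumentException("driver port out of range: " + driverPort);
        }
        Map<String, String> settings = new LinkedHashMap<>();
        settings.put("spark.driver.host", driverHost);
        settings.put("spark.driver.port", Integer.toString(driverPort));
        return settings;
    }

    public static void main(String[] args) {
        Map<String, String> settings = build("DESKTOP-T5HC2II", 9095);
        System.out.println(settings + ", port free: " + isPortFree(9095));
        // With Spark on the classpath, each entry could then be applied with conf.set(key, value).
    }
}
```

    The port check is only a snapshot taken before the driver starts. It does not guarantee that worker nodes can reach the port through a firewall.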
  • Original article: https://www.cnblogs.com/Mr-lin66/p/13519103.html