zoukankan      html  css  js  c++  java
  • spark 相关配置 shuffle 相关配置选项

    在master的/conf/spark-defaults.conf中配置

    spark.shuffle.service.enabled true

    spark.shuffle.service.port 7337

    但是在从节点的spark-defaults.conf中注释上面两个配置选项,不然web 界面中将看不到从节点

    spark-defaults.conf:

    spark.local.dir /mnt/diskb/sparklocal,/mnt/diskc/sparklocal,/mnt/diskd/sparklocal,/mnt/diske/sparklocal,/mnt/diskf/sparklocal,/mnt/diskg/sparklocal //shuffle 中产生的临时文件的路径
    spark.eventLog.enabled true                               //记录spark日志
    spark.eventLog.dir hdfs://nameservice1/spark-log  //日志保存在hdfs上
    spark.network.timeout 450

    spark.dynamicAllocation.enabled true

    spark.dynamicAllocation.minExecutors 8

    spark.dynamicAllocation.maxExecutors 30
    spark.dynamicAllocation.schedulerBacklogTimeout 1s
    spark.dynamicAllocation.sustainedSchedulerBacklogTimeout 5s

    spark.io.compression.codec snappy

    spark-env.sh:

    export JAVA_HOME=/usr/java/jdk1.7.0_67-cloudera
    export SPARK_MASTER_IP=10.130.2.20
    export SPARK_MASTER_PORT=7077
    export SPARK_WORKER_CORES=12
    export SPARK_EXECUTOR_INSTANCES=1
    export SPARK_WORKER_MEMORY=48g
    export SPARK_WORKER_DIR=/mnt/diskb/sparkwork,/mnt/diskc/sparkwork,/mnt/diskd/sparkwork,/mnt/diske/sparkwork,/mnt/diskf/sparkwork,/mnt/diskg/sparkwork
    export SPARK_LOCAL_DIRS=/mnt/diske/sparklocal,/mnt/diskb/sparklocal,/mnt/diskc/sparklocal,/mnt/diskd/sparklocal,/mnt/diskf/sparklocal,/mnt/diskg/sparklocal
    export HADOOP_HOME=/opt/cloudera/parcels/CDH/lib/hadoop
    export HADOOP_CONF_DIR=/etc/hadoop/conf/
    export SPARK_DAEMON_MEMORY=12g
    #export SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=ZOOKEEPER -Dspark.deploy.zookeeper.url=bdc40.hexun.com:2181,bdc41.hexun.com:2181,bdc46.hexun.com:2181,bdc53.hexun.com:2181,bdc54.hexun.com:2181 -Dspark.deploy.zookeeper.dir=/spark"
    #export SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=FILESYSTEM -Dspark.deploy.recoveryDirectory=/opt/modules/spark/recovery"
    export JAVA_LIBRARY_PATH=$JAVA_LIBRARY_PATH:$HADOOP_HOME/lib/native
    export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$HADOOP_HOME/lib/native
    export SPARK_LIBRARY_PATH=$SPARK_LIBRARY_PATH:$HADOOP_HOME/lib/native
    export SPARK_CLASSPATH=$SPARK_CLASSPATH:$HADOOP_HOME/lib/snappy-java-1.0.4.1.jar

  • 相关阅读:
    Spark SQL学习笔记
    《空空》陈粒
    支持向量机
    p.Value越显著,X变量越重要嘛?
    回归的武林绝学
    Neural Collaborative Filtering论文笔记
    make 学习笔记
    『并发包入坑指北』之阻塞队列
    线程池中你不容错过的一些细节
    利用策略模式优化过多 if else 代码
  • 原文地址:https://www.cnblogs.com/sunt9/p/6685907.html
Copyright © 2011-2022 走看看