zoukankan      html  css  js  c++  java
  • hive中使用spark执行引擎的常用参数

    set hive.execution.engine=spark;
    set hive.exec.parallel=true;
    set hive.exec.parallel.thread.number=8;
    set hive.exec.compress.intermediate=true;
    set hive.intermediate.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
    set hive.intermediate.compression.type=BLOCK;
    set hive.exec.compress.output=true;
    set mapred.output.compression.codec=org.apache.hadoop.io.compress.GzipCodec;
    set mapred.output.compression.type=BLOCK;

    set mapreduce.job.queuename=uat2;(设置hive的运行队列)

    set hive.exec.reducers.max=2400;
    set mapreduce.job.reduces=2004;
    set hive.exec.reducers.bytes.per.reducer=24;

    set mapred.child.java.opts = -Xmx3024m;
    set mapreduce.reduce.memory.mb =4096;
    set mapreduce.map.memory.mb= 4096;

    set hive.exec.parallel.thread.number=16;

    -hiveconf  hive.exec.parallel.thread.number=16 

  • 相关阅读:
    django学习笔记1
    排序多重排序
    06计算列
    填充日期序列
    行,列单元格
    读取excel文件
    监控文本
    天干地支纪年法
    Mysql基础
    JDBC基础
  • 原文地址:https://www.cnblogs.com/ssqq5200936/p/13704220.html
Copyright © 2011-2022 走看看