zoukankan      html  css  js  c++  java
  • hue安装与部署

    运行环境

    • centOS 6.6
    • hadoop 2.4.0
    • hive 1.2.0
    • spark 1.4.1
    • HUE 3.9

    介绍:

    Hue是一个开源的Apache Hadoop UI系统,最早是由Cloudera Desktop演化而来,由Cloudera贡献给开源社区,它是基于Python Web框架Django实现的。通过使用Hue我们可以在浏览器端的Web控制台上与Hadoop集群进行交互来分析处理数据,例如操作HDFS上的数据,运行MapReduce Job等等。

    HUE功能:

    • 数据库查询编辑器,支持 Hive, Impala, MySql, PostGres, Sqlite and Oracle
    • 动态查询仪表盘,支持 Solr
    • 支持 Spark 编辑器和仪表盘
    • 浏览器查看状态,支持 YARN, HDFS, Hive table Metastore, HBase, ZooKeeper
    • 支持 Pig Editor, Sqoop2, Oozie workflows 编辑器和仪表盘
    • 将数据导入hdfs

    在centOS编译需要的依赖

    • ant
    • asciidoc
    • cyrus-sasl-devel
    • cyrus-sasl-gssapi
    • gcc
    • gcc-c++
    • krb5-devel
    • libtidy (for unit tests only)
    • libxml2-devel
    • libxslt-devel
    • make
    • mvn (from maven package or maven3 tarball)
    • mysql
    • mysql-devel
    • openldap-devel
    • python-devel
    • sqlite-devel
    • openssl-devel (for version 7+)
    • gmp-devel

    添加maven源

    sudo wget http://repos.fedorapeople.org/repos/dchen/apache-maven/epel-apache-maven.repo -O /etc/yum.repos.d/epel-apache-maven.repo

    安装依赖

    sudo yum install apache-maven ant asciidoc cyrus-sasl-devel cyrus-sasl-gssapi gcc gcc-c++ krb5-devel libxml2-devel libxslt-devel make mysql mysql-devel openldap-devel python-devel sqlite-devel gmp-devel

    源码下载

    下载,编译源码(hue 3.9),编译时间较长

    git clone https://github.com/cloudera/hue.git branch-3.9
    cd branch-3.9
    make apps
    

    编译完后也可以选择安装

    make install
    

    desktop/conf/pseudo-distributed.ini中修改配置
    假设所有服务都运行在testHost下

    [desktop]
       # hue webServer 地址和端口号
       http_host=192.168.47.213
       http_port=8001
    
      # Webserver runs as this user
       server_user=hue
       server_group=hue
    
      # This should be the Hue admin and proxy user
       default_user=hue
    
      # This should be the hadoop cluster admin
       default_hdfs_superuser=admin
    
    [hadoop]
        fs_defaultfs=tdhdfs
        webhdfs_url=http://192.168.47.216:50070/webhdfs/v1/
        hadoop_conf_dir=/usr/install/hadoop/etc/hadoop/conf
    
    [[yarn_clusters]]
          # Enter the host on which you are running the ResourceManager
          resourcemanager_host=192.168.47.216
    
          # The port where the ResourceManager IPC listens on
          resourcemanager_port=23140
    
          # Whether to submit jobs to this cluster
          submit_to=True
    
          # URL of the ResourceManager API
          resourcemanager_api_url=http://192.168.47.216:23188
    
          # URL of the ProxyServer API
          proxy_api_url=http://192.168.47.216:8888
    
          # URL of the HistoryServer API
          history_server_api_url=http://192.168.47.216:19888
    
    [beeswax]
      # If Kerberos security is enabled, use fully-qualified domain name (FQDN).
      hive_server_host=192.168.47.213
    
      # Port where HiveServer2 Thrift server runs on.
       hive_server_port=10000
    
      # Hive configuration directory, where hive-site.xml is located
       hive_conf_dir=/usr/install/apache-hive-1.2.0-bin/conf
    
    [zookeeper]
    host_ports=zk1:2181,zk2:2181,zk3:2181
    

    启动

    build/env/bin/supervisor    
    

    运行界面

    这里写图片描述

    这里写图片描述

    这里写图片描述

    一些问题

    使用hive 0.13.0时会报

    can't fetch result之类的错误
    

    是版本原因,升级hive版本为1.2.0问题消失

    连接hive error

    Could not install table: Error creating table sample_07: Could not start SASL: Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found (code THRIFTTRANSPORT): TTransportException('Could not start SASL: Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found',).

    解决方案:

    yum -y install cyrus-sasl-plain cyrus-sasl-devel cyrus-sasl-gssapi
    
  • 相关阅读:
    Eclipse配置SVN的几种方法及使用详情
    重新定位svn地址的方法(windows和linux),svn switch(sw)的帮助信息
    SVN版本库修改URL路径或者IP地址
    DEA中MAVEN项目有多个子目录,如何加载构建
    DBA的40条军规
    搜索 比MySQL快10倍?这可能是目前AWS Aurora最详解读!
    HttpClient 4.5.3 get和post请求
    基于HttpClient4.5.2实现的HttpClient工具类
    httpClient使用总结
    为何只能在其关联的线程内启动timer?(Qt会检查一致性,否则就不执行)
  • 原文地址:https://www.cnblogs.com/xiaomaohai/p/6158045.html
Copyright © 2011-2022 走看看