
Building Hue from Source Through Full Hive Support


Installing Hue

Test cluster: hadoop101 hadoop102 hadoop103

Cluster hardware: 3 Alibaba Cloud servers, CentOS 7.5, 2 cores, 8 GB memory each

Component layout:

hue: hadoop102

hadoop: hadoop101 hadoop102 hadoop103

hive: hadoop101

mysql: hadoop101

zookeeper: hadoop101 hadoop102 hadoop103

kafka: hadoop101 hadoop102 hadoop103

spark: hadoop101

You can also use a pre-built Hue directly and simply adapt the configuration to your own cluster.

This article assumes everything in the cluster above, except Hue itself, is already installed and configured.

Note: adjust the node names in the configuration below to match your own cluster.

(1) Install Hue on hadoop102: create a software folder, upload the source archive, and extract it

[root@hadoop102 software]# unzip hue-master.zip -d /opt/module/
           

(2) Install the build dependencies

[root@hadoop102 software]# cd /opt/module/hue-master/
[root@hadoop102 hue-master]# sudo yum install ant asciidoc cyrus-sasl-devel cyrus-sasl-gssapi cyrus-sasl-plain gcc gcc-c++ krb5-devel libffi-devel libxml2-devel libxslt-devel make mysql mysql-devel openldap-devel python-devel sqlite-devel gmp-devel
           

(3) Compile and install; a build folder is generated once compilation finishes

[root@hadoop102 hue-master]# make apps
           

(4) Modify the Hadoop configuration files; because the cluster runs in HA mode, HttpFS is used for HDFS access

[root@hadoop102 hue-master]# cd /opt/module/hadoop-3.1.3/etc/hadoop/
[root@hadoop102 hadoop]# vim hdfs-site.xml
<property>
  <name>dfs.webhdfs.enabled</name>
  <value>true</value>
</property>
[root@hadoop102 hadoop]# vim core-site.xml
<property>
  <name>hadoop.proxyuser.root.hosts</name>
  <value>*</value>
</property>
<property>
  <name>hadoop.proxyuser.root.groups</name>
  <value>*</value>
</property>
<property>
  <name>hadoop.proxyuser.hue.hosts</name>
  <value>*</value>
</property>
<property>
  <name>hadoop.proxyuser.hue.groups</name>
  <value>*</value>	
</property>
[root@hadoop102 hadoop]# vim httpfs-site.xml
<property>
  <name>hadoop.proxyuser.root.hosts</name>
  <value>*</value>
</property>
<property>
  <name>hadoop.proxyuser.root.groups</name>
  <value>*</value>
</property>
<property>
  <name>hadoop.proxyuser.hue.hosts</name>
  <value>*</value>
</property>
<property>
  <name>hadoop.proxyuser.hue.groups</name>
  <value>*</value>
</property>
           

(5) Distribute the configuration files

[root@hadoop102 etc]# pwd
/opt/module/hadoop-3.1.3/etc
[root@hadoop102 etc]# scp -r hadoop/ hadoop101:/opt/module/hadoop-3.1.3/etc/
[root@hadoop102 etc]# scp -r hadoop/ hadoop103:/opt/module/hadoop-3.1.3/etc/
           

(6) Modify the Hue configuration file to integrate HDFS

[root@hadoop102 hadoop]# cd /opt/module/hue-master/desktop/conf
[root@hadoop102 conf]# vim pseudo-distributed.ini
[desktop]
  http_host=hadoop102
  http_port=8000

[hadoop]
  [[hdfs_clusters]]
    [[[default]]]
      # Enter the filesystem uri
      fs_defaultfs=hdfs://mycluster:8020
      logical_name=mycluster
      webhdfs_url=http://hadoop102:14000/webhdfs/v1
      hadoop_conf_dir=/opt/module/hadoop-3.1.3/etc/hadoop/conf
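With webhdfs_url pointing at HttpFS, every HDFS operation Hue issues becomes a WebHDFS-style REST call against hadoop102:14000. As a rough sketch (the path and user below are illustrative assumptions, not from this guide), the URL for a directory listing looks like:

```python
# Sketch of the HttpFS/WebHDFS REST URL built from webhdfs_url above.
# hadoop102:14000 comes from pseudo-distributed.ini; the path and
# user.name values here are illustrative only.
WEBHDFS_URL = "http://hadoop102:14000/webhdfs/v1"

def liststatus_url(path, user="hue"):
    """URL of a WebHDFS LISTSTATUS (directory listing) request for `path`."""
    return f"{WEBHDFS_URL}{path}?op=LISTSTATUS&user.name={user}"

print(liststatus_url("/user/hue"))
```

Fetching such a URL with curl on any node is also a quick way to confirm HttpFS and the proxyuser settings from step (4) are working.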
           

(7) Integrate YARN. The logical_name values below must match the configuration in yarn-site.xml. Note: be sure to uncomment the [[[ha]]] section header, otherwise Hue will report an error

[[yarn_clusters]]

    [[[default]]]
      # Enter the host on which you are running the ResourceManager
      ## resourcemanager_host=mycluster

      # The port where the ResourceManager IPC listens on
      ## resourcemanager_port=8032

      # Whether to submit jobs to this cluster
      submit_to=True

      # Resource Manager logical name (required for HA)
      logical_name=rm1

      # Change this if your YARN cluster is Kerberos-secured
      ## security_enabled=false

      # URL of the ResourceManager API
      resourcemanager_api_url=http://hadoop101:8088

      # URL of the ProxyServer API
      ## proxy_api_url=http://hadoop101:8088

      # URL of the HistoryServer API
      ## history_server_api_url=http://localhost:19888

    [[[ha]]]
      # Resource Manager logical name (required for HA)
      logical_name=rm2

      # Un-comment to enable
      submit_to=True

      # URL of the ResourceManager API
      resourcemanager_api_url=http://hadoop103:8088

      # ...
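Why both entries set submit_to=True: with ResourceManager HA, only one of rm1/rm2 is active at a time, and a client has to probe the configured API URLs to find it. A toy sketch of that failover idea (this is not Hue's actual implementation; the probe function is injected so the sketch needs no cluster):

```python
# Toy failover sketch (NOT Hue's real code): given the two
# resourcemanager_api_url values configured above, return the first one
# whose health probe succeeds. `probe` is injected for testability.
RM_APIS = ["http://hadoop101:8088", "http://hadoop103:8088"]

def pick_active_rm(probe, apis=RM_APIS):
    """Return the first ResourceManager API URL for which probe(url) is True."""
    for url in apis:
        if probe(url):
            return url
    raise RuntimeError("no active ResourceManager found")

# Usage with a stub probe that pretends rm1 (hadoop101) is down:
print(pick_active_rm(lambda url: "hadoop103" in url))
```

In a real probe you would hit the RM REST API (e.g. /ws/v1/cluster/info) and check the HTTP response.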
           

(8) Integrate MySQL

[[database]]
    # Database engine is typically one of:
    # postgresql_psycopg2, mysql, sqlite3 or oracle.
    #
    # Note that for sqlite3, 'name', below is a path to the filename. For other backends, it is the database name.
    # Note for Oracle, options={"threaded":true} must be set in order to avoid crashes.
    # Note for Oracle, you can use the Oracle Service Name by setting "host=" and "port=" and then "name=<host>:<port>/<service_name>".
    # Note for MariaDB use the 'mysql' engine.
    engine=mysql
    host=hadoop101
    port=3306
    user=root
    password=123456
    # conn_max_age option to make database connection persistent value in seconds
    # https://docs.djangoproject.com/en/1.9/ref/databases/#persistent-connections
    ## conn_max_age=0
    # Execute this script to produce the database password. This will be used when 'password' is not set.
    ## password_script=/path/script
    name=hue

Create the hue database in MySQL:

[root@hadoop101 apache-hive-3.1.2-bin]# mysql -uroot -p123456
mysql> create database hue;

[[[mysql]]]
      # Name to show in the UI.
      ## nice_name="My SQL DB"

      # For MySQL and PostgreSQL, name is the name of the database.
      # For Oracle, Name is instance of the Oracle server. For express edition
      # this is 'xe' by default.
      name=huemetastore

      # Database backend to use. This can be:
      # 1. mysql
      # 2. postgresql
      # 3. oracle
      engine=mysql
      host=hadoop101
      port=3306
      user=root
      password=123456
           

(9) Integrate Hive

[beeswax]

  # Host where HiveServer2 is running.
  # If Kerberos security is enabled, use fully-qualified domain name (FQDN).
  hive_server_host=hadoop101

  # Port where HiveServer2 Thrift server runs on.
  hive_server_port=10000
  hive_conf_dir=/opt/module/apache-hive-3.1.2-bin/conf/
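Hue can only talk to Hive once something is actually listening on hive_server_port. A quick way to verify that from any node is a plain TCP connect; the sketch below checks reachability only, not the HiveServer2 Thrift handshake itself:

```python
# Minimal reachability check for HiveServer2 (host/port from [beeswax] above).
# A successful TCP connect only proves something is listening on the port;
# it does not validate the Thrift protocol.
import socket

def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Example: port_open("hadoop101", 10000) should be True once step (11) is done.
```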
           

(10) Stop the cluster and restart it

[root@hadoop101 software]# /opt/module/apache-zookeeper-3.5.7-bin/bin/zkServer.sh start
[root@hadoop102 software]# /opt/module/apache-zookeeper-3.5.7-bin/bin/zkServer.sh start
[root@hadoop103 software]# /opt/module/apache-zookeeper-3.5.7-bin/bin/zkServer.sh start
[root@hadoop101 software]# start-all.sh
           

(11) Start the Hive services

[root@hadoop101 apache-hive-3.1.2-bin]# nohup hive --service metastore >metastore.log 2>&1 &
[root@hadoop101 apache-hive-3.1.2-bin]# nohup hive --service hiveserver2 >hiveserver2.log 2>&1 &
           

(12) Add a hue user and change the folder ownership

[root@hadoop102 hue-master]# useradd hue
[root@hadoop102 hue-master]# passwd hue
[root@hadoop102 hue-master]# chown -R hue:hue /opt/module/hue-master/
           

(13) Initialize the database

[root@hadoop102 hue-master]# build/env/bin/hue syncdb
[root@hadoop102 hue-master]# build/env/bin/hue migrate
           

(14) Start Hue. Note: both Hive services, metastore and hiveserver2, must already be running, with hiveserver2 occupying port 10000

[root@hadoop102 hue-master]# build/env/bin/supervisor

           

(15) Log in to Hue and run Hive SQL

