zoukankan html css js c++ java

2.hive里的增删改查

1.hive的增删改查

查询数据库

    hive> show databases;
    OK
    default
    Time taken: 0.254 seconds, Fetched: 1 row(s)
    hive>
    #defalut是默认数据库

创建数据库

hive> create database liuyao;
OK
Time taken: 0.055 seconds

在hdfs里查看相关库数据

#hdfs dfs -lsr /

lsr: DEPRECATED: Please use 'ls -R' instead.
drwxr-xr-x   - root supergroup          0 2017-07-12 10:45 /user/hive
drwxr-xr-x   - root supergroup          0 2017-07-12 10:45 /user/hive/warehouse
drwxr-xr-x   - root supergroup          0 2017-07-12 10:45 /user/hive/warehouse/liuyao.db
#每创建一个数据库都会在对应的hdfs里的/user/hive/warehouse生成一个xx.db的目录
#一个数据库对应一个目录

在mysql里查看相关库数据

mysql> use dbhive;
mysql> select * from DBS;
+-------+-----------------------+------------------------------------------------+---------+------------+------------+
| DB_ID | DESC                  | DB_LOCATION_URI                                | NAME    | OWNER_NAME | OWNER_TYPE |
+-------+-----------------------+------------------------------------------------+---------+------------+------------+
|     1 | Default Hive database | hdfs://hadoop-1/user/hive/warehouse            | default | public     | ROLE       |
|     2 | NULL                  | hdfs://hadoop-1/user/hive/warehouse/liuyao.db  | liuyao  | root       | USER       |
+-------+-----------------------+------------------------------------------------+---------+------------+------------+
#创建的库相关信息都维护在mysql里

创建表

hive> use liuyao;
OK
Time taken: 0.057 seconds
hive> create table test(id int, name string);
OK
Time taken: 0.458 seconds
hive>

在mysql里查看相关表数据

mysql> select * from TBLS;
+--------+-------------+-------+------------------+-------+-----------+-------+----------+---------------+--------------------+--------------------+
| TBL_ID | CREATE_TIME | DB_ID | LAST_ACCESS_TIME | OWNER | RETENTION | SD_ID | TBL_NAME | TBL_TYPE      | VIEW_EXPANDED_TEXT | VIEW_ORIGINAL_TEXT |
+--------+-------------+-------+------------------+-------+-----------+-------+----------+---------------+--------------------+--------------------+
|      1 |  1499871180 |     2 |                0 | root  |         0 |     1 | test     | MANAGED_TABLE | NULL               | NULL               |
+--------+-------------+-------+------------------+-------+-----------+-------+----------+---------------+--------------------+--------------------+
1 row in set (0.00 sec)
#DB_ID关联管理库ID

在hdfs里产生相关表数据

#hdfs dfs -lsr /
drwxr-xr-x   - root supergroup          0 2017-07-12 10:53 /user/hive/warehouse/liuyao.db/test

插入数据

hive> use liuyao;
OK
Time taken: 0.133 seconds
hive> insert into test values(2,"tom");
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20170712111559_38d90f7b-845d-441c-9fc2-77ceef15c446
Total jobs = 3
Launching Job 1 out of 3
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_1498146166646_0003, Tracking URL = http://hadoop-1:8088/proxy/application_1498146166646_0003/
Kill Command = /soft/hadoop-2.7.3/bin/hadoop job  -kill job_1498146166646_0003
Hadoop job information for Stage-1: number of mappers: 1; number of reducers: 0
2017-07-12 11:16:08,543 Stage-1 map = 0%,  reduce = 0%
2017-07-12 11:16:14,960 Stage-1 map = 100%,  reduce = 0%, Cumulative CPU 1.65 sec
MapReduce Total cumulative CPU time: 1 seconds 650 msec
Ended Job = job_1498146166646_0003
Stage-4 is selected by condition resolver.
Stage-3 is filtered out by condition resolver.
Stage-5 is filtered out by condition resolver.
Moving data to directory hdfs://hadoop-1/user/hive/warehouse/liuyao.db/test/.hive-staging_hive_2017-07-12_11-15-59_225_3827478452355860952-1/-ext-10000
Loading data to table liuyao.test
MapReduce Jobs Launched:
Stage-Stage-1: Map: 1   Cumulative CPU: 1.65 sec   HDFS Read: 4060 HDFS Write: 73 SUCCESS
Total MapReduce CPU Time Spent: 1 seconds 650 msec
OK
Time taken: 17.353 seconds
hive>

insert会产生mr作业

查看数据

hive>
hive> select * from test;
OK
1	tom
2	tom
Time taken: 0.276 seconds, Fetched: 2 row(s)
hive>
#select * from test 默认从hdfs里的相关文件里读取数据

查看全文

相关阅读:
linux挂载
 kafka
使用python27+flask
arcpy 10.8计算最短路径，并复制出结果
 arcgis pro2.5使用试用
 arcgis engine create featureclass , gp工具
 python和c#的 opencv文字区域识别
 C#调用带参数并输出控制台的python的EXE程序
 arcgis10.8中 python27打包exe文件
 arcgis engine指定范围导出屏幕图片

原文地址：https://www.cnblogs.com/liu-yao/p/2hive-li-de-zeng-shan-gai-cha.html