1. Hive installation (omitted)
2. Importing data
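(1) The field delimiter is specified when the table is created:
create table if not exists test(name string, age int) row format delimited fields terminated by '\t';
Once this is set, each line of imported data is split into fields on \t, that is, the tab character: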
zhangsan 30
lisi 50
wangwu 60
peiqi 6
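As a rough sketch of how such a file might be loaded and checked: the path /tmp/test.txt below is a hypothetical local file, not something from the original notes, and it is assumed to hold the rows above with a real tab character between the two fields.

```sql
-- Assumption: /tmp/test.txt is a hypothetical local file containing the sample rows,
-- with a tab character between name and age.
LOAD DATA LOCAL INPATH '/tmp/test.txt' INTO TABLE test;

-- Each row should come back split into two columns. If the delimiter in the file
-- does not match the one declared on the table, the whole line lands in name
-- and age is returned as NULL.
SELECT name, age FROM test;
```

A NULL in the age column is usually the quickest sign that the file's separator and the table's declared delimiter disagree.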
(2) Creating a Hive external table
CREATE EXTERNAL TABLE tt (
stime STRING,
assid STRING,
ssid STRING,
other STRING,
mac STRING,
servicecode STRING,
host STRING,
apname STRING,
termtype STRING
) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' LOCATION '/logstash';
Hive then registers the table metadata against the files already under /logstash, so no LOAD is needed.
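The behaviour described above can be checked with a few statements. This is only a sketch, assuming the external table tt from the statement above exists and that /logstash already contains '|'-delimited files; the column selection is arbitrary.

```sql
-- Table Type should read EXTERNAL_TABLE, and Location should end in /logstash.
DESCRIBE FORMATTED tt;

-- Rows are read directly from whatever files currently sit under /logstash.
SELECT stime, mac, host FROM tt LIMIT 10;

-- Dropping an external table removes only the metadata; the files stay in HDFS.
-- Run this only if you actually want to unregister the table.
DROP TABLE tt;
```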
(3) Creating a Hive internal (managed) table
CREATE TABLE tt (
stime STRING,
assid STRING,
ssid STRING,
other STRING,
mac STRING,
servicecode STRING,
host STRING,
apname STRING,
termtype STRING
) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n';
LOAD DATA INPATH 'hdfs://cdh01:8020/logstash/test.txt' overwrite INTO TABLE bigdata_odl.mac;
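For an internal table, the data has to be brought in with LOAD DATA, which moves the file into the table's directory in the Hive warehouse; dropping the table then deletes the data as well. Note: in 'hdfs://cdh01:8020/logstash/test.txt', cdh01:8020 is the NameNode address of the CDH cluster used here. On another cluster, substitute that cluster's own NameNode address, or give the path without scheme and host, '/logstash/test.txt', so that it resolves against the configured default filesystem.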
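A minimal sketch of the managed-table flow, assuming tt was created without the EXTERNAL keyword and that /logstash/test.txt exists in HDFS. The table name tt is taken from the CREATE statement in this section; adjust it if you load into a different table. The dfs command is available in the Hive CLI; whether it is permitted in other clients depends on the setup.

```sql
-- Table Type should read MANAGED_TABLE; Location is under the Hive warehouse directory.
DESCRIBE FORMATTED tt;

-- A path without scheme or host is resolved against the cluster's fs.defaultFS,
-- so this form does not depend on a particular NameNode address.
LOAD DATA INPATH '/logstash/test.txt' OVERWRITE INTO TABLE tt;

-- LOAD DATA INPATH moves the file rather than copying it,
-- so test.txt should no longer be listed in the source directory.
dfs -ls /logstash;
```

Because the file is moved into the warehouse, dropping the managed table afterwards removes the only copy of the data.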