  • HIVE HOW TO LOAD DATA_Candice Jing_Baidu Space

    HIVE HOW TO LOAD DATA

    How to load data into Hive
    --------------------------
    To load data into Hive, we need to tell Hive the format of the data
    through a CREATE TABLE statement:

    File format: the data has to be stored as Text or SequenceFile.
    Row format:
      If the data is in delimited format, use MetadataTypedColumnsetSerDe.
      If the data is in delimited format and has more than one level of delimiter,
      use DynamicSerDe with TCTLSeparatedProtocol.
      If the data is a serialized Thrift object, use ThriftSerDe.
    The steps to load the data:
    1 Create a table:

    CREATE TABLE t (foo STRING, bar STRING)
    ROW FORMAT DELIMITED
    FIELDS TERMINATED BY '\t'
    STORED AS TEXTFILE; 

    CREATE TABLE t2 (foo STRING, bar ARRAY<STRING>)
    ROW FORMAT DELIMITED
    FIELDS TERMINATED BY '\t'
    COLLECTION ITEMS TERMINATED BY ','
    STORED AS TEXTFILE; 

    CREATE TABLE t3 (foo STRING, bar MAP<STRING,STRING>)
    ROW FORMAT DELIMITED
    FIELDS TERMINATED BY '\t'
    COLLECTION ITEMS TERMINATED BY ','
    MAP KEYS TERMINATED BY ':'
    STORED AS TEXTFILE;

    CREATE TABLE t4 (foo STRING, bar MAP<STRING,STRING>)
    ROW FORMAT SERIALIZER 'org.apache.hadoop.hive.serde2.MetadataTypedColumnsetSerDe'
    WITH SERDEPROPERTIES ('columns'='foo,bar','SERIALIZATION.FORMAT'='9'); 

    (RegexDeserializer is not implemented yet.)
    CREATE TABLE t5 (foo STRING, bar STRING)
    ROW FORMAT SERIALIZER 'org.apache.hadoop.hive.serde2.RegexDeserializer'
    WITH SERDEPROPERTIES ('regex'='([a-z]*) ([a-z])'); 

    2 Load the data:
    LOAD DATA LOCAL INPATH '../examples/files/kv1.txt' OVERWRITE INTO TABLE t; 
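
    For step 2 to populate a table correctly, the input file has to use the
    delimiters declared in step 1. The following is a minimal Python sketch of
    writing such a file for table t3. The names format_t3_row and write_t3_file
    are hypothetical helpers, not part of Hive, and the sketch assumes that no
    key or value contains a tab, comma, colon, or newline, since it does no
    escaping.

```python
# Delimiters taken from the CREATE TABLE t3 statement above.
FIELD_SEP = "\t"  # FIELDS TERMINATED BY '\t'
ITEM_SEP = ","    # COLLECTION ITEMS TERMINATED BY ','
KEY_SEP = ":"     # MAP KEYS TERMINATED BY ':'


def format_t3_row(foo, bar):
    """Serialize one (foo STRING, bar MAP<STRING,STRING>) row as a text line.

    Hypothetical helper: assumes keys and values contain no delimiter characters.
    """
    entries = ITEM_SEP.join(f"{key}{KEY_SEP}{value}" for key, value in bar.items())
    return FIELD_SEP.join([foo, entries])


def write_t3_file(path, rows):
    """Write (foo, bar) rows to a local text file, one row per line."""
    with open(path, "w", encoding="utf-8", newline="\n") as handle:
        for foo, bar in rows:
            handle.write(format_t3_row(foo, bar) + "\n")
```

    A file written this way could then be passed to LOAD DATA LOCAL INPATH
    with t3 as the target table.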

    How to read data from Hive tables
    ---------------------------------
    To read data from Hive tables, we need to know the same two things:

    File Format
    Row Format
    With both known, we can open the table's HDFS files directly and read the data.

    CREATE TABLE table1 (a STRING, b STRING)
    ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' ESCAPED BY '\\'
    STORED AS TEXTFILE; 

    ESCAPED BY specifies the escape character.
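
    As an illustration of reading such a file directly, here is a minimal
    Python sketch that splits one line of table1's data on the tab delimiter
    while honoring the backslash escape character. The function split_escaped
    is a hypothetical helper, and the sketch is simplified: it treats any
    escaped character as a literal and does not model NULL markers or other
    details of Hive's own deserializer. It assumes the line has already been
    read from the table's text file with its trailing newline removed.

```python
def split_escaped(line, delimiter="\t", escape="\\"):
    """Split a line on delimiter, treating escape + char as a literal char.

    Hypothetical helper mirroring FIELDS TERMINATED BY '\\t' ESCAPED BY '\\\\'.
    """
    fields = []
    current = []
    i = 0
    while i < len(line):
        ch = line[i]
        if ch == escape and i + 1 < len(line):
            # Keep the escaped character as data, even if it is the delimiter.
            current.append(line[i + 1])
            i += 2
        elif ch == delimiter:
            # An unescaped delimiter closes the current field.
            fields.append("".join(current))
            current = []
            i += 1
        else:
            current.append(ch)
            i += 1
    fields.append("".join(current))
    return fields
```

    For a row of table1, the returned list would hold the values of columns
    a and b in that order.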

  • Original article: https://www.cnblogs.com/lexus/p/2872267.html