  • Hadoop operations

    Official documentation: https://hadoop.apache.org/docs/r3.1.2/

    hadoop fs -ls /   List all files under the root directory

    hdfs dfs -put alice.txt   Upload a local file to HDFS

    hdfs dfs -get /hdfsPath /localPath   Download to the local file system

    hdfs dfsadmin -report   Report basic file system information and statistics for the cluster

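    The storage paths for the Edits and Fsimage files are specified in hdfs-site.xml.
    When starting the services you may need to delete some files first (this removes the blocks stored by the DataNode on that host): sudo rm -r /usr/local/hadoop_store/hdfs/datanode/current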
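    To check which directories a given hdfs-site.xml actually points to, the file can be read with a few lines of Python. This is a minimal sketch: the property names dfs.namenode.name.dir and dfs.datanode.data.dir are the standard HDFS keys, but the sample file content is hypothetical, and its namenode path is an assumption modeled on the datanode path above.

```python
import xml.etree.ElementTree as ET

# Hypothetical hdfs-site.xml content. The datanode path comes from the text
# above; the namenode path is an assumption made for illustration only.
SAMPLE_HDFS_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>file:/usr/local/hadoop_store/hdfs/namenode</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>file:/usr/local/hadoop_store/hdfs/datanode</value>
  </property>
</configuration>
"""


def read_properties(xml_text):
    """Return a dict mapping each <name> to its <value>."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


def storage_dirs(xml_text):
    """Pick out the NameNode metadata dir and the DataNode block dir."""
    props = read_properties(xml_text)
    return {
        "namenode": props.get("dfs.namenode.name.dir"),
        "datanode": props.get("dfs.datanode.data.dir"),
    }


if __name__ == "__main__":
    for role, path in storage_dirs(SAMPLE_HDFS_SITE).items():
        print(role, "->", path)
```

    If dfs.namenode.edits.dir is not set, the Edits files are kept in the same directory as the Fsimage files, so reading these two keys is usually enough.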

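    Files found in the NameNode's current folder:

    1. Fsimage file: a permanent checkpoint of the HDFS file system metadata, containing the serialized information of every directory and file inode in the file system.
    2. Fsimage.md5 file: the MD5 checksum of the image file, used to detect whether the image file has been modified.
    3. Edits file: stores all update operations on the HDFS file system; every write operation issued by a file system client is first recorded in the Edits file.
    4. seen_txid file: holds the trailing number (transaction ID) of the edits_* files on the NameNode. When the NameNode restarts, it replays the edits files in order, from edits_0000001~ up to the number stored in seen_txid.
    5. VERSION file: records some information about the current NameNode.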
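    The file kinds listed above can be told apart by name. The sketch below assumes the naming scheme commonly seen in a NameNode current directory (fsimage_<txid>, fsimage_<txid>.md5, edits_<start>-<end>, edits_inprogress_<start>); the helper names and the file names used in the example are hypothetical.

```python
import re

# Patterns for the file names commonly seen in a NameNode "current" directory.
# The checksum pattern is listed before the plain fsimage pattern for clarity.
PATTERNS = [
    ("fsimage checksum", re.compile(r"^fsimage_\d+\.md5$")),
    ("fsimage", re.compile(r"^fsimage_\d+$")),
    ("edits (in progress)", re.compile(r"^edits_inprogress_\d+$")),
    ("edits (finalized)", re.compile(r"^edits_\d+-\d+$")),
    ("seen_txid", re.compile(r"^seen_txid$")),
    ("VERSION", re.compile(r"^VERSION$")),
]


def classify(filename):
    """Return the kind of metadata file, or 'unknown' if no pattern matches."""
    for kind, pattern in PATTERNS:
        if pattern.match(filename):
            return kind
    return "unknown"


def last_finalized_txid(filenames):
    """Return the highest end transaction ID among finalized edits segments."""
    ends = []
    for name in filenames:
        m = re.match(r"^edits_(\d+)-(\d+)$", name)
        if m:
            ends.append(int(m.group(2)))
    return max(ends) if ends else None


if __name__ == "__main__":
    # Hypothetical directory listing, for illustration only.
    listing = [
        "VERSION",
        "seen_txid",
        "fsimage_0000000000000000002",
        "fsimage_0000000000000000002.md5",
        "edits_0000000000000000001-0000000000000000002",
        "edits_inprogress_0000000000000000003",
    ]
    for name in listing:
        print(name, "->", classify(name))
    print("last finalized txid:", last_finalized_txid(listing))
```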

    Display HDFS block information

    hadoop@node-master:~$ hdfs fsck / -files
    Connecting to namenode via http://node-master:9870/fsck?ugi=hadoop&files=1&path=%2F
    FSCK started by hadoop (auth:SIMPLE) from /192.168.56.2 for path / at Sun Oct 27 17:37:28 AEDT 2019
    / <dir>
    /alice.txt 173595 bytes, replicated: replication=2, 1 block(s):  OK
    /readme.txt 152 bytes, replicated: replication=2, 1 block(s):  OK
    
    Status: HEALTHY
     Number of data-nodes:    3
     Number of racks:        1
     Total dirs:            1
     Total symlinks:        0
    
    Replicated Blocks:
     Total size:    173747 B
     Total files:    2
     Total blocks (validated):    2 (avg. block size 86873 B)
     Minimally replicated blocks:    2 (100.0 %)
     Over-replicated blocks:    0 (0.0 %)
     Under-replicated blocks:    0 (0.0 %)
     Mis-replicated blocks:        0 (0.0 %)
     Default replication factor:    2
     Average block replication:    2.0
     Missing blocks:        0
     Corrupt blocks:        0
     Missing replicas:        0 (0.0 %)
    
    Erasure Coded Block Groups:
     Total size:    0 B
     Total files:    0
     Total block groups (validated):    0
     Minimally erasure-coded block groups:    0
     Over-erasure-coded block groups:    0
     Under-erasure-coded block groups:    0
     Unsatisfactory placement block groups:    0
     Average block group size:    0.0
     Missing block groups:        0
     Corrupt block groups:        0
     Missing internal blocks:    0
    FSCK ended at Sun Oct 27 17:37:28 AEDT 2019 in 3 milliseconds
    
    
    The filesystem under path '/' is HEALTHY
    hadoop@node-master:~$ hdfs fsck / -blocks
    Connecting to namenode via http://node-master:9870/fsck?ugi=hadoop&blocks=1&path=%2F
    FSCK started by hadoop (auth:SIMPLE) from /192.168.56.2 for path / at Sun Oct 27 17:40:29 AEDT 2019
    
    Status: HEALTHY
     Number of data-nodes:    3
     Number of racks:        1
     Total dirs:            1
     Total symlinks:        0
    
    Replicated Blocks:
     Total size:    173747 B
     Total files:    2
     Total blocks (validated):    2 (avg. block size 86873 B)
     Minimally replicated blocks:    2 (100.0 %)
     Over-replicated blocks:    0 (0.0 %)
     Under-replicated blocks:    0 (0.0 %)
     Mis-replicated blocks:        0 (0.0 %)
     Default replication factor:    2
     Average block replication:    2.0
     Missing blocks:        0
     Corrupt blocks:        0
     Missing replicas:        0 (0.0 %)
    
    Erasure Coded Block Groups:
     Total size:    0 B
     Total files:    0
     Total block groups (validated):    0
     Minimally erasure-coded block groups:    0
     Over-erasure-coded block groups:    0
     Under-erasure-coded block groups:    0
     Unsatisfactory placement block groups:    0
     Average block group size:    0.0
     Missing block groups:        0
     Corrupt block groups:        0
     Missing internal blocks:    0
    FSCK ended at Sun Oct 27 17:40:29 AEDT 2019 in 4 milliseconds
    
    
    The filesystem under path '/' is HEALTHY
    hadoop@node-master:~$ hdfs fsck /readme.txt -files
    Connecting to namenode via http://node-master:9870/fsck?ugi=hadoop&files=1&path=%2Freadme.txt
    FSCK started by hadoop (auth:SIMPLE) from /192.168.56.2 for path /readme.txt at Sun Oct 27 17:43:58 AEDT 2019
    /readme.txt 152 bytes, replicated: replication=2, 1 block(s):  OK
    
    Status: HEALTHY
     Number of data-nodes:    3
     Number of racks:        1
     Total dirs:            0
     Total symlinks:        0
    
    Replicated Blocks:
     Total size:    152 B
     Total files:    1
     Total blocks (validated):    1 (avg. block size 152 B)
     Minimally replicated blocks:    1 (100.0 %)
     Over-replicated blocks:    0 (0.0 %)
     Under-replicated blocks:    0 (0.0 %)
     Mis-replicated blocks:        0 (0.0 %)
     Default replication factor:    2
     Average block replication:    2.0
     Missing blocks:        0
     Corrupt blocks:        0
     Missing replicas:        0 (0.0 %)
    
    Erasure Coded Block Groups:
     Total size:    0 B
     Total files:    0
     Total block groups (validated):    0
     Minimally erasure-coded block groups:    0
     Over-erasure-coded block groups:    0
     Under-erasure-coded block groups:    0
     Unsatisfactory placement block groups:    0
     Average block group size:    0.0
     Missing block groups:        0
     Corrupt block groups:        0
     Missing internal blocks:    0
    FSCK ended at Sun Oct 27 17:43:58 AEDT 2019 in 1 milliseconds
    
    
    The filesystem under path '/readme.txt' is HEALTHY
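    When fsck is run from a script, the summary can be pulled out of the text with regular expressions. This is a rough sketch that assumes the output layout shown above; the function name is hypothetical, and the sample text is an excerpt of the /readme.txt run.

```python
import re


def parse_fsck_summary(text):
    """Extract a few fields from the 'Replicated Blocks' part of fsck output."""
    summary = {}
    status = re.search(r"^\s*Status:\s*(\S+)", text, re.MULTILINE)
    if status:
        summary["status"] = status.group(1)
    # Only search the part before the erasure-coding section, because
    # "Total size" and "Total files" appear in both sections.
    replicated = text.split("Erasure Coded Block Groups:")[0]
    fields = {
        "total_size_bytes": r"Total size:\s*(\d+)\s*B",
        "total_files": r"Total files:\s*(\d+)",
        "total_blocks": r"Total blocks \(validated\):\s*(\d+)",
        "missing_blocks": r"Missing blocks:\s*(\d+)",
        "corrupt_blocks": r"Corrupt blocks:\s*(\d+)",
    }
    for key, pattern in fields.items():
        m = re.search(pattern, replicated)
        if m:
            summary[key] = int(m.group(1))
    return summary


# Excerpt of the fsck output for /readme.txt shown above.
SAMPLE = """\
Status: HEALTHY
 Number of data-nodes:    3
 Number of racks:        1

Replicated Blocks:
 Total size:    152 B
 Total files:    1
 Total blocks (validated):    1 (avg. block size 152 B)
 Missing blocks:        0
 Corrupt blocks:        0

Erasure Coded Block Groups:
 Total size:    0 B
 Total files:    0
"""

if __name__ == "__main__":
    print(parse_fsck_summary(SAMPLE))
```

    In practice the text would come from running hdfs fsck through a subprocess; a script could then alert when status is not HEALTHY or when missing_blocks is greater than zero.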

  • Original article: https://www.cnblogs.com/cschen588/p/11747252.html