zoukankan      html  css  js  c++  java
  • Some Notes For programing on hadoop

    1 We must distinguish between the abstract class and the interface with them same name,
      beacause they have different base classes or interfaces.
    2 In the class and subclass of InputFormat,we should remember the unit of measurement is Block,
      and the blocks of different files should not be divided into the same split.The 'splitNum' in
      the InputFormat interface only the reference.Of course,we may use it to compute how many 
      Blocks should be divided into one split.
    3 the differences between 'ListStatus' and 'globStatus',just in my mind 'ListStatus' may get
      all the files from the directory ,'globStatus' is used for filter some types of files,

      eg. fs.globStatus(new Path("/2007/*") ,new RegexExcludeFilter("^.*/2007/12$"))

  • 相关阅读:
    SQL里面的函数应用
    Split的小用法
    堆栈和堆问题
    break,continue,goto,Return几个方法
    接口笔记
    抽象类
    虚方法
    将博客搬至CSDN
    运行数据区
    美团-走迷宫
  • 原文地址:https://www.cnblogs.com/ai464068163/p/1867849.html
Copyright © 2011-2022 走看看