zoukankan      html  css  js  c++  java
  • Some Notes For programing on hadoop

    1 We must distinguish between the abstract class and the interface with them same name,
      beacause they have different base classes or interfaces.
    2 In the class and subclass of InputFormat,we should remember the unit of measurement is Block,
      and the blocks of different files should not be divided into the same split.The 'splitNum' in
      the InputFormat interface only the reference.Of course,we may use it to compute how many 
      Blocks should be divided into one split.
    3 the differences between 'ListStatus' and 'globStatus',just in my mind 'ListStatus' may get
      all the files from the directory ,'globStatus' is used for filter some types of files,

      eg. fs.globStatus(new Path("/2007/*") ,new RegexExcludeFilter("^.*/2007/12$"))

  • 相关阅读:
    Mybatis学习随笔3
    Mybatis学习随笔2
    Mybatis学习随笔
    Java校招面试-什么是线程安全/不安全
    装饰器2
    装饰器
    默认传参的陷阱
    处理日志文件
    第二天
    用户登录
  • 原文地址:https://www.cnblogs.com/ai464068163/p/1867849.html
Copyright © 2011-2022 走看看