zoukankan      html  css  js  c++  java
  • hbase-0.92.1过滤器学习

    查看当前hbase中支持的过滤器

    hbase(main):060:0* show_filters
    Documentation on filters mentioned below can be found at: https://our.intern.facebook.com/intern/wiki/index.php/HBase/Filter_Language                                                                                                                        
    ColumnPrefixFilter                                                                                                                                                                                                                                           
    TimestampsFilter                                                                                                                                                                                                                                             
    PageFilter                                                                                                                                                                                                                                                   
    MultipleColumnPrefixFilter                                                                                                                                                                                                                                   
    FamilyFilter                                                                                                                                                                                                                                                 
    ColumnPaginationFilter                                                                                                                                                                                                                                       
    SingleColumnValueFilter                                                                                                                                                                                                                                      
    RowFilter                                                                                                                                                                                                                                                    
    QualifierFilter                                                                                                                                                                                                                                              
    ColumnRangeFilter                                                                                                                                                                                                                                            
    ValueFilter                                                                                                                                                                                                                                                  
    PrefixFilter                                                                                                                                                                                                                                                 
    SingleColumnValueExcludeFilter                                                                                                                                                                                                                               
    ColumnCountGetFilter                                                                                                                                                                                                                                         
    InclusiveStopFilter                                                                                                                                                                                                                                          
    DependentColumnFilter                                                                                                                                                                                                                                        
    FirstKeyOnlyFilter                                                                                                                                                                                                                                           
    KeyOnlyFilter                       

    新建测试表和数据

    hbase(main):072:0* create 'emp', 'mycf'
    
    put 'emp', 'row1', 'mycf:name', 'adams'
    put 'emp', 'row1', 'mycf:depart', 'research'
    put 'emp', 'row1', 'mycf:job', 'clerk'
    put 'emp', 'row1', 'mycf:id', '7876'
    put 'emp', 'row1', 'mycf:locate', 'dallas'
    
    put 'emp', 'row2', 'mycf:name', 'allen'
    put 'emp', 'row2', 'mycf:depart', 'sales'
    put 'emp', 'row2', 'mycf:job', 'salesman'
    put 'emp', 'row2', 'mycf:id', '7499'
    put 'emp', 'row2', 'mycf:locate', 'chicago'
    
    put 'emp', 'row3', 'mycf:name', 'blake'
    put 'emp', 'row3', 'mycf:depart', 'sales'
    put 'emp', 'row3', 'mycf:job', 'manager'
    put 'emp', 'row3', 'mycf:id', '7698'
    put 'emp', 'row3', 'mycf:locate', 'chicago'
    
    put 'emp', 'row4', 'mycf:name', 'clark'
    put 'emp', 'row4', 'mycf:depart', 'accounting'
    put 'emp', 'row4', 'mycf:job', 'manager'
    put 'emp', 'row4', 'mycf:id', '7782'
    put 'emp', 'row4', 'mycf:locate', 'new york'
    
    put 'emp', 'row5', 'mycf:name', 'ford'
    put 'emp', 'row5', 'mycf:depart', 'research'
    put 'emp', 'row5', 'mycf:job', 'analyst'
    put 'emp', 'row5', 'mycf:id', '7902'
    put 'emp', 'row5', 'mycf:locate', 'dallas'
    
    put 'emp', 'row6', 'mycf:name', 'james'
    put 'emp', 'row6', 'mycf:depart', 'sales'
    put 'emp', 'row6', 'mycf:job', 'clerk'
    put 'emp', 'row6', 'mycf:id', '7900'
    put 'emp', 'row6', 'mycf:locate', 'chicago'
    
    put 'emp', 'row7', 'mycf:name', 'jones'
    put 'emp', 'row7', 'mycf:depart', 'research'
    put 'emp', 'row7', 'mycf:job', 'manager'
    put 'emp', 'row7', 'mycf:id', '7566'
    put 'emp', 'row7', 'mycf:locate', 'dallas'
    
    put 'emp', 'row8', 'mycf:name', 'king'
    put 'emp', 'row8', 'mycf:depart', 'accounting'
    put 'emp', 'row8', 'mycf:job', 'president'
    put 'emp', 'row8', 'mycf:id', '7839'
    put 'emp', 'row8', 'mycf:locate', 'new york'
    
    
    
    hbase(main):180:0> scan 'emp'
    ROW                                                              COLUMN+CELL                                                                                                                                                                                 
     row1                                                            column=mycf:depart, timestamp=1555846776542, value=research                                                                                                                                 
     row1                                                            column=mycf:id, timestamp=1555846776590, value=7876                                                                                                                                         
     row1                                                            column=mycf:job, timestamp=1555846776566, value=clerk                                                                                                                                       
     row1                                                            column=mycf:locate, timestamp=1555846776618, value=dallas                                                                                                                                   
     row1                                                            column=mycf:name, timestamp=1555846776511, value=adams                                                                                                                                      
     row2                                                            column=mycf:depart, timestamp=1555846776687, value=sales                                                                                                                                    
     row2                                                            column=mycf:id, timestamp=1555846776736, value=7499                                                                                                                                         
     row2                                                            column=mycf:job, timestamp=1555846776712, value=salesman                                                                                                                                    
     row2                                                            column=mycf:locate, timestamp=1555846776770, value=chicago                                                                                                                                  
     row2                                                            column=mycf:name, timestamp=1555846776662, value=allen                                                                                                                                      
     row3                                                            column=mycf:depart, timestamp=1555846776838, value=sales                                                                                                                                    
     row3                                                            column=mycf:id, timestamp=1555846776887, value=7698                                                                                                                                         
     row3                                                            column=mycf:job, timestamp=1555846776863, value=manager                                                                                                                                     
     row3                                                            column=mycf:locate, timestamp=1555846776912, value=chicago                                                                                                                                  
     row3                                                            column=mycf:name, timestamp=1555846776806, value=blake                                                                                                                                      
     row4                                                            column=mycf:depart, timestamp=1555846776976, value=accounting                                                                                                                               
     row4                                                            column=mycf:id, timestamp=1555846777027, value=7782                                                                                                                                         
     row4                                                            column=mycf:job, timestamp=1555846777002, value=manager                                                                                                                                     
     row4                                                            column=mycf:locate, timestamp=1555846777086, value=new york                                                                                                                                 
     row4                                                            column=mycf:name, timestamp=1555846776952, value=clark                                                                                                                                      
     row5                                                            column=mycf:depart, timestamp=1555846777146, value=research                                                                                                                                 
     row5                                                            column=mycf:id, timestamp=1555846777193, value=7902                                                                                                                                         
     row5                                                            column=mycf:job, timestamp=1555846777169, value=analyst                                                                                                                                     
     row5                                                            column=mycf:locate, timestamp=1555846777218, value=dallas                                                                                                                                   
     row5                                                            column=mycf:name, timestamp=1555846777121, value=ford                                                                                                                                       
     row6                                                            column=mycf:depart, timestamp=1555846777277, value=sales                                                                                                                                    
     row6                                                            column=mycf:id, timestamp=1555846777324, value=7900                                                                                                                                         
     row6                                                            column=mycf:job, timestamp=1555846777301, value=clerk                                                                                                                                       
     row6                                                            column=mycf:locate, timestamp=1555846777355, value=chicago                                                                                                                                  
     row6                                                            column=mycf:name, timestamp=1555846777253, value=james                                                                                                                                      
     row7                                                            column=mycf:depart, timestamp=1555846777416, value=research                                                                                                                                 
     row7                                                            column=mycf:id, timestamp=1555846777465, value=7566                                                                                                                                         
     row7                                                            column=mycf:job, timestamp=1555846777441, value=manager                                                                                                                                     
     row7                                                            column=mycf:locate, timestamp=1555846777491, value=dallas                                                                                                                                   
     row7                                                            column=mycf:name, timestamp=1555846777390, value=jones                                                                                                                                      
     row8                                                            column=mycf:depart, timestamp=1555846777556, value=accounting                                                                                                                               
     row8                                                            column=mycf:id, timestamp=1555846777604, value=7839                                                                                                                                         
     row8                                                            column=mycf:job, timestamp=1555846777581, value=president                                                                                                                                   
     row8                                                            column=mycf:locate, timestamp=1555846777628, value=new york                                                                                                                                 
     row8                                                            column=mycf:name, timestamp=1555846777526, value=king                                                                                                                                       
    8 row(s) in 0.1420 seconds

     PrefixFilter

    hbase(main):191:0* scan 'emp', {FILTER => "PrefixFilter ('row1')"}
    ROW                                                              COLUMN+CELL                                                                                                                                                                                 
     row1                                                            column=mycf:depart, timestamp=1555846776542, value=research                                                                                                                                 
     row1                                                            column=mycf:id, timestamp=1555846776590, value=7876                                                                                                                                         
     row1                                                            column=mycf:job, timestamp=1555846776566, value=clerk                                                                                                                                       
     row1                                                            column=mycf:locate, timestamp=1555846776618, value=dallas                                                                                                                                   
     row1                                                            column=mycf:name, timestamp=1555846776511, value=adams                                                                                                                                      
    1 row(s) in 0.0510 seconds

     ColumnPrefixFilter

    hbase(main):192:0> scan 'emp', {FILTER => "ColumnPrefixFilter ('name')"}
    ROW                                                              COLUMN+CELL                                                                                                                                                                                 
     row1                                                            column=mycf:name, timestamp=1555846776511, value=adams                                                                                                                                      
     row2                                                            column=mycf:name, timestamp=1555846776662, value=allen                                                                                                                                      
     row3                                                            column=mycf:name, timestamp=1555846776806, value=blake                                                                                                                                      
     row4                                                            column=mycf:name, timestamp=1555846776952, value=clark                                                                                                                                      
     row5                                                            column=mycf:name, timestamp=1555846777121, value=ford                                                                                                                                       
     row6                                                            column=mycf:name, timestamp=1555846777253, value=james                                                                                                                                      
     row7                                                            column=mycf:name, timestamp=1555846777390, value=jones                                                                                                                                      
     row8                                                            column=mycf:name, timestamp=1555846777526, value=king                                                                                                                                       
    8 row(s) in 0.0720 seconds

    ValueFilter

    hbase(main):038:0* scan 'emp', FILTER=>"ValueFilter(=,'substring:sales')"
    ROW                                                              COLUMN+CELL                                                                                                                                                                                 
     row2                                                            column=mycf:depart, timestamp=1555846776687, value=sales                                                                                                                                    
     row2                                                            column=mycf:job, timestamp=1555846776712, value=salesman                                                                                                                                    
     row3                                                            column=mycf:depart, timestamp=1555846776838, value=sales                                                                                                                                    
     row6                                                            column=mycf:depart, timestamp=1555846777277, value=sales                                                                                                                                    
    3 row(s) in 0.0540 seconds
    create 'test', 'cf1', { SPLITS => ['rk-1000', 'rk-2000', 'rk-3000'] }
    
    
    for i in '0'..'9' do for j in '0'..'9' do 
       for k in '0'..'9' do put 'test', "rk-#{i}#{j}#{k}", 
        "cf1:#{j}#{k}", "#{j}#{k}" end end end
  • 相关阅读:
    【PENNI】2020-ICML-PENNI: Pruned Kernel Sharing for Efficient CNN Inference-论文阅读
    【BlockSwap】2020-ICLR-BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget-论文阅读
    【MCUNet】2020-NIPS-MCUNet Tiny Deep Learning on IoT Devices-论文阅读
    【FSNet】2020-ICLR-FSNet Compression of Deep Convolutional Neural Networks by Filter Summary-论文阅读
    【Shape Adaptor】2020-ECCV-Shape Adaptor: A Learnable Resizing Module-论文阅读
    【FairNAS】2019-arxiv-FairNAS Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search-论文阅读
    【EagleEye】2020-ECCV-EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning-论文阅读
    【DMC】2020-CVPR-Discrete Model Compression with Resource Constraint for Deep Neural Networks-论文阅读
    【RegNet】2020-CVPR-Designing Network Design Spaces-论文阅读
    【DropNet】2020-ICML-DropNet Reducing Neural Network Complexity via Iterative Pruning-论文阅读
  • 原文地址:https://www.cnblogs.com/ilifeilong/p/10746652.html
Copyright © 2011-2022 走看看