  • [Nutch 2.2.1 Basic Tutorial, Part 1] Nutch-related exceptions  Category: H3_NUTCH  2014-08-08 21:46


    1. The following error appears as soon as the job starts, while the URLs are being injected.

    InjectorJob: Injecting urlDir: urls 

    InjectorJob: Using class org.apache.gora.hbase.store.HBaseStore as the Gora storage class. 

    InjectorJob: java.lang.RuntimeException: job failed: name=[20140000]inject urls, jobid=job_local1629320149_0001 
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54) 
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233) 
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251) 
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273) 
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
    The cause is a configuration error in regex-urlfilter.txt.
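
Since the stack trace above does not say which rule is wrong, a quick sanity check of the file can help narrow it down. The following is a minimal sketch, not part of Nutch: the class name `UrlFilterCheck`, the method `findProblems`, and the default path `conf/regex-urlfilter.txt` are all hypothetical. It assumes the usual rule-file format, in which blank lines and lines starting with `#` or a space are skipped and every other line is a `+` or `-` sign followed by a Java regular expression.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

// Hypothetical helper for illustration only; it is not shipped with Nutch.
class UrlFilterCheck {

    // Returns one message per line that does not look like a valid rule.
    static List<String> findProblems(List<String> lines) {
        List<String> problems = new ArrayList<>();
        for (int i = 0; i < lines.size(); i++) {
            String line = lines.get(i);
            // Assumption: blank lines, comments and lines starting with a space are skipped.
            if (line.isEmpty() || line.charAt(0) == '#' || line.charAt(0) == ' ') {
                continue;
            }
            char sign = line.charAt(0);
            if (sign != '+' && sign != '-') {
                problems.add("line " + (i + 1) + ": must start with '+' or '-': " + line);
                continue;
            }
            try {
                // Everything after the sign is compiled as a java.util.regex pattern.
                Pattern.compile(line.substring(1));
            } catch (PatternSyntaxException e) {
                problems.add("line " + (i + 1) + ": invalid regex: " + e.getDescription());
            }
        }
        return problems;
    }

    public static void main(String[] args) throws IOException {
        // Assumed default location; pass another path as the first argument.
        String path = args.length > 0 ? args[0] : "conf/regex-urlfilter.txt";
        List<String> lines = Files.readAllLines(Paths.get(path), StandardCharsets.UTF_8);
        List<String> problems = findProblems(lines);
        for (String p : problems) {
            System.out.println(p);
        }
        if (problems.isEmpty()) {
            System.out.println("No problems found in " + path);
        }
    }
}
```

A clean result only means the file is syntactically plausible. A rule that is valid but rejects every seed URL would not be reported by this check.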

    Copyright notice: this is an original article by the blog author and may not be reproduced without the author's permission.

  • Original article: https://www.cnblogs.com/lujinhong2/p/4637262.html