- 论坛徽章:
- 0
|
本人的nutch的环境调试遇到了很多难题,各位大侠请看看。
之前nutch的安装一直不成功,出现的错误如下
run java in /usr/Java/jdk
060319 220333 parsing file:/usr/local/nutch/conf/nutch-default.xml
060319 220333 parsing file:/usr/local/nutch/conf/crawl-tool.xml
060319 220333 parsing file:/usr/local/nutch/conf/nutch-site.xml
060319 220333 No FS indicated, using default:local
060319 220333 crawl started in: crawl.demo
060319 220333 rootUrlFile = 4
060319 220333 threads = 10
060319 220333 depth = 2
060319 220334 Created webdb at LocalFS,/usr/local/nutch/crawl.demo/db
Exception in thread "main" java.io.FileNotFoundException: 4 (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:106)
at java.io.FileReader.<init>(FileReader.java:55)
at org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:372)
at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
前几天忽然想起了一个问题,因为我的nutch是用root的身份运行的,可能问题就出在此,然后新建了一个其他的用户,安装,运行,nutch成功地执行了内网爬行任务,并且创建了索引。
但是在次开机时再次用nutch抓网页的时候之前的问题又出现了。 |
|