284 messages

org.apache.lucene.nutch-dev

2007 July

Page 1 of 12 (Messages 1 to 25)

Nutch nightly build and NUTCH-505 draft patch - Kai_testing Middleton
Build failed in Hudson: Nutch-Nightly #137 - hud...@lucene.zones.apache.org
URL Injection with another source than text files - Epo Jemba
OPIC scoring differences - Carl Cerecke
Re: OPIC scoring differences - Andrzej Bialecki
Re: Not renewing CrawlDatum on Inject - Robert Young
Re: OPIC scoring differences - Doğacan Güney
NUTCH CONSULTANT NEEDED - Luca Rondanini
running nutch of nfs - prem kumar
inject command fail on whole-web run - Tsengtan A Shuy
RE: OOM error during parsing with nekohtml - Tsengtan A Shuy
OOM error during parsing with nekohtml - Shailendra Mudgal
Re: OOM error during parsing with nekohtml - Kai_testing Middleton
Re: OOM error during parsing with nekohtml - Doğacan Güney
RE: no nutch script file under bin directory - Tsengtan A Shuy
RE: no nutch script file under bin directory - Tsengtan A Shuy
Looking to fix relative path issue in linkdb - Robert Young
Re: Looking to fix relative path issue in linkdb - Robert Young
[jira] Commented: (NUTCH-25) needs 'character encoding' detector - Doug Cook (JIRA)
[jira] Created: (NUTCH-527) MapWritable doesn't support all hadoops writable types - Rob Young (JIRA)
CrawlDbReader TopN - Emmanuel
[jira] Resolved: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE - Doğacan Güney (JIRA)
[jira] Closed: (NUTCH-516) Next fetch time is not set when it is a CrawlDatum.STATUS_FETCH_GONE - Doğacan Güney (JIRA)
Error indexer - Le Quoc Anh
[jira] Commented: (NUTCH-533) LinkDbMerger: url normlaized is not updated in the key and inlinks list - Doğacan Güney (JIRA)