232 messages

org.apache.lucene.nutch-dev [All Lists]

2018 April [All Months]

Page 1 (Messages 1 to 25): 1 2 3 4 5 6 7 8 9 10

[jira] [Resolved] (NUTCH-2509) Inconsistent behavior in SitemapProcessor - Sebastian Nagel (JIRA)
[jira] [Resolved] (NUTCH-2545) Upgrade to Any23 2.2 - Lewis John McGibbney (JIRA)
[jira] [Commented] (NUTCH-2545) Upgrade to Any23 2.2 - Hudson (JIRA)
[jira] [Commented] (NUTCH-2548) Compressed content skipped. Content of size 78 was truncated to 74 - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2518) Must check return value of job.waitForCompletion() - ASF GitHub Bot (JIRA)
[jira] [Created] (NUTCH-2553) Fetcher not to modify URLs to be fetched - Sebastian Nagel (JIRA)
[jira] [Comment Edited] (NUTCH-2549) protocol-http does not behave the same as browsers - Gerard Bouchar (JIRA)
[jira] [Updated] (NUTCH-2561) protocol-http can be made to read arbitrarily large HTTP responses - Gerard Bouchar (JIRA)
[jira] [Commented] (NUTCH-2551) NullPointerException in generator - Omkar Reddy (JIRA)
[jira] [Updated] (NUTCH-2533) Injector: NullPointerException if seed URL dir contains non-file entries - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2533) Injector: NullPointerException if seed URL dir contains non-file entries - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2518) Must check return value of job.waitForCompletion() - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2551) NullPointerException in generator - Sebastian Nagel (JIRA)
[Nutch Wiki] Update of "NutchHadoopSingleNodeTutorial" by SebastianNagel - Apache Wiki
[jira] [Commented] (NUTCH-2568) Caught exception is immediately rethrown - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2553) Fetcher not to modify URLs to be fetched - ASF GitHub Bot (JIRA)
[jira] [Resolved] (NUTCH-2553) Fetcher not to modify URLs to be fetched - Sebastian Nagel (JIRA)
[jira] [Assigned] (NUTCH-2569) ClassNotFoundException when running in (pseudo-)distributed mode - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2570) Deduplication job fails to install deduplicated CrawlDb - ASF GitHub Bot (JIRA)
[jira] [Assigned] (NUTCH-2544) Nutch 1.15 no longer compatible with AWS EMR and S3 - Sebastian Nagel (JIRA)
[jira] [Updated] (NUTCH-2571) SegmentReader -list fails to read segment - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-1228) Change mapred.task.timeout to mapreduce.task.timeout in fetcher - Hudson (JIRA)
[jira] [Commented] (NUTCH-1228) Change mapred.task.timeout to mapreduce.task.timeout in fetcher - Markus Jelsma (JIRA)
[jira] [Commented] (NUTCH-2570) Deduplication job fails to install deduplicated CrawlDb - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2570) Deduplication job fails to install deduplicated CrawlDb - Hudson (JIRA)

Page 1 (Messages 1 to 25): 1 2 3 4 5 6 7 8 9 10