200 messages

org.apache.lucene.nutch-dev

2017 November

Page 1 (Messages 1 to 25)

Re: Fwd: Maven configuration - Sebastian Nagel
[jira] [Commented] (NUTCH-2442) Injector to stop if job fails to avoid loss of CrawlDb - Omkar Reddy (JIRA)
[jira] [Commented] (NUTCH-2442) Injector to stop if job fails to avoid loss of CrawlDb - ASF GitHub Bot (JIRA)
[jira] [Resolved] (NUTCH-2443) Extract links from the video tag with the parse-html plugin - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2431) Filterchecker to implement Tool-interface - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2422) Update information about git repository - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2420) Bug in variable generate.max.count and fetcher.server.delay - Hudson (JIRA)
[jira] [Comment Edited] (NUTCH-2451) MalformedURLExceptions on perfectly looking URLs? - Hiran Chaudhuri (JIRA)
[jira] [Commented] (NUTCH-2456) Redirected documents are not indexed - Yossi Tamari (JIRA)
[jira] [Commented] (NUTCH-1480) SolrIndexer to write to multiple servers. - ASF GitHub Bot (JIRA)
Request for patches review - Semyon Semyonov
[jira] [Updated] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium - hussein Al_Ahmad (JIRA)
[jira] [Commented] (NUTCH-2368) Variable generate.max.count and fetcher.server.delay - Semyon Semyonov (JIRA)
[jira] [Commented] (NUTCH-2368) Variable generate.max.count and fetcher.server.delay - Semyon Semyonov (JIRA)
[jira] [Comment Edited] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - Tim Allison (JIRA)
[jira] [Commented] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - Tim Allison (JIRA)
Re: Fwd: Maven configuration - Raffaele Palmieri
[jira] [Updated] (NUTCH-2464) Headers That Contain HTML Elements Are Not Parsed - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2464) Headers That Contain HTML Elements Are Not Parsed - Cass Pallansch (JIRA)
[jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium - ASF GitHub Bot (JIRA)
[jira] [Resolved] (NUTCH-2463) Enable sampling CrawlDB - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2461) Generate passes the data to when maxCount == 0 - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2441) ARG_SEGMENT usage - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-2467) Sitemap type field can be null - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-2465) Broken Eclipse project. Classpaths and interactiveselenium should be fixed. - Hudson (JIRA)
