197 messages

org.apache.lucene.nutch-dev

2015 July

Page 5 (Messages 101 to 125)

[jira] [Resolved] (NUTCH-1980) Jexl expressions for CrawlDbReader - Markus Jelsma (JIRA)
[jira] [Updated] (NUTCH-1980) Jexl expressions for CrawlDbReader - Markus Jelsma (JIRA)
[jira] [Commented] (NUTCH-1980) Jexl expressions for CrawlDbReader - Hudson (JIRA)
[jira] [Updated] (NUTCH-1838) Host and domain based regex and automaton filtering - Markus Jelsma (JIRA)
[jira] [Commented] (NUTCH-1940) Port HTTP POST Authentication to 2.X - Lewis John McGibbney (JIRA)
[jira] [Commented] (NUTCH-2055) Random Crawl Delay - Sebastian Nagel (JIRA)
[Nutch Wiki] Update of "AsitangMishra" by AsitangMishra - Apache Wiki
[jira] [Commented] (NUTCH-2058) Indexer plugin that allows RegEx replacements on the NutchDocument field values - Peter Ciuffetti (JIRA)
[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins - Hudson (JIRA)
Build failed in Jenkins: Nutch-trunk #3192 - Apache Jenkins Server
Re: GSOC2015- Sitemap crawler roudmap problems - Cihad Guzel
[jira] [Created] (NUTCH-2060) dedup is removing entries with status db_gone - Steven Hayles (JIRA)
[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins - Chris A. Mattmann (JIRA)
[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins - Peter Ciuffetti (JIRA)
[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins - Peter Ciuffetti (JIRA)
[jira] [Commented] (NUTCH-2060) dedup is removing entries with status db_gone - Ashish Nerkar (JIRA)
[jira] [Commented] (NUTCH-2058) Indexer plugin that allows RegEx replacements on the NutchDocument field values - Chris A. Mattmann (JIRA)
[jira] [Updated] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - Markus Jelsma (JIRA)
[jira] [Updated] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - Markus Jelsma (JIRA)
[jira] [Commented] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - Sebastian Nagel (JIRA)
[jira] [Created] (NUTCH-2066) Allow user to specify crawldb and segment db in the Generate JOb REST endpoint - Sujen Shah (JIRA)
[jira] [Commented] (NUTCH-2048) parse-tika: fix dependencies in plugin.xml - Sebastian Nagel (JIRA)
[jira] [Updated] (NUTCH-2070) Allow user to specify segment to Fetch via the REST API - Sujen Shah (JIRA)
[jira] [Commented] (NUTCH-2072) Deflate encoding support is broken when http.content.limit is set to -1 - ASF GitHub Bot (JIRA)
[jira] [Commented] (NUTCH-1785) Ability to index raw content - Thad Guidry (JIRA)
