296 messages

org.apache.lucene.nutch-dev [All Lists]

2012 January [All Months]

Page 9 (Messages 201 to 225): 1 2 3 4 5 6 7 8 9 10 11 12

[jira] [Resolved] (NUTCH-1240) Domain blacklist URL filter - Markus Jelsma (Resolved) (JIRA)
[jira] [Updated] (NUTCH-1232) Remove host field from index-basic - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1239) Webgraph should remove deleted pages from segment input - Hudson (Commented) (JIRA)
[jira] [Created] (NUTCH-1241) CrawlDBScanner should also be able to find records - Markus Jelsma (Created) (JIRA)
Re: Build failed in Jenkins: Nutch-trunk #1714 - Markus Jelsma
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output - Lewis John Mcgibbney
[jira] [Updated] (NUTCH-1139) Indexer to delete documents - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex - Hudson (Commented) (JIRA)
[jira] [Created] (NUTCH-1246) Upgrade to Hadoop 1.0.0 - Julien Nioche (Created) (JIRA)
[jira] [Commented] (NUTCH-1247) CrawlDatum.retries should be int - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1247) CrawlDatum.retries should be int - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1176) Fix all javadoc warnings from nightly builds - Hudson (Commented) (JIRA)
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Edward Drapkin (Updated) (JIRA)
[jira] [Commented] (NUTCH-1201) Allow for different FetcherThread impls - Edward Drapkin (Commented) (JIRA)
Re: [DISCUSS] Issues with Fetcher - Lewis John Mcgibbney
[jira] [Closed] (NUTCH-1255) Change ivy.xml of all plugins to remove "nutch.root" property - Ferdy Galema (Closed) (JIRA)
[jira] [Created] (NUTCH-1258) MoreIndexingFilter should be able to read Content-Type from both parse metadata and content metadata - Markus Jelsma (Created) (JIRA)
[jira] [Commented] (NUTCH-1258) MoreIndexingFilter should be able to read Content-Type from both parse metadata and content metadata - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1258) MoreIndexingFilter should be able to read Content-Type from both parse metadata and content metadata - Julien Nioche (Commented) (JIRA)
[jira] [Commented] (NUTCH-1086) Rewrite protocol-httpclient - Ferdy Galema (Commented) (JIRA)
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata - Markus Jelsma (Commented) (JIRA)
[jira] [Updated] (NUTCH-1260) Fetcher should log fetching of redirects - Markus Jelsma (Updated) (JIRA)
Build failed in Jenkins: Nutch-nutchgora #146 - Apache Jenkins Server
[jira] [Commented] (NUTCH-1256) WebGraph to dump host + score - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Hudson (Commented) (JIRA)

Page 9 (Messages 201 to 225): 1 2 3 4 5 6 7 8 9 10 11 12