296 messages

org.apache.lucene.nutch-dev

2012 January

Page 5 (Messages 101 to 125)

[jira] [Commented] (NUTCH-1232) Remove host|site fields from index-basic - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records - Julien Nioche (Commented) (JIRA)
[jira] [Created] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Edward Drapkin (Created) (JIRA)
Build failed in Jenkins: Nutch-trunk #1715 - Apache Jenkins Server
[jira] [Created] (NUTCH-1243) Junit jar removed from lib - Julien Nioche (Created) (JIRA)
[jira] [Commented] (NUTCH-1146) Get rid of _success files in webgraph code - Hudson (Commented) (JIRA)
Re: Build failed in Jenkins: Nutch-trunk #1714 - Julien Nioche
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output - Lewis John McGibbney
Re: [jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output - Markus Jelsma
[jira] [Updated] (NUTCH-840) Port tests from parse-html to parse-tika - Lewis John McGibbney (Updated) (JIRA)
[jira] [Closed] (NUTCH-1138) remove LogUtil from trunk and nutch gora - Lewis John McGibbney (Closed) (JIRA)
Re: [jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0 - Markus Jelsma
[jira] [Resolved] (NUTCH-1177) Generator to select on retry interval - Markus Jelsma (Resolved) (JIRA)
[Nutch Wiki] Trivial Update of "PluginCentral" by ElisabethAdler - Apache Wiki
I want to volunteer some time - Eddie Drapkin
[jira] [Commented] (NUTCH-1247) CrawlDatum.retries should be int - Sebastian Nagel (Commented) (JIRA)
[jira] [Commented] (NUTCH-1201) Allow for different FetcherThread impls - Edward Drapkin (Commented) (JIRA)
[jira] [Commented] (NUTCH-1251) Deletion of duplicates fails with org.apache.solr.client.solrj.SolrServerException - Arkadi Kosmynin (Commented) (JIRA)
[jira] [Updated] (NUTCH-1252) SegmentReader -get shows wrong data - Sebastian Nagel (Updated) (JIRA)
Re: [DISCUSS] Issues with Fetcher - Mattmann, Chris A (388J)
[jira] [Commented] (NUTCH-1086) Rewrite protocol-httpclient - Oleg Kalnichevski (Commented) (JIRA)
[jira] [Commented] (NUTCH-1253) Incompatible neko and xerces versions - Ferdy Galema (Commented) (JIRA)
% of different content types out there on the web - Mattmann, Chris A (388J)
[jira] [Commented] (NUTCH-1262) Map `duplicating` content-types to a single type - Markus Jelsma (Commented) (JIRA)
Re: % of different content types out there on the web - Markus Jelsma
