296 messages

org.apache.lucene.nutch-dev [All Lists]

2012 January [All Months]

Page 10 (Messages 226 to 250): 1 2 3 4 5 6 7 8 9 10 11 12

[jira] [Updated] (NUTCH-1210) DomainBlacklistFilter - Markus Jelsma (Updated) (JIRA)
[jira] [Resolved] (NUTCH-1017) Exception getting mime type by name - Markus Jelsma (Resolved) (JIRA)
[jira] [Resolved] (NUTCH-1064) o.a.n.util.MimeUtil uses deprecated Tika methods - Markus Jelsma (Resolved) (JIRA)
[jira] [Closed] (NUTCH-1106) Options to skip url's based on length - Markus Jelsma (Closed) (JIRA)
[jira] [Commented] (NUTCH-1232) Remove host field from index-basic - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-1146) Get rid of _success files in webgraph code - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora - Lewis John McGibbney (Commented) (JIRA)
[jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0 - Lewis John McGibbney (Commented) (JIRA)
[jira] [Commented] (NUTCH-1177) Generator to select on retry interval - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-1247) CrawlDatum.retries should be int - Markus Jelsma (Commented) (JIRA)
[jira] [Created] (NUTCH-1253) Incompatible neko and xerces versions - Dennis Spathis (Created) (JIRA)
[jira] [Commented] (NUTCH-1201) Allow for different FetcherThread impls - Edward Drapkin (Commented) (JIRA)
make nutch plugin to get termfreqvectors - Ale
Re: minor suggestion to ivy.xml of plugins (remove nutch.root property) - Lewis John Mcgibbney
[jira] [Commented] (NUTCH-1201) Allow for different FetcherThread impls - Ken Krugler (Commented) (JIRA)
[jira] [Updated] (NUTCH-1255) Change ivy.xml of all plugins to remove "nutch.root" property - Ferdy Galema (Updated) (JIRA)
[jira] [Created] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata - Markus Jelsma (Created) (JIRA)
[jira] [Updated] (NUTCH-1252) SegmentReader -get shows wrong data - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1260) Fetcher should log fetching of redirects - Sebastian Nagel (Updated) (JIRA)
Re: % of different content types out there on the web - Julien Nioche
[jira] [Created] (NUTCH-1261) Make numReducers configurable for indexer - Markus Jelsma (Created) (JIRA)
Re: % of different content types out there on the web - Mattmann, Chris A (388J)
[jira] [Updated] (NUTCH-1262) Map `duplicating` content-types to a single type - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1081) ant tests fail - Lewis John McGibbney (Commented) (JIRA)

Page 10 (Messages 226 to 250): 1 2 3 4 5 6 7 8 9 10 11 12