296 messages

org.apache.lucene.nutch-dev

2012 January

Page 6 (Messages 126 to 150)

[jira] [Updated] (NUTCH-1210) DomainBlacklistFilter - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1138) remove LogUtil from trunk and nutch gora - Markus Jelsma (Commented) (JIRA)
[jira] [Resolved] (NUTCH-1239) Webgraph should remove deleted pages from segment input - Markus Jelsma (Resolved) (JIRA)
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records - Julien Nioche (Commented) (JIRA)
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records - Markus Jelsma (Commented) (JIRA)
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Edward Drapkin (Updated) (JIRA)
[jira] [Commented] (NUTCH-1244) CrawlDBDumper to filter by regex - Julien Nioche (Commented) (JIRA)
Re: Build failed in Jenkins: Nutch-trunk #1714 - Markus Jelsma
[jira] [Closed] (NUTCH-1237) Improve javac arguements for more verbose output - Lewis John McGibbney (Closed) (JIRA)
[jira] [Commented] (NUTCH-1237) Improve javac arguements for more verbose output - Hudson (Commented) (JIRA)
[jira] [Resolved] (NUTCH-1244) CrawlDBDumper to filter by regex - Markus Jelsma (Resolved) (JIRA)
[Nutch Wiki] Trivial Update of "bin/nutch_readdb" by MarkusJelsma - Apache Wiki
[jira] [Resolved] (NUTCH-1139) Indexer to delete documents - Markus Jelsma (Resolved) (JIRA)
[jira] [Commented] (NUTCH-1246) Upgrade to Hadoop 1.0.0 - Julien Nioche (Commented) (JIRA)
[jira] [Commented] (NUTCH-1177) Generator to select on retry interval - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1248) Generator to select on status - Hudson (Commented) (JIRA)
[jira] [Assigned] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Markus Jelsma (Assigned) (JIRA)
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Edward Drapkin (Updated) (JIRA)
[jira] [Commented] (NUTCH-1201) Allow for different FetcherThread impls - Edward Drapkin (Commented) (JIRA)
[jira] [Commented] (NUTCH-1251) Deletion of duplicates fails with org.apache.solr.client.solrj.SolrServerException - Markus Jelsma (Commented) (JIRA)
minor suggestion to ivy.xml of plugins (remove nutch.root property) - Ferdy Galema
[jira] [Created] (NUTCH-1256) WebGraph to dump host + score - Markus Jelsma (Created) (JIRA)
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1262) Map `duplicating` content-types to a single type - Julien Nioche (Commented) (JIRA)
[jira] [Created] (NUTCH-1263) FetcherJob must put 'fetchTime' on input - Ferdy Galema (Created) (JIRA)
