296 messages

org.apache.lucene.nutch-dev

2012 January

Page 3 of 12 (Messages 51 to 75)

[jira] [Resolved] (NUTCH-1212) ParseOutputFormat has redundant code - Markus Jelsma (Resolved) (JIRA)
Build failed in Jenkins: Nutch-trunk #1713 - Apache Jenkins Server
What to do with items for which is no parser? - Markus Jelsma
Re: What to do with items for which is no parser? - Markus Jelsma
[jira] [Commented] (NUTCH-1241) CrawlDBScanner should also be able to find records - Markus Jelsma (Commented) (JIRA)
[jira] [Resolved] (NUTCH-1236) Add link to site documentation to download older versions of Nutch. - Lewis John McGibbney (Resolved) (JIRA)
[jira] [Updated] (NUTCH-827) HTTP POST Authentication - Markus Jelsma (Updated) (JIRA)
[Nutch Wiki] Trivial Update of "bin/nutch solrindex" by MarkusJelsma - Apache Wiki
[jira] [Commented] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - José Gil (Commented) (JIRA)
[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - Lewis John McGibbney (Commented) (JIRA)
[jira] [Commented] (NUTCH-1176) Fix all javadoc warnings from nightly builds - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-1176) Fix all javadoc warnings from nightly builds - Hudson (Commented) (JIRA)
[jira] [Created] (NUTCH-1249) Resolve all issues flagged up by adding javac -Xlint arguement - Lewis John McGibbney (Created) (JIRA)
Jenkins build is back to normal : Nutch-trunk #1730 - Apache Jenkins Server
[jira] [Commented] (NUTCH-1247) CrawlDatum.retries should be int - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1247) CrawlDatum.retries should be int - Sebastian Nagel (Commented) (JIRA)
[jira] [Created] (NUTCH-1250) parse-html does not parse links with empty anchor - Andreas Janning (Created) (JIRA)
[jira] [Updated] (NUTCH-1251) Deletion of duplicates fails with org.apache.solr.client.solrj.SolrServerException - Arkadi Kosmynin (Updated) (JIRA)
Get target URL of redirects - Markus Jelsma
Re: Get target URL of redirects - Lewis John Mcgibbney
[jira] [Updated] (NUTCH-1252) SegmentReader -get shows wrong data - Sebastian Nagel (Updated) (JIRA)
[jira] [Commented] (NUTCH-1260) Fetcher should log fetching of redirects - Hudson (Commented) (JIRA)
Re: Get target URL of redirects - Markus Jelsma
[jira] [Created] (NUTCH-1262) Map `duplicating` content-types to a single type - Markus Jelsma (Created) (JIRA)
[jira] [Commented] (NUTCH-1256) WebGraph to dump host + score - Hudson (Commented) (JIRA)
