347 messages

org.apache.lucene.nutch-dev [All Lists]

2012 April [All Months]

Page 3 (Messages 51 to 75): 1 2 3 4 5 6 7 8 9 10 11 12 13 14

Re: NutchGora release, and Nutch 1.x trunk release - Markus Jelsma
[jira] [Updated] (NUTCH-1273) Fix [deprecation] javac warnings - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-578) URL fetched with 403 is generated over and over again - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1233) Rely on Tika for outlink extraction - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1121) JUnit test for parse-js - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1223) Migrate WebGraph to MapReduce API - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1319) HostNormalizer - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1140) index-more plugin, resetTitle method creates multiple values in the Title field - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1317) Max content length by MIME-type - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1088) Write Solr XML documents - Markus Jelsma (Updated) (JIRA)
Re: NutchGora release, and Nutch 1.x trunk release - Julien Nioche
Jenkins build is back to normal : nutch-trunk-maven #226 - Apache Jenkins Server
Re: Build failed in Jenkins: Nutch-nutchgora #218 - Ferdy Galema
Build failed in Jenkins: Nutch-nutchgora #224 - Apache Jenkins Server
Jenkins build is back to normal : nutch-trunk-maven #237 - Apache Jenkins Server
[jira] [Created] (NUTCH-1334) NPE in FetcherOutputFormat - Julien Nioche (Created) (JIRA)
Re: NUTCH-1129 - Markus Jelsma
[jira] [Updated] (NUTCH-1335) OutlinkDB to collect unique URL's only - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1339) Default URL normalization rules to remove page anchors completely - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1297) it is better for fetchItemQueues to select items from greater queues first - Ferdy Galema (Commented) (JIRA)
[jira] [Updated] (NUTCH-882) Design a Host table in GORA - Ferdy Galema (Updated) (JIRA)
[jira] [Created] (NUTCH-1345) JAVA_HOME should not be required - Ben McCann (JIRA)
[jira] [Commented] (NUTCH-1317) Max content length by MIME-type - Markus Jelsma (JIRA)
[jira] [Updated] (NUTCH-875) Port Webgraph to Nutch 2.0 - Lewis John McGibbney (JIRA)
[jira] [Updated] (NUTCH-978) A Plugin for extracting certain element of a web page on html page parsing. - Lewis John McGibbney (JIRA)

Page 3 (Messages 51 to 75): 1 2 3 4 5 6 7 8 9 10 11 12 13 14