347 messages

org.apache.lucene.nutch-dev [All Lists]

2012 April [All Months]

Page 11 (Messages 251 to 275): 1 2 3 4 5 6 7 8 9 10 11 12 13 14

[jira] [Created] (NUTCH-1325) HostDB for Nutch - Markus Jelsma (Created) (JIRA)
[jira] [Updated] (NUTCH-1183) Summary task for adding command line usage instructions to webgraph classes - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1014) Migrate from Apache ORO to java.util.regex - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1123) JUnit test for scoring-link - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1122) JUnit test for protocol-ftp - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1120) JUnit test for microformats-reltag - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1179) Option to restrict generated records by metadata - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1202) Fetcher timebomb kills long waiting fetch jobs - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1039) Fetcher fails for pages without content-length header - Markus Jelsma (Updated) (JIRA)
[jira] [Commented] (NUTCH-1270) some of Deflate encoded pages not fetched - behnam nikbakht (Commented) (JIRA)
[jira] [Created] (NUTCH-1329) parser not extract outlinks to external web sites - behnam nikbakht (Created) (JIRA)
[jira] [Commented] (NUTCH-366) Merge URLFilters and URLNormalizers - Yangxiaolong (Commented) (JIRA)
Build failed in Jenkins: Nutch-trunk #1813 - Apache Jenkins Server
[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed - Markus Jelsma (Commented) (JIRA)
[jira] [Updated] (NUTCH-1340) Increase scalability by only removing markers when they actually exist for DbUpdaterReducer - Ferdy Galema (Updated) (JIRA)
[jira] [Commented] (NUTCH-882) Design a Host table in GORA - Ferdy Galema (Commented) (JIRA)
Re: [VOTE] Apache Nutch 1.5 release rc #1 - Julien Nioche
[jira] [Updated] (NUTCH-1341) NotModified time set to now but page not modified - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1344) BasicURLNormalizer to normalize https same as http - Sebastian Nagel (JIRA)
[jira] [Updated] (NUTCH-1158) Write JUnit tests for all nutchgora plugins - Lewis John McGibbney (JIRA)
[jira] [Updated] (NUTCH-1169) Write JUnit tests for urlfilter-prefix - Lewis John McGibbney (JIRA)
[jira] [Updated] (NUTCH-1094) create comprehensive documentation for Nutchgora branch - Lewis John McGibbney (JIRA)
[jira] [Closed] (NUTCH-1290) crawlId not supported by all Tools - Ferdy Galema (JIRA)
[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml - Lewis John McGibbney (JIRA)

Page 11 (Messages 251 to 275): 1 2 3 4 5 6 7 8 9 10 11 12 13 14