189 messages

org.apache.lucene.nutch-dev [All Lists]

2007 May [All Months]

Page 6 (Messages 126 to 150): 1 2 3 4 5 6 7 8

Build failed in Hudson: Nutch-Nightly #74 - hud...@lucene.zones.apache.org
[jira] Updated: (NUTCH-477) Extend URLFilters to support different filtering chains - Andrzej Bialecki (JIRA)
Scope-based crawling and indexing - Vikas
[jira] Resolved: (NUTCH-393) Indexer doesn't handle null documents returned by filters - Andrzej Bialecki (JIRA)
Re: svn commit: r536606 - in /lucene/nutch/trunk: ./ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/metadata/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/util/ src/plugin/creativecommons/src/test/org/creativecommons/nutch/ src/... - Sami Siren
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - Doğacan Güney (JIRA)
[jira] Created: (NUTCH-484) Nutch Nightly API link is broken in site - Gal Nitzan (JIRA)
[jira] Updated: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object - Gal Nitzan (JIRA)
[jira] Commented: (NUTCH-485) Change HtmlParseFilter 's to return ParseResult object instead of Parse object - Doğacan Güney (JIRA)
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - Doğacan Güney (JIRA)
[jira] Updated: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - Doğacan Güney (JIRA)
[jira] Resolved: (NUTCH-457) Create top level dist directory and checkin KEYS file to subversion be standard with Lucene Java and Hadoop - Sami Siren (JIRA)
[jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - Doğacan Güney (JIRA)
[jira] Created: (NUTCH-486) Break searcher dependency on commons-cli - Mark Woon (JIRA)
[jira] Commented: (NUTCH-25) needs 'character encoding' detector - Doug Cook (JIRA)
[jira] Commented: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. - Vadim Bauer (JIRA)
[jira] Commented: (NUTCH-25) needs 'character encoding' detector - Doug Cook (JIRA)
[jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. - Vadim Bauer (JIRA)
[jira] Created: (NUTCH-493) contentType parse not correctly,,,,got empty content using readseg -get - wangxu (JIRA)
Committer - Chris Mattmann
Re: Plugins initialized all the time! - Doğacan Güney
[jira] Created: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates - Doğacan Güney (JIRA)
[jira] Commented: (NUTCH-466) Flexible segment format - Doğacan Güney (JIRA)
[jira] Updated: (NUTCH-466) Flexible segment format - Andrzej Bialecki (JIRA)
[PATCH] Moving HitDetails construction to a constructor =) - Nicolás Lichtmaier

Page 6 (Messages 126 to 150): 1 2 3 4 5 6 7 8