189 messages

org.apache.lucene.nutch-dev [All Lists]

2007 May [All Months]

Page 2 (Messages 26 to 50): 1 2 3 4 5 6 7 8

Re: SIGSEGV - Dennis Kubes
Re: How to install Nutch on Freebsd? - Nuther
Document Classification - indexing question - Bastian Preindl
Build failed in Hudson: Nutch-Nightly #80 - hud...@lucene.zones.apache.org
Hudson build is back to normal: Nutch-Nightly #81 - hud...@lucene.zones.apache.org
[jira] Created: (NUTCH-482) Remove redundant plugin lib-log4j - Sami Siren (JIRA)
Re: Site nightly API link is broken - Sami Siren
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - Doğacan Güney (JIRA)
[jira] Reopened: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - Chris A. Mattmann (JIRA)
[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - Chris A. Mattmann (JIRA)
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - Doğacan Güney (JIRA)
[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-486) Break searcher dependency on commons-cli - Andrzej Bialecki (JIRA)
[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch) - Marcin Okraszewski (JIRA)
[jira] Updated: (NUTCH-489) URLFilter-suffix management of the url path when the url contains some query parameters - Emmanuel Joke (JIRA)
IntelliJ & Eclipse Lucene code styles available - Otis Gospodnetic
Get meta name="description" and other meta tags from Content - Yakn
Re: Get meta name="description" and other meta tags from Content - Andrzej Bialecki
Re: Plugins initialized all the time! - Nicolás Lichtmaier
Re: Plugins initialized all the time! - Doğacan Güney
[jira] Resolved: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content - Andrzej Bialecki (JIRA)
Re: Plugins initialized all the time! - Andrzej Bialecki
[jira] Updated: (NUTCH-494) FindBugs: CrawlDbReader and DeleteDuplicates - Doğacan Güney (JIRA)
[jira] Updated: (NUTCH-466) Flexible segment format - Andrzej Bialecki (JIRA)
Re: [jira] Resolved: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content - Andrzej Bialecki

Page 2 (Messages 26 to 50): 1 2 3 4 5 6 7 8