187 messages

org.apache.lucene.nutch-dev

2006 October

Page 1 (Messages 1 to 25): 1 2 3 4 5 6 7 8

Re: Nutch requires JDK 1.5 now? - Chris Mattmann
[jira] Commented: (NUTCH-353) pages that serverside forwards will be refetched every time - Doug Cutting (JIRA)
Re: Nutch requires JDK 1.5 now? - Sami Siren
Problem parsing some MS Excel & other formats (Office 2003) - tryma
Re: Problem parsing some MS Excel & other formats (Office 2003) - tryma
[jira] Updated: (NUTCH-379) ParseUtil does not pass through the content's URL to the ParserFactory - Sami Siren (JIRA)
[jira] Closed: (NUTCH-371) DeleteDuplicates should remove documents with duplicate URLs - Andrzej Bialecki (JIRA)
[jira] Created: (NUTCH-387) host normalization in Generator$Selector - Johannes Zillmann (JIRA)
RE: Issue with Boosting Fields - ian....@thomson.com
Re: Problem parsing some MS Excel & other formats (Office 2003) - Aisha
Re: Problem parsing some MS Excel & other formats (Office 2003) - Andrzej Bialecki
RE: I modify NutchAnalysis.jj and NutchDocumentTokenizer.java to let nutch support chinese word. - Teruhiko Kurosaka
RE: What javacc options should I use to compile NutchAnalysis.jj? - Teruhiko Kurosaka
outlink extractor finds lots of junk - AJ Chen
[jira] Updated: (NUTCH-185) XMLParser is configurable xml parser plugin. - nutch.newbie (JIRA)
[jira] Closed: (NUTCH-52) Parser plugin for MS Excel files - Sami Siren (JIRA)
[jira] Closed: (NUTCH-108) tasktracker crashs when reconnecting to a new jobtracker. - Sami Siren (JIRA)
[jira] Closed: (NUTCH-137) footer is not displayed in search result page - Sami Siren (JIRA)
[jira] Closed: (NUTCH-166) secure jobtracker info pages with a password - Sami Siren (JIRA)
[jira] Closed: (NUTCH-221) prepare nutch for upcoming lucene 2.0 - Sami Siren (JIRA)
[jira] Closed: (NUTCH-257) Summary#toString always Entity encodes -- problem for OpenSearchServlet#description field - Sami Siren (JIRA)
nutch search coop - Ben van Klinken
[jira] Created: (NUTCH-393) Indexer doesn't handle null documents returned by filters - Eelco Lempsink (JIRA)
[jira] Commented: (NUTCH-394) Searching via Tomcat / nutch-0.9-dev.war raises exception - Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-395) Increase fetching speed - Andrzej Bialecki (JIRA)
