296 messages

org.apache.lucene.nutch-dev

2012 January

Page 7 (Messages 151 to 175)

[jira] [Resolved] (NUTCH-1232) Remove host field from index-basic - Markus Jelsma (Resolved) (JIRA)
[jira] [Resolved] (NUTCH-1041) Not reading mime-type correctly - Markus Jelsma (Resolved) (JIRA)
[jira] [Resolved] (NUTCH-1106) Options to skip url's based on length - Markus Jelsma (Resolved) (JIRA)
[jira] [Issue Comment Edited] (NUTCH-1220) Upgrade Solr deps - X Yang (Issue Comment Edited) (JIRA)
[jira] [Resolved] (NUTCH-1146) Get rid of _success files in webgraph code - Julien Nioche (Resolved) (JIRA)
[jira] [Created] (NUTCH-1244) CrawlDBDumper to filter by regex - Markus Jelsma (Created) (JIRA)
[jira] [Updated] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - Markus Jelsma (Updated) (JIRA)
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Markus Jelsma (Updated) (JIRA)
Build failed in Jenkins: Nutch-nutchgora #124 - Apache Jenkins Server
[jira] [Updated] (NUTCH-1138) remove LogUtil from trunk and nutch gora - Lewis John McGibbney (Updated) (JIRA)
[jira] [Closed] (NUTCH-1189) add commented out default settings to gora.properties files - Lewis John McGibbney (Closed) (JIRA)
[jira] [Commented] (NUTCH-809) Parse-metatags plugin - Lewis John McGibbney (Commented) (JIRA)
[jira] [Resolved] (NUTCH-1248) Generator to select on status - Markus Jelsma (Resolved) (JIRA)
[jira] [Commented] (NUTCH-1177) Generator to select on retry interval - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-1201) Allow for different FetcherThread impls - Andrzej Bialecki (Commented) (JIRA)
Re: I want to volunteer some time - Markus Jelsma
[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2-incubating in ivy/ivy.xml - Ferdy Galema (Updated) (JIRA)
Re: [DISCUSS] Issues with Fetcher - Edward Drapkin
[jira] [Commented] (NUTCH-1255) Change ivy.xml of all plugins to remove "nutch.root" property - Hudson (Commented) (JIRA)
[jira] [Commented] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - Sebastian Nagel (Commented) (JIRA)
[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index? - Sebastian Nagel (Commented) (JIRA)
[jira] [Created] (NUTCH-1260) Fetcher should log fetching of redirects - Sebastian Nagel (Created) (JIRA)
[jira] [Commented] (NUTCH-1259) TikaParser should not add Content-Type from HTTP Headers to Nutch Metadata - Markus Jelsma (Commented) (JIRA)
[jira] [Commented] (NUTCH-1256) WebGraph to dump host + score - Lewis John McGibbney (Commented) (JIRA)
[jira] [Updated] (NUTCH-1242) Allow disabling of URL Filters in ParseSegment - Markus Jelsma (Updated) (JIRA)
