88 messages

org.apache.lucene.nutch-dev

2013 November

Page 2 (Messages 26 to 50): 1 2 3 4

[jira] [Commented] (NUTCH-1640) OOM in ParseSegment Phase - Ian H. (JIRA)
[jira] [Commented] (NUTCH-1640) OOM in ParseSegment Phase - Julien Nioche (JIRA)
[jira] [Updated] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - Lewis John McGibbney (JIRA)
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - Yasin Kılınç (JIRA)
[jira] [Issue Comment Deleted] (NUTCH-1640) OOM in ParseSegment Phase - Ian H. (JIRA)
[jira] [Resolved] (NUTCH-1588) Port NUTCH-1245 URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again to 2.x - Lewis John McGibbney (JIRA)
[jira] [Commented] (NUTCH-1651) modifiedTime and prevmodifiedTime never set - Lewis John McGibbney (JIRA)
[jira] [Created] (NUTCH-1661) Language based crawling - Talat UYARER (JIRA)
[jira] [Commented] (NUTCH-1371) Replace Ivy with Maven Ant tasks - Lewis John McGibbney (JIRA)
[jira] [Commented] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - Hudson (JIRA)
[jira] [Updated] (NUTCH-1517) CloudSearch indexer - Tom Hill (JIRA)
[jira] [Updated] (NUTCH-1663) Crawl page with specified language - İlhami KALKAN (JIRA)
[jira] [Updated] (NUTCH-1660) Index filter for Page's latitude and longitude - Yasin Kılınç (JIRA)
[jira] [Resolved] (NUTCH-1100) SolrDedup broken - Julien Nioche (JIRA)
[jira] [Commented] (NUTCH-1324) DupeDB for Nutch - Markus Jelsma (JIRA)
[jira] [Comment Edited] (NUTCH-1646) IndexerMapReduce to consider DB status - Sebastian Nagel (JIRA)
[jira] [Commented] (NUTCH-1669) FTP crawl does not use FTP's server root folder - Rafael Thomas Goz Coutinho (JIRA)
[jira] [Resolved] (NUTCH-1309) fetch queue management - Julien Nioche (JIRA)
[jira] [Commented] (NUTCH-1667) Updatedb always ignore batchId - lufeng (JIRA)
[jira] [Commented] (NUTCH-1671) indexchecker to add digest field - lufeng (JIRA)
[jira] [Commented] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size) - Otis Gospodnetic (JIRA)
[jira] [Commented] (NUTCH-1297) it is better for fetchItemQueues to select items from greater queues first - Julien Nioche (JIRA)
[jira] [Commented] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size) - Julien Nioche (JIRA)
[jira] [Commented] (NUTCH-1647) protocol-http throws unzipBestEffort returned null for some pages - Luke (JIRA)
[jira] [Commented] (NUTCH-1297) it is better for fetchItemQueues to select items from greater queues first - Otis Gospodnetic (JIRA)
