197 messages

org.apache.lucene.nutch-dev

2015 July

Page 1 (Messages 1 to 25)

Re: Squashing Git Commits - Mattmann, Chris A (3980)
[GitHub] nutch pull request: Nutch 2058 - New index-replace plugin that all... - chrismattmann
Jenkins build is back to normal : Nutch-trunk #3198 - Apache Jenkins Server
[jira] [Commented] (NUTCH-2060) dedup is removing entries with status db_gone - Ashish Nerkar (JIRA)
Build failed in Jenkins: Nutch-trunk #3207 - Apache Jenkins Server
[jira] [Commented] (NUTCH-2059) protocol-httpclient, protocol-http unit test errors on Jenkins - Peter Ciuffetti (JIRA)
[jira] [Commented] (NUTCH-2058) Indexer plugin that allows RegEx replacements on the NutchDocument field values - Peter Ciuffetti (JIRA)
[jira] [Created] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver - Michael Joyce (JIRA)
[jira] [Commented] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver - Michael Joyce (JIRA)
[Nutch Wiki] Trivial Update of "IndexReplace" by PeterCiuffetti - Apache Wiki
[jira] [Updated] (NUTCH-2021) Use protocol-selenium to Capture Screenshots of the Page as it is Fetched - Lewis John McGibbney (JIRA)
[jira] [Updated] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver - Michael Joyce (JIRA)
[jira] [Updated] (NUTCH-2063) Add -mimeStats flag to FileDumper tool - Michael Joyce (JIRA)
[jira] [Commented] (NUTCH-2066) Allow user to specify crawldb and segment db in the Generate Job REST endpoint - ASF GitHub Bot (JIRA)
[jira] [Updated] (NUTCH-2042) parse-html increase chunk size used to detect charset - Sebastian Nagel (JIRA)
Re: [jira] [Updated] (NUTCH-2042) parse-html increase chunk size used to detect charset - Mattmann, Chris A (3980)
[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable - Lewis John McGibbney (JIRA)
[jira] [Updated] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - Markus Jelsma (JIRA)
[jira] [Work started] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver - Chris A. Mattmann (JIRA)
[jira] [Assigned] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver - Chris A. Mattmann (JIRA)
[jira] [Commented] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver - Michael Joyce (JIRA)
[jira] [Updated] (NUTCH-2072) Deflate encoding support is broken when http.content.limit is set to -1 - Tanguy Moal (JIRA)
[jira] [Created] (NUTCH-2072) Deflate encoding support is broken when http.content.limit is set to -1 - Tanguy Moal (JIRA)
[jira] [Commented] (NUTCH-2069) Ignore external links based on domain - Julien Nioche (JIRA)
[jira] [Commented] (NUTCH-1785) Ability to index raw content - Hudson (JIRA)