3,332 messages

org.apache.spark.issues [All Lists]

2017 May [All Months]

Page 4 (Messages 76 to 100): 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134

[jira] [Created] (SPARK-20574) Allow Bucketizer to handle non-Double column - Wayne Zhang (JIRA)
[jira] [Commented] (SPARK-20569) RuntimeReplaceable functions accept invalid third parameter - Michael Armbrust (JIRA)
[jira] [Commented] (SPARK-11834) Ignore thresholds in LogisticRegression and update documentation - Apache Spark (JIRA)
[jira] [Created] (SPARK-20638) Optimize the CartesianRDD to reduce repeatedly data fetching - Teng Jiang (JIRA)
[jira] [Assigned] (SPARK-20638) Optimize the CartesianRDD to reduce repeatedly data fetching - Apache Spark (JIRA)
[jira] [Commented] (SPARK-20668) Modify ScalaUDF to handle nullability. - Apache Spark (JIRA)
[jira] [Commented] (SPARK-20682) Support a new faster ORC data source based on Apache ORC - Dongjoon Hyun (JIRA)
[jira] [Assigned] (SPARK-20686) PropagateEmptyRelation incorrectly handles aggregate without grouping expressions - Apache Spark (JIRA)
[jira] [Resolved] (SPARK-20431) Support a DDL-formatted string in DataFrameReader.schema - Xiao Li (JIRA)
[jira] [Updated] (SPARK-11968) ALS recommend all methods spend most of time in GC - Xiao Li (JIRA)
[jira] [Commented] (SPARK-20502) ML, Graph 2.2 QA: API: Experimental, DeveloperApi, final, sealed audit - Joseph K. Bradley (JIRA)
[jira] [Commented] (SPARK-20506) ML, Graph 2.2 QA: Programming guide update and migration guide - Nick Pentreath (JIRA)
[jira] [Comment Edited] (SPARK-20503) ML 2.2 QA: API: Python API coverage - Nick Pentreath (JIRA)
[jira] [Commented] (SPARK-20746) Built-in SQL Function Improvement - Takeshi Yamamuro (JIRA)
[jira] [Commented] (SPARK-19089) Support nested arrays/seqs in Datasets - Apache Spark (JIRA)
[jira] [Created] (SPARK-20788) Fix the Executor task reaper's false alarm warning logs - Shixiong Zhu (JIRA)
[jira] [Commented] (SPARK-20747) Distinct in Aggregate Functions - Takeshi Yamamuro (JIRA)
[jira] [Assigned] (SPARK-18825) Eliminate duplicate links in SparkR API doc index - Apache Spark (JIRA)
[jira] [Commented] (SPARK-20803) KernelDensity.estimate in pyspark.mllib.stat.KernelDensity throws net.razorvine.pickle.PickleException when input data is normally distributed (no error when data is not normally distributed) - Bettadapura Srinath Sharma (JIRA)
[jira] [Commented] (SPARK-20178) Improve Scheduler fetch failures - Thomas Graves (JIRA)
[jira] [Resolved] (SPARK-20848) Dangling threads when reading parquet files in local mode - Wenchen Fan (JIRA)
[jira] [Commented] (SPARK-20640) Make rpc timeout and retry for shuffle registration configurable - Apache Spark (JIRA)
[jira] [Commented] (SPARK-20787) PySpark can't handle datetimes before 1900 - Yan Facai (颜发才) (JIRA)
[jira] [Created] (SPARK-20897) cached self-join should not fail - Wenchen Fan (JIRA)
[jira] [Assigned] (SPARK-20916) Improve error message for unaliased subqueries in FROM clause - Wenchen Fan (JIRA)

Page 4 (Messages 76 to 100): 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134