2,890 messages

org.apache.spark.issues [All Lists]

2014 September [All Months]

Page 41 (Messages 1001 to 1025): 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116

[jira] [Created] (SPARK-3336) [Spark SQL] In pyspark, cannot group by field on UDF - kay feng (JIRA)
[jira] [Updated] (SPARK-3339) Support for skipping json lines that fail to parse - Michael Armbrust (JIRA)
[jira] [Created] (SPARK-3357) Internal log messages should be set at DEBUG level instead of INFO - Xiangrui Meng (JIRA)
[jira] [Updated] (SPARK-3061) Maven build fails in Windows OS - Patrick Wendell (JIRA)
[jira] [Updated] (SPARK-2334) Attribute Error calling PipelinedRDD.id() in pyspark - Josh Rosen (JIRA)
[jira] [Commented] (SPARK-3129) Prevent data loss in Spark Streaming - Hari Shreedharan (JIRA)
[jira] [Created] (SPARK-3490) Alleviate port collisions during tests - Andrew Or (JIRA)
[jira] [Updated] (SPARK-3490) Alleviate port collisions during tests - Andrew Or (JIRA)
[jira] [Updated] (SPARK-2321) Design a proper progress reporting & event listener API - Reynold Xin (JIRA)
[jira] [Updated] (SPARK-1866) Closure cleaner does not null shadowed fields when outer scope is referenced - Patrick Wendell (JIRA)
[jira] [Commented] (SPARK-3067) JobProgressPage could not show Fair Scheduler Pools section sometimes - Josh Rosen (JIRA)
[jira] [Commented] (SPARK-3561) Native Hadoop/YARN integration for batch/ETL workloads - Oleg Zhurakousky (JIRA)
[jira] [Updated] (SPARK-3537) Statistics for cached RDDs - Michael Armbrust (JIRA)
[jira] [Updated] (SPARK-3573) Dataset - Xiangrui Meng (JIRA)
[jira] [Updated] (SPARK-3579) Jekyll doc generation is different across environments - Patrick Wendell (JIRA)
[jira] [Updated] (SPARK-3583) Spark run slow after unexpected repartition - ShiShu (JIRA)
[jira] [Updated] (SPARK-3614) Filter on minimum occurrences of a term in IDF - Jatinpreet Singh (JIRA)
[jira] [Commented] (SPARK-3431) Parallelize execution of tests - Nicholas Chammas (JIRA)
[jira] [Commented] (SPARK-3675) Allow starting JDBC server on an existing context - Apache Spark (JIRA)
[jira] [Commented] (SPARK-3679) pickle the exact globals of functions - Apache Spark (JIRA)
[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib - Xiangrui Meng (JIRA)
[jira] [Commented] (SPARK-3685) Spark's local dir scheme is not configurable - Patrick Wendell (JIRA)
[jira] [Updated] (SPARK-3721) Broadcast Variables above 2GB break in PySpark - Brad Miller (JIRA)
[jira] [Updated] (SPARK-3745) curl on maven search repo (apache rat) url returns search status, not jar file - shane knapp (JIRA)
[jira] [Updated] (SPARK-3356) Document when RDD elements' ordering within partitions is nondeterministic - Matei Zaharia (JIRA)

Page 41 (Messages 1001 to 1025): 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116