611 messages

org.apache.spark.user [All Lists]

2017 March [All Months]

Page 2 (Messages 26 to 50): 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25

Combining reading from Kafka and HDFS w/ Spark Streaming - Mike Thomsen
RE: How to run a spark on Pycharm - Sidney Feiner
Not able to remove header from a text file while creating a data frame . - PSw...@in.imshealth.com
Re: LinearRegressionModel - Negative Predicted Value - Sean Owen
Re: org.apache.spark.SparkException: Task not serializable - Ankur Srivastava
Re: Check if dataframe is empty - Nick Pentreath
How to improve performance of saveAsTextFile() - Parsian, Mahmoud
Re: java.io.NotSerializableException: org.apache.spark.streaming.StreamingContext - 萝卜丝炒饭
Re: How to improve performance of saveAsTextFile() - 颜发才(Yan Facai)
Structured Streaming - Can I start using it? - Gaurav1809
Graphframes PageRank ends up on 1 partition - Olivier Girardot
Re: Structured Streaming - Can I start using it? - Adline Dsilva
Re: [MLlib] kmeans random initialization, same seed every time - Julian Keppel
[Spark CSV]: Use Custom TextInputFormat to Prevent Exceptions - Nathan Case
RE: RE: Fast write datastore... - Mal Edwin
Re: Dataset : Issue with Save - Bahubali Jain
[Spark SQL & Core]: RDD to Dataset 1500 columns data with createDataFrame() throw exception of grows beyond 64 KB - elevy
How to redistribute dataset without full shuffle - Artur R
Re: calculate diff of value and median in a group - Yong Zhang
Does spark's random forest need categorical features to be one hot encoded? - Aseem Bansal
Re: [Worker Crashing] OutOfMemoryError: GC overhead limit execeeded - Yong Zhang
apache-spark: Converting List of Rows into Dataset Java - Karin Valisova
Returning DataFrame for text file - George Obama
Spark SQL 2.1 Complex SQL - Query Planning Issue - Sathish Kumaran Vairavelu
Partitioning in spark while reading from RDBMS via JDBC - Devender Yadav

Page 2 (Messages 26 to 50): 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25