Apparent Memory Issues #10

johnbutler123 · 2018-01-14T20:29:04Z

juyptererror.txt
commandprompt.txt
commandprompterror.txt

Hi - I am a student attempting to learn how to use PySpark/Jupyter to build classification models for large data. I installed PySpark v2.2.1 and Jupyter as per the tutorial on the Medium website by Michael Galarnyk. It seemed to install OK and I was able to run your first notebook. However, in the second notebook, nb2-rdd-basics, I had problems with the "collect" code:

from time import time
t0 = time()
head_rows = csv_data.take(100000)
tt = time() - t0
print("Parse completed in {} seconds".format(round(tt, 3)))
Thinking it was a memory issue, I then launched Jupyter with the command:
pyspark --master local[4] --driver-memory 32g --executor-memory 32g
I have attached the Jupyter error and the command prompt output from before and after the error.
Please help: how do I increase the memory available to the kernel?
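
To help with diagnosis, here is a minimal sketch for checking which memory settings the running kernel actually picked up from the launch command. The helper name `memory_settings` and the sample values are hypothetical and only for illustration. It assumes the pairs come from `sc.getConf().getAll()`, where `sc` is the SparkContext that the `pyspark` launcher creates in the notebook.

```python
def memory_settings(conf_pairs):
    """Return only the memory-related entries from (key, value) pairs.

    conf_pairs is expected to look like the output of sc.getConf().getAll(),
    i.e. a list of (key, value) tuples. The helper itself is hypothetical.
    """
    return {key: value for key, value in conf_pairs if key.endswith(".memory")}


# Stand-in for sc.getConf().getAll(); these values are made up for illustration
# and mirror the flags passed on the command line above.
example_pairs = [
    ("spark.master", "local[4]"),
    ("spark.driver.memory", "32g"),
    ("spark.executor.memory", "32g"),
    ("spark.app.name", "PySparkShell"),
]

settings = memory_settings(example_pairs)
for key in sorted(settings):
    print("{} = {}".format(key, settings[key]))
```

In the notebook itself the call would be `memory_settings(sc.getConf().getAll())`. If `spark.driver.memory` is missing from the result, the `--driver-memory` flag probably did not reach the kernel.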
