Aprendizaje automático con Spark en Google Cloud DataprocIr al Laboratorio
Lab is broken and not tracking progress....
The lab doesn't work and doesn't tell you what to remember to use later.
There is a problem with the folder "07_parkml" name that is causing the lab not to be validated, I opened a ticket, and I hope they will fix it.
THIS LAB NEEDS TO BE FIXED!!! THE NOTEBOOK OPENS IN PYTHON 3 THEN THROWS AN ERROR - WHEN YOU TRY TO RERUN YOU GET A SPARK CALL ERROR - THEN YOU CANNOT GET CREDIT FOR A WORKING NOTEBOOK! HAS HAPPENED MULTIPLE TIMES! PLEASE FIX THIS FOR THE SAKE OF EVERYONE ELSE! ValueErrorTraceback (most recent call last) <ipython-input-2-0ad623086190> in <module>() 4 from pyspark import SparkContext 5 ----> 6 sc = SparkContext('local', 'logistic') 7 spark = SparkSession .builder .appName("Logistic regression w/ Spark ML") .getOrCreate() 8 /usr/lib/spark/python/lib/pyspark.zip/pyspark/context.py in __init__(self, master, appName, sparkHome, pyFiles, environment, batchSize, serializer, conf, gateway, jsc, profiler_cls) 127 " note this option will be removed in Spark 3.0") 128 --> 129 SparkContext._ensure_initialized(self, gateway=gateway, conf=conf) 130 try: 131 self._do_init(master, appName, sparkHome, pyFiles, environment, batchSize, serializer, /usr/lib/spark/python/lib/pyspark.zip/pyspark/context.py in _ensure_initialized(cls, instance, gateway, conf) 326 " created by %s at %s:%s " 327 % (currentAppName, currentMaster, --> 328 callsite.function, callsite.file, callsite.linenum)) 329 else: 330 SparkContext._active_spark_context = instance ValueError: Cannot run multiple SparkContexts at once; existing SparkContext(app=pyspark-shell, master=yarn) created by getOrCreate at /usr/lib/spark/python/pyspark/shell.py:45
Not functional in many commands since python is based on version 2.3 for spark
NOT WORKING 2 Times
Issue with lab. Fix by cloning trycustom branch: !git clone --branch trycustom https://github.com/GoogleCloudPlatform/data-science-on-gcp
Issue with datalab/notebooks/data-science-on-gcp/07_sparkml ?
horrible, things don't work
score doesn't update even when all steps completed
python version on jupyter default