Jupyter
Jupyter allows users to manipulate their Safe Haven data interactively using Python or PySpark. In addition to Notebooks, Jupyter provides a text editor and a command-line terminal.
Only users with the LSH Admin persona or LSH Data Scientist persona can access Jupyter. To learn more, see User Personas.
Note
Jupyter can be accessed either through the JupyterLab application or the Jupyter Notebook application. Both applications provide similar functionality. JupyterLab allows you to open several notebooks at the same time and customize its UI.
JupyterLab and Jupyter Notebook can be opened from the Analytics Environment virtual machine desktop.
Tasks You Can Perform with Jupyter
Jupyter and its related applications can be used to perform the following tasks:
Use the BigQuery Python API to run complex data processing operations on BigQuery
Use PySpark with a data manipulation structure such as PySpark DataFrame to analyze data
Use the following Python machine learning libraries included in the Analytics Environment (see each library below for installation instructions and usage examples):
Note
The H2O Python machine learning library is not supported by the latest Dataproc version.