Scale Data Science with Domino Data Lab
Scale Data Science with Domino Data Lab (Latest Version)

Step #2: Setup Rapids Workspace

  1. We will create a new workspace and run the Amira notebook.

  2. Open the new RAPIDS project, click Workspaces from the project menu.

    domino-step3_img1.png

  3. Click + Create New Workspace.

    domino-step3_img2.png

  4. For the Workspace Environment, choose RAPIDS from the dropdown.

  5. For the Workspace IDE, choose JupyterLab. (Make sure you click on this or you won’t be able to launch)

  6. For the Hardware Tier, choose GPU 6 cores - 45 GiB.

    domino-step3_img3.png

  7. Click Launch Now.

  8. Once the workspace starts, click Terminal from the launcher.

    domino-step3_img4.png

  9. Run the following commands to copy the RAPIDS sample projects into the Domino filesystem:

    Copy
    Copied!
                

    mkdir /mnt/repo cp -r /rapids/notebooks/repos/cuml /mnt/repo

    domino-step3_img5.png

  10. Close your Terminal window

  11. Click File Changes from the Workspace menu

  12. Click Sync All Changes.

    domino-step3_img6.png

  13. Under Notebook in JupyterLab, click on Python 3 (ipykernel)

    domino-step3_img7.png

  14. On the left directory structure, open repo -> cuml -> notebooks

  15. Click on arima_demo.ipynb to open it

    domino-step3_img8.png

  16. On the top, under Run, select Run all cells.

    domino-step3_img9.png

  17. Scroll down through the notebook to explore the results.

    domino-step3_img10.png

  18. There are a number of other notebooks within the folder. Check out this list to see if you any of them are interesting to you:

    • Forest Inference Demo: Save and load an XGBoost model into FIL and infer on new data.

    • KMeans Demo: Predict using k-means, visualize and compare the results with Scikit-learn’s k-means.

    • KMeans Multi-Node Multi-GPU Demo: Predict with MNMG k-means using dask distributed inputs.

    • Linear Regression Demo: Demonstrate the use of OLS Linear Regression for prediction.

    • Nearest Neighbors Demo: Predict using Nearest Neighbors algorithm.

    • Random Forest Demo: Use Random Forest for classification, and demonstrate how to pickle the cuML model.

    • Random Forest Multi-Node Multi-GPU Demo: Solve a classification problem using MNMG Random Forest.

    • Target Encoder Walkthrough: Understand how to use target encoding and why it is preferred over one-hot and label encoding with the help of criteo dataset for click-through rate modelling.

© Copyright 2022-2023, NVIDIA. Last updated on Jan 10, 2023.