Skip to main content

Ctrl+K

Spark RAPIDS User Guide

Spark RAPIDS User Guide

Table of Contents

Overview

Overview

Qualification Tool

Overview
Quickstart
Jar Usage
Output Details

Getting Started

Overview
RAPIDS Accelerator with On-prem Cluster or Local Mode
RAPIDS Accelerator on AWS EMR
RAPIDS Accelerator on Databricks
RAPIDS Accelerator on GCP Dataproc
RAPIDS Accelerator on Dataproc Serverless
RAPIDS Accelerator on Azure Synapse Analytics
RAPIDS and Kubernetes
RAPIDS and Alluxio
Spark Workload Qualification
RAPIDS Accelerator on Oracle Cloud Infrastructure
Spark3 GPU Configuration Guide on Yarn 3.2.1

Tuning

Tuning Guide
Best Practices on the RAPIDS Accelerator for Apache Spark

Profiling Tool

Overview
Quickstart
Jar Usage
Output Details

Additional Functionality

RAPIDS Accelerator for Apache Spark ML Library Integration
RAPIDS Shuffle Manager
Apache Iceberg Support
Delta Lake Support
RAPIDS Accelerator File Cache

Appendixes

Frequently Asked Questions
Examples
Glossary
Contact Us

RAPIDS Accelerator for Apache Spark - User Guide#

Overview

Overview

Qualification Tool

Overview
Quickstart
- Install
  - Prerequisites
  - Install the CLI Package
- Running the Tool
Jar Usage
- Setting Up Environment
  - Prerequisites
  - Getting the Tools Jar
- Deploying Tools Jar
Output Details

Getting Started

Overview
RAPIDS Accelerator with On-prem Cluster or Local Mode
RAPIDS Accelerator on AWS EMR
RAPIDS Accelerator on Databricks
RAPIDS Accelerator on GCP Dataproc
RAPIDS Accelerator on Dataproc Serverless
RAPIDS Accelerator on Azure Synapse Analytics
RAPIDS and Kubernetes
RAPIDS and Alluxio
Spark Workload Qualification
RAPIDS Accelerator on Oracle Cloud Infrastructure
Spark3 GPU Configuration Guide on Yarn 3.2.1

Tuning

Tuning Guide
Best Practices on the RAPIDS Accelerator for Apache Spark

Profiling Tool

Overview
Quickstart
- Install
  - Prerequisites
  - Install the CLI Package
- Running The Tool
Jar Usage
- Setting Up Environment
  - Prerequisites
  - Getting the Tools Jar
- Running Tools Jar
Output Details
- Understanding The Profiling Output
- Generating Visualizations

Additional Functionality

RAPIDS Accelerator for Apache Spark ML Library Integration
- Existing ML Libraries
  - RMM
- GPU accelerated ML Library development
  - ColumnarRdd
  - Examples of Spark ML Implementations leveraging ColumnarRdd
RAPIDS Shuffle Manager
- Multi-Threaded Mode
- UCX Mode
Apache Iceberg Support
Delta Lake Support
RAPIDS Accelerator File Cache

Appendixes

Frequently Asked Questions
Examples
Glossary
Contact Us
- Technical Support
- Issues and Discussions

next

Overview

Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2023-2024, NVIDIA Corporation.

Last updated on Oct 23, 2024.