Tutorials#
Learn how to run NeMo Safe Synthesizer jobs through hands-on tutorials to generate private synthetic versions of sensitive tabular datasets. Each tutorial provides step-by-step guidance with executable code examples.
Prerequisites#
Before starting any tutorial, ensure you have:
NeMo Safe Synthesizer deployed using Docker Compose or Helm
Python environment with
nemo-platformSDK installed:pip install nemo-platform[safe-synthesizer]
Jupyter environment for running tutorial notebooks
Getting Started#
Learn the basics with your first Safe Synthesizer job, leveraging smart defaults. This tutorial covers uploading data, running a synthesis job with PII replacement, and reviewing evaluation reports.
Topics covered:
Installing the SDK
Connecting to Safe Synthesizer
Using
SafeSynthesizerJobBuilderMonitoring job progress
Retrieving synthetic data and evaluation reports
Advanced Topics#
Apply differential privacy to achieve the maximum level of privacy with mathematical guarantees. This tutorial explores the privacy-utility tradeoff and how to configure differential privacy parameters.
Topics covered:
Understanding differential privacy concepts (epsilon, delta)
Configuring privacy hyperparameters
Privacy budget analysis
Evaluating privacy-utility tradeoffs
Interpreting privacy metrics
Additional Resources#
After completing these tutorials, explore:
About Generating Safe Synthetic Data: Understand core concepts and components
Need Help?#
Check the GitHub Issues for known issues
Review the Safe Synthesizer Jobs guide for job management