Troubleshooting
This section provides comprehensive troubleshooting guidance for common issues you may encounter while deploying, configuring, or operating the DOCA Platform Framework (DPF).
Quick Diagnostic Tools
DPF CLI (dpfctl)
Command-line tool for visualizing, debugging, and troubleshooting DPU resources in Kubernetes. Essential for real-time visibility into resource states and conditions.
Use when:
- DPU provisioning is failing
- Need to understand resource dependencies
- Debugging component readiness issues
System Reports (sosreport)
Generate comprehensive system reports for deeper analysis and support requests.
Use when:
- Need detailed system information for support cases
- Investigating complex infrastructure issues
- Preparing diagnostic data for NVIDIA support
Escalation Path
If you cannot resolve the issue using the guides above:
Collect Diagnostic Information
- Generate a sosreport for your environment
Check Known Issues
- Review Release Notes for known issues
- Search the GitHub repository for similar problems
Contact Support
- Open an issue on the GitHub repository
- Include diagnostic information and steps to reproduce
- For enterprise customers, contact NVIDIA support with your diagnostic package
Additional Resources
- User Guides - Operational procedures and best practices
- Architecture - Understanding system design for better troubleshooting
- API Reference - Complete API documentation for debugging configurations