1. Home
  2. NVIDIA
  3. NCP-AII Exam Syllabus

NVIDIA NCP-AII Exam Syllabus

Start Free NCP-AII Exam Practice After Reviewing the Topics

Before starting your NCP-AII exam preparation, it is recommended to review the complete NVIDIA AI Infrastructure exam syllabus and carefully go through the exam objectives listed below. Once you understand the exam structure and objectives, you should practice using our free NCP-AII questions. We also provide premium NCP-AII practice test, fully updated according to the latest exam objectives, to help you accurately assess your preparedness for the actual exam.

NVIDIA NCP-AII Exam Objectives

Section Weight Objectives
System and Server Bring-up 31%
  • Describe sequence of events for deployment and validation.
  • Describe network topologies for AI factories.
  • Perform initial configuration of BMC, OOB, and TPM. 
  • Perform firmware upgrades (including on HGX™) and fault detection.
  • Validate power and cooling parameters.
  • Install GPU-based servers (SMI).
  • Validate installed hardware.
  • Describe and validate cable types and transceivers.
  • Install physical GPUs.
  • Validate hardware operation for workloads.
  • Configure initial parameters for third-party storage.
Physical Layer Management 5%
  • Configure and manage a BlueField® network platform.
  • Configure MIG (AI and HPC).
Control Plane Installation and Configuration 19%
  • Install Base Command™ Manager (BCM), configure and verify HA.
  • Install OS.
  • Install Cluster (configure category, configure interfaces, install Slurm/Enroot/Pyxis).
  • Install/update/remove NVIDIA GPU and DOCA™ drivers.
  • Install the NVIDIA container toolkit.
  • Demonstrate how to use NVIDIA GPUs with Docker.
  • Install NGC™ CLI on hosts.
Cluster Test and Verification 33%
  • Perform a single-node stress test.
  • Execute HPL (High-Performance Linpack).
  • Perform single-node NCCL (including verifying NVLink™ Switch).
  • Validate cables by verifying signal quality.
  • Confirm cabling is correct.
  • Confirm FW/SW on switches.
  • Confirm FW/SW on BlueField-3.
  • Confirm FW on transceivers.
  • Run ClusterKit to perform a multifaceted node assessment.
  • Run NCCL to verify E/W fabric bandwidth.
  • Perform NCCL burn-in.
  • Perform HPL burn-in.
  • Perform NeMo™ burn-in.
  • Test storage.
Troubleshoot and Optimize 12%
  • Identify and troubleshoot hardware faults (e.g., GPU, fan, network card). 
  • Identify faulty cards, GPUs, and power supplies. 
  • Replace faulty cards, GPUs, and power supplies. 
  • Execute performance optimization for AMD and Intel servers. 
  • Optimize storage.
Official Information https://www.nvidia.com/en-us/learn/certification/ai-infrastructure-professional/