Pre-Summer Sale Special - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: sntaclus

What is a direct benefit of using GPUDirect RDMA for multi-server workloads?

A.

Raises GPU base memory clock speeds.

B.

Offloads data movement from CPUs.

C.

Allows CPUs to prioritize scheduling.

D.

Compresses transferred data.

What NVIDIA tool should a data center administrator use to monitor NVIDIA GPUs?

A.

NVIDIA System Monitor

B.

NetQ

C.

DCGM

How many 1 Gb Ethernet in-band network connections are in a DGX H100 system?

A.

1

B.

2

C.

0

In training and inference architecture requirements, what is the main difference between training and inference?

A.

Training requires real-time processing, while inference requires large amounts of data.

B.

Training requires large amounts of data, while inference requires real-time processing.

C.

Training and inference both require large amounts of data.

D.

Training and inference both require real-time processing.

What is a key benefit of using NVIDIA GPUDirect RDMA in an AI environment?

A.

It increases the power efficiency and thermal management of GPUs.

B.

It reduces the latency and bandwidth overhead of remote memory access between GPUs.

C.

It enables faster data transfers between GPUs and CPUs without involving the operating system.

D.

It allows multiple GPUs to share the same memory space without any synchronization.

A company is implementing a new network architecture and needs to consider the requirements and considerations for training and inference. Which of the following statements is true about training and inference architecture?

A.

Training architecture and inference architecture have the same requirements and considerations.

B.

Training architecture is only concerned with hardware requirements, while inference architecture is only concerned with software requirements.

C.

Training architecture is focused on optimizing performance while inference architecture is focused on reducing latency.

D.

Training architecture and inference architecture cannot be the same.

What is the critical difference between Slurm and Kubernetes in AI infrastructure? Pick the 2 correct responses below.

A.

Slurm provides full replacement for cluster-wide container orchestration, service discovery, and management of long-running microservices.

B.

Slurm schedules queued batch and HPC workloads onto available compute resources using job queues and policies.

C.

Both platforms are limited to basic job status monitoring for running workloads and provide no additional orchestration capabilities.

D.

Kubernetes focuses only on per-node resource allocation for individual batch jobs without managing distributed services or containers.

Which solution should be recommended to support real-time collaboration and rendering among a team?

A.

A cluster of servers with NVIDIA T4 GPUs in each server.

B.

A DGX SuperPOD.

C.

An NVIDIA Certified Server with RTX-based GPUs.

Which NVIDIA tool aids data center monitoring and management?

A.

Mellanox Insight

B.

TensorRT

C.

Clara

D.

DCGM

What is the maximum number of MIG instances that an H100 GPU provides?

A.

7

B.

8

C.

4