Four layersone stack#
GPU at the bottom, twin at the top, building and cluster between. Each layer is a published NVIDIA product. The novelty is running them as one coupled control surface.
DCGM is table stakes. Metropolis applied to a data centre as a smart building is more interesting. DSX Blueprint extended to the thermal coupling layer is genuinely new.
The four-layer stack is built from existing NVIDIA platform products. At the GPU layer, DCGM and NVML provide telemetry for power, temperature, utilisation, and per-tenant attribution. At the cluster layer, Mission Control handles fleet orchestration, GPU lifecycle, workload scheduling, and observability across pods. At the building layer, Metropolis treats the facility as a managed asset, fusing sensor data from BMS, leak detection, ingress monitoring, and security cameras. At the twin layer, Omniverse with DSX Blueprint models the data centre's physical state as a real-time simulation.
The bridge between the IT and facility worlds runs on Jetson and IGX Orin at the cabinet edge. These read facility sensors, push BMS state up into Metropolis, and execute control loops back down into chillers, valves, and CDU setpoints. Every cabinet is a Jetson node. The fleet is observable and controllable from one place.
What makes this architectural rather than incremental: the twin layer simulates physical processes that the published DSX Blueprint reference does not currently cover. Heat flow into a host digester loop. Biogas yield response to thermal demand. PHE saturation under workload shifts. Dry-cooler engagement under host-loop drop-off. Building this extension is the white-paper opportunity.