HPC applications require computing power that can perform an enormous amount of calculations per second. Increasing the compute density of each server node dramatically reduces the number of servers required, resulting in huge savings in cost, power, and space consumed in the data center. For HPC simulations, high-dimension matrix multiplication requires a processor to fetch data from many neighbors for computation, making GPUs connected by NVLink ideal. A single NVIDIA HGX A100 4-GPU server replaces over 100 CPU-based servers running the same scientific applications.