Senior Software Architect - Data Center Platform Simulation and Virtualization

NVIDIA

Software Engineering, IT
Multiple locations
Posted on Tuesday, July 2, 2024

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring a Software Architect who will help build and own the system architecture of simulators for our DGX and HGX server platforms. Simulations play a significant role in building scalable systems at the Speed of Light! You will work with world-class engineering teams across HW and SW and engage with industry-leading cloud service providers on taking these products to market.

What you’ll be doing:

  • Drive requirements, architecture, and roadmap of NVIDIA DGX Simulation platforms.

  • Engage with major customers to understand their requirements and work to align with their roadmap and adoption strategy.

  • Work closely with hardware modeling, kernel & platform driver teams distributed globally.

  • Build and deliver a full server simulation platform to internal and external NVIDIA partners.

  • Mentor architects and engineering teams to grow them into future leaders.

  • Make key technical decisions even when faced with ambiguity, and mitigate execution risks by following a left shift strategy.

What we need to see:

  • BS degree or higher (Computer Science or related) or equivalent experience.

  • 10+ years of relevant experience in virtualization and HW simulation/emulation technologies.

  • Proven experience in designing architecture for scalable and performant server systems, particularly at the SW/HW interface.

  • Previous experience with hardware interfaces such as PCIe, SPI, and I3C, and with Linux boot solutions on x86 and ARM class platforms.

  • Good understanding of hypervisors and HW emulators such as QEMU, KVM, VDK, and Simics.

  • Experience with out-of-band and in-band management architectures.

  • Proficiency in C/C++ with strong software development, optimization, and user and kernel mode debugging skills.

  • Strong interpersonal & communication skills to work with a globally distributed engineering team.

Ways to stand out from the crowd:

  • Experience building a left shift strategy for HW and SW stack bring-up using simulators and emulators.

  • Contributions to QEMU/KVM open-source repositories.

  • Experience with Verilog and SystemC.

  • Knowledge of device management protocols such as MCTP, PLDM, and RDE.

  • Knowledge of system management protocols such as Redfish and IPMI.

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.