Senior DevOps Engineer - HPC



Software Engineering
Multiple locations
Posted on Tuesday, June 4, 2024

NVIDIA is hiring a Senior DevOps Engineer to work on our HPC compiler team. We are building optimizing compilers that enable breakthrough science in an array of applications including weather forecasting, high-energy physics, computational fluid dynamics, materials science, life sciences, astrophysics, mechanical engineering and related fields. You will part of the team creating the infrastructure that builds, tests, packages, and delivers high quality software products.

What you'll be doing:

  • Evaluating, identifying, and driving solutions to extend and optimize infrastructure for the development, build, test, integration, and release of NVIDIA compilers

  • Designing and implementing infrastructure efficiency and usability improvements

  • Driving automation to monitor and gain insight into infrastructure, testing, and system health

  • Improving code quality and software architecture through code and design reviews

  • Collaborating with your peers and leadership to design, prioritize, plan, and implement systems that solve business needs

What we need to see:

  • A B.S. or M.S. in a computer science, computer engineering, or related field or equivalent experience

  • 8+ years of relevant DevOps infrastructure experience

  • Fluency in Python and Bash

  • Expert in Linux fundamentals

  • Experience architecting, setting up, and maintaining CI/CD workflows and infrastructure with Jenkins or other CI/CD software

  • Familiar with containers, provisioning, and HPC scheduling tools (e.g., Docker, Ansible, Slurm)

  • Highly motivated with excellent programming and problem-solving capabilities. You are passionate and curious about new technologies. You take pride in your work and strive to achieve incredible results and possess excellent communication, planning, and leadership skills.

Ways to stand out from a crowd:

  • Background in HPC systems, applications, or compiler development (e.g., LLVM)

  • Experience tuning HPC systems for performance benchmarking

  • Proficiency in version control systems (e.g. Perforce, Git / Gitlab) and SQL

  • Experience designing and maintaining complex software infrastructure for multi-platform environments

  • Team lead or team management experience

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 164,000 USD - 327,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.