Solutions Architect, Data Center MEP

NVIDIA

NVIDIA

IT
Multiple locations
Posted on Dec 23, 2024

NVIDIA is seeking a Senior MEP Engineer to join its Infrastructure Specialists team. Academic and commercial groups worldwide are using NVIDIA products to redefine deep learning, data analytics, and power data centers. Join the team building many of the world's largest and fastest data centers and supercomputers! NVIDIA is looking for someone who can lead planning and deployments of AI data centers focusing on data center infrastructure including power/cooling systems, telemetry and control systems, design and construction processes etc.

As the NVIS Solutions Architect for Datacenter Infrastructure, you will focus on supporting customers in the areas of data center planning, design, construction and deployment ensuring the integrity of NVIDIA platform infrastructure. Your primary goal will be to guarantee that all aspects of the data center's physical infrastructure are meticulously planned, implemented, and validated to meet NVIDIA reference architectures, operational requirements, and industry standards. This infrastructure includes architectural systems, power distribution, liquid/air cooling systems, integration of telemetry and control systems across DC layers and all other physical infrastructure. You will be working with product and engineering teams, customers and the partner/provider ecosystem to ensure successful deployments.

What you will be doing:

  • NVIS Data center deployment planning: Collaborate with product and engineering teams to understand NVIDIA’s reference architectures and guidance for data center infrastructure including power distribution, cooling systems, controls and monitoring and network/cabling architecture. Support customers and partners in translating this guidance rapidly into innovative and reliable data centers.

  • Design and construction oversight: Review and evaluate customers' and partners' infrastructure design plans, ensuring consistency with NVIDIA reference architecture, industry standards, and regulatory requirements. Provide feedback and recommendations to improve performance, scalability, and cost-effectiveness.

  • Pre-deployment audits and planning: Develop and implement comprehensive audit plans to assess data center infrastructure components' operational efficiency, reliability, and readiness prior to deployment of AI/HPC clusters. Conduct pre-deployment audits to identify potential issues, risks, and areas for improvement.

  • Partner ecosystem: Develop and sustain a strong ecosystem of service providers and partners as needed, to ensure customers’ can deploy NVIDIA solutions rapidly and reliably.

  • Be the key liaison for customers and partners on matters of data center infrastructure.

  • Act as the NVIS mentor providing guidance, mentorship, and support to ensure the team's success in their respective roles.

  • Quality Assurance: Establish and enforce quality assurance processes to verify that deployments meet established specifications and performance benchmarks. Conduct thorough bring-up, testing, and commissioning to validate the functionality and reliability of infrastructure components.

  • Continuous Improvement: Drive continuous improvement initiatives to enhance data center infrastructure reliability, resilience, and sustainability. Find opportunities to streamline processes, automate repetitive tasks, and leverage emerging technologies to optimize infrastructure operations.

  • Collaboration and Communication: Collaborate and communicate across internal teams, external vendors, and customers to facilitate the seamless integration of data center infrastructure solutions. Serve as a domain expert and point of contact for infrastructure-related inquiries and blocking issues.

What we need to see:

  • Bachelor's degree in Engineering, Computer Science, Information Technology, or a related field. Advanced degree or equivalent experience or relevant certifications preferred.

  • 8+ years of overall experience in enterprise and/or hyperscale data centers with focus on design and construction, preferably for sophisticated, high density AI/HPC data centers.

  • Validated experience in data center engineering, operations, or infrastructure management roles, focusing on large-scale data center deployments.

  • Strong technical knowledge and experience in data center systems - power distribution, liquid cooling, rack/server chassis and cabling

  • Demonstrated technical and project leadership under fluid situations.

  • Excellent analytical, problem-solving, and decision-making skills, keen attention to detail, and a commitment to quality.

  • Effective communication and interpersonal skills, with the ability to interact expertly with diverse collaborators including customers and facilitate productive discussions.

  • Organization & Time Management – able to plan, schedule, and coordinate tasks related to the job to achieve goals within or ahead of established time frames.

  • Willingness to travel (40%).

Relevant certifications

  • BICSI

  • CNCDP

  • FOA Data center - CFOSDC - certified fabrics optics specialist, Datacenter

  • Cnci – certifies network cable installer.

Way to stand out from the crowd:

  • Experience in data center operations process, safety, and security measures.

  • Solid understanding of whole data center Infrastructure stack

  • Outstanding interpersonal skills.

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.