Senior Performance Analysis Engineer
We are seeking a talented and experienced performance analysis engineer to join our team. As a Performance Engineer focused on AI applications, you will play a crucial role in optimizing the performance of our Spectrum-X platform, the first networking platform for AI. Powered by the tight coupling of the NVIDIA Spectrum™-4 Ethernet switch and the NVIDIA® BlueField®-3 data processing unit (DPU), Spectrum-X delivers the highest performance for AI, machine learning, and natural language processing, as well as diverse industry applications.
What you will be doing:
Network Performance Analysis: Conduct in-depth analysis of network performance, including latency, throughput, and packet loss, using various monitoring tools and techniques. Identify bottlenecks and areas for improvement in the network infrastructure.
Performance Optimization: Develop strategies and implement solutions to optimize the performance of the network infrastructure which includes Switch, DPU and SW. Collaborate with cross-functional teams, including architects, AI engineers and system administrators, to implement and test performance-enhancing configurations.
Network Monitoring and Testing: Deploy and maintain network monitoring tools to continuously monitor network performance and proactively identify potential issues. Develop and execute performance testing methodologies to assess the impact of Product changes on AI application performance.
Troubleshooting and Issue Resolution: Investigate and resolve networking-related issues that impact AI application performance. Stay Abreast of Emerging Technologies: Keep up-to-date with the latest benchmarks, networking technologies, industry trends, and best practices related to AI application performance. Evaluate and recommend new tools, methodologies, and technologies that can enhance the efficiency and effectiveness of network performance optimization.
What we need to see:
Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
Solid experience in performance engineering, with a focus on AI applications.
8+ years of experience
Proficiency in network protocols, including TCP/IP, UDP, HTTP, and RDMA.
Experience with software design and development
Experience with network performance testing tools
Strong analytical and problem-solving skills to identify and resolve SW and HW performance issues.
Excellent communication skills, with the ability to collaborate effectively with cross-functional teams and present complex concepts to both technical and non-technical stakeholders.
Proactive and self-motivated, with the ability to work independently and prioritize tasks effectively.
Ways to stand out from the crowd:
Previous experience as a performance engineer
Strong Python and scripting skills
Experience with debugging HW like DPUs GPUs and CPUs
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.