Senior Software Test Development Engineer - Deep Learning
NVIDIA
We are looking for a highly experienced AI Senior Software Test Development Engineer for NVIDIA’s Deep Learning SWQA team. The position is in the NVIDIA Deep Learning and AI Software Quality Assurance team, which defines, develops, and performs tests to validate the robustness and measure the performance of NVIDIA’s Deep Learning software and GPU infrastructure for autonomous driving, healthcare, speech recognition, natural language processing, and a wide variety of other AI scenarios. We collaborate with multiple AI product teams to develop new products, derive and improve complex test plans, and improve our workflow processes for a diverse range of GPU computing platforms. You should thrive on being in the critical path supporting developers working for billion-dollar business lines, and intimately understand the values of responsiveness, thoroughness, and collaboration. You should constantly champion and implement efficiency improvements across your domain. Join the team that is building software that will be used by the entire world!
What you’ll be doing:
Work closely with global multi-functional teams to understand the test requirements and take ownership of product quality.
Lead the validation of several AI products.
Plan, design, implement, and automate test plans and test cases, and report on test results.
Manage the bug lifecycle and work with other groups toward solutions.
Spearhead improvements to our test automation infrastructure; orchestrate stronger testing processes for improved efficiency and accuracy through rapid automation, failure analysis, and improved reporting.
Architect and lead efficiency initiatives.
Reproduce customer issues in-house and verify fixes.
What we need to see:
BS or higher degree in CS/EE/CE or equivalent experience.
10+ years of software quality assurance or test automation background with knowledge of test infrastructure and strong analysis skills.
Knowledge of scripting languages (Python, Perl, Bash) and UNIX/Linux experience.
Solid user and development experience with virtualization technologies such as VMs and Docker containers.
Familiarity with several automation frameworks such as PyTest, Robot Framework, or Selenium; proficiency in programming with Python for test automation, data analysis, and reporting.
5+ years of working knowledge of any Deep Learning framework (PyTorch, TensorFlow, etc.) and models (Llama 3, ResNet, etc.), especially in end-to-end customer scenarios.
Experience in building models and AI-based infrastructure to improve test automation.
Able to balance conflicting or changing priorities and maintain a positive attitude under ambitious and dynamic schedules.
Excellent English written and oral communication skills.
Ways to stand out from the crowd:
Familiarity with NVIDIA GPU hardware products (Tesla, Tegra, DGX, etc.).
Working knowledge of NVIDIA GPU Computing (CUDA) and CUDA libraries for Deep Learning.
Experience in validating Data Center GPU-based infrastructure (multi-GPU, multi-node, cluster).
Background in validating fault-tolerance infrastructure.
Experience with VectorCAST, Bullseye, Gcov, or Coverity tools.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.