DL Computing Performance Architect

NVIDIA

NVIDIA

IT
Shanghai, China
Posted on Jul 29, 2024

NVIDIA is developing processor and system architectures that accelerate machine learning, automotive and high performance computing (HPC) applications. We are looking for a technical expert to lead our DL performance projections and analysis effort. This position offers the opportunity to make a meaningful impact in a fast-moving, technology focused company.

What you'll be doing:

  • Establish DL applications and use-cases for analysis and projections.

  • Specify hardware/software configurations and metrics to analyze performance, power, accuracy and resiliency in uniprocessor and multiprocessor configurations

  • Create and maintain workloads and micro-benchmark suites.

  • Generate projections, comparisons and analysis reports for internal/external consumption.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software and product teams.

What we need to see:

  • 4+ years working experience on relevant industy.

  • Strong software skills with C/C++, Python, MPI, OpenMP etc.

  • Experience of DL workload and operator optimization and performance analysis will be a plus.

  • Familiarity with GPU computing and parallel programming models will be a plus.

  • Excellent oral and written communication skills.

  • Good organizational, time management and task prioritization skills.