Deep Learning Performance Architect Intern



Shanghai, China
Posted on Thursday, June 27, 2024

NVIDIA is developing processor and system architectures that accelerate a wide range of deep learning applications. We are looking for an expert deep learning system performance architect to join our AI performance projection and analysis efforts. In this position, you will work on performance projection, analysis, and optimization of various AI workloads on state-of-the-art hardware architectures. You will contribute to our dynamic, technology-focused company.

What you'll be doing:

  • Analyze state-of-the-art AI models on various GPU hardware platforms (e.g., client platforms such as desktops and laptops, and SoCs)

  • Identify performance bottlenecks and propose optimizations

  • Analyze the performance of DL workloads (e.g., LLMs)

What we need to see:

  • BS, MS, or PhD student in a relevant discipline (CS, EE, Math, etc.)

  • Experience with popular AI models (e.g., LLMs and AIGC models)

  • Familiarity with typical deep learning software frameworks (e.g., Torch/JAX/TensorFlow)

  • Knowledge of and experience with hardware architectures for deep learning applications