Senior Deep Learning Software Engineer, Algorithmic Model Optimization
We are now looking for a Senior Deep Learning Software Engineer for Algorithmic Model Optimization!
Join our team of algorithmic model optimization experts and take part in unlocking the greatest potential for AI with generative models such as large language models (LLMs) and diffusion models. As a Senior Deep Learning Software Engineer, you will be at the forefront of pushing the boundaries of these models and enabling their deployment at a larger scale with unmatched efficiency. We are developing an innovative software platform that will not only be used internally but also have a significant impact externally by enabling the creation of groundbreaking AI products. This is an exceptional opportunity for passionate software engineers like you, who have a strong background in Deep Learning, to join us in solving the most significant challenges in the field.
Your role will be pivotal in our mission to maximize the potential of our rapidly expanding data center deployments. Additionally, you will play a crucial part in adopting a data-driven approach to hardware design and system software development. Collaboration is at the heart of what we do, and you will have the chance to work closely with a diverse range of teams at NVIDIA, including the Applied Deep Learning Research teams, CUDA Kernel and DL Framework development teams, and the Silicon Architecture Team. In this position, you will actively engage with internal stakeholders, users, and members of the open-source community. Your input will be vital in defining and implementing cutting-edge model optimization algorithms. The scope of your work will encompass researching and developing highly efficient search algorithms, defining public APIs, implementation, and various other software engineering tasks. We are seeking individuals who are as enthusiastic as we are about pushing the boundaries of AI and contributing to groundbreaking advancements in the field. If you are passionate about innovation, tackling complex DL problems, and working in a collaborative environment, this is the perfect opportunity for you. Join us, and together, we will shape the future of AI model optimization and its impact on the world.
What you’ll be doing:
Prototype and develop model optimization methods, and build a highly impactful model optimization platform
Collaborate with internal and external partners to accelerate the adoption of deep learning model optimization
Stay up to date with the latest research and innovations in generative AI and model optimization techniques
Analyze and optimize the theoretical and practical performance of the generated DL models
Publish findings at top AI conferences and create intellectual property
What we need to see:
Master's, PhD, or equivalent experience in Computer Science, AI, Applied Math, or a related field
6+ years of relevant work or research experience in Deep Learning
Excellent software design skills, including debugging, performance analysis, and test design
Strong algorithms and programming fundamentals
Ability to work independently, define project goals and scope, and run your own development effort
Good communication, documentation habits, and interpersonal skills
Experience with one or more of the following: Python, C++, performance tuning
Ways to stand out from the crowd:
Contributions to PyTorch, JAX, or other machine learning frameworks
Knowledge of GPU architecture and the compilation stack, and the ability to understand and debug end-to-end performance
Familiarity with NVIDIA’s deep learning SDKs, such as TensorRT
Strong understanding of deep learning algorithms and solutions
Strong understanding of ML model optimization techniques such as quantization, pruning, and distillation
Increasingly known as “the AI computing company” and widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Are you creative and motivated, and do you love a challenge? If so, we want to hear from you! Come join our model optimization group, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field.
The base salary range is $176,000 - $333,500. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.