Staff IT SRE Engineer

NVIDIA

NVIDIA

IT
Bengaluru, Karnataka, India
Posted on Wednesday, October 25, 2023

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. NVIDIA is looking for an IT SW SRE Engineer, to join the IT team to support the R&D & Manufacturing activities. This job role will design, build, maintain, supervise and lead large scale production systems with high efficiency and availability using the combination of solutions, software and systems engineering practices on emphasizing on Manufacturing sites. This is a highly specialized subject area that demands knowledge across different systems, networking, coding, database, capacity management, continuous delivery, and deployment, and opensource cloud-enabling technologies like Kubernetes. Also, this job role will require to be a tech savvy, highly experienced in Linux platform with all the components that comes along with it.

We are looking for an IT SRE Engineer, SW for our IT Engineering team working out from India or Vietnam. As part of this team, you will be involved in exciting technical challenges by analyzing, solving, and designing vital services, platforms, and automations while always thinking about reliability, scalability, resilience, security, and performance. Also, be a part of the team responsible for helping to support uptime and availability of production critically important on-premises & cloud services distributed across multiple regions. You'll help to create more consistent, automated push button environments across all tiers, proactively test and tune all aspects of the infrastructure, streamline CI/CD processes, supervise, and respond to system notifications and alerts and continually work to pptimize and improve the performance, security, and reliability of our systems. Are you ready for this challenge?

What you'll be doing:

  • Work closely with other software engineers within the organization to identify and implement, build & packaging infrastructure requirements, automated tests to accelerate development for Autonomous Vehicles

  • Design and build high-end architectures and sophisticated solutions emphasis on Manufacturing Sites.

  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health, including pro-active actions based on reports and deep data analysis.

  • Able to address complicated, cross platform issues handling OS, storage, networking, database on-premises or in a cloud-based IaaS/PaaS/SaaS environment and handle live production incidents, debug/solve application, and infrastructure issues, follow and implement SRE standard methodologies.

  • Keep up-to date with security and proactively identify, diagnose, and solve sophisticated security issues.

What we need to see:

  • Proven experience mainly on automation of Infrastructure configuration and management as DevOps Engineer

  • Demonstrable experience in Containerization-Docker and orchestration (Kubernetes) – Required

  • Expert with large Scale project management and matrix management and the ability to handle several projects in parallel effectively, prioritize and implement in a fast-paced environment.

  • Experience with building and designing automations with standard methodology tools in the market.

  • Background with Infrastructure as a Code (Salt Stack, Puppet, Terraform, Ansible)

  • Experience with Cloud-based platforms (AWS, Google, Azure)

  • Proficient in building and managing highly available and scalable IT infrastructure, with knowledge on Docker/Virtualization, GIT, Gerrit, Perforce, Continuous Delivery, Continuous Monitoring, etc.

  • Should have the ability to communicate both verbally and in writing with users, vendors and management.

  • B.Sc. in an engineering field and 7+ years of experience

  • Highly experienced in Linux platform

Ways to stand out from the crowd:

  • Systematic problem-solving approach, coupled with strong interpersonal skills and a sense of ownership and drive.

  • Ability to debug and optimize code and automate routine tasks.

  • Well familiar with ITIL Module.

  • Experienced with Problem solving and decision making.

  • Deep understanding of Software Configuration Management (SCM) processes and tools such as Perforce, Git, Subversion, multi-site development

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.